Discrete Cosine Transform Spectral Pooling Layers for Convolutional Neural Networks

Pooling operations for convolutional neural networks provide the opportunity to greatly reduce network parameters, leading to faster training time and less data overfitting. Unfortunately, many of the common pooling methods such as max pooling and mean pooling lose information about the data (i.e., they are lossy methods).

Recently, spectral pooling has been utilized to pool data in the spectral domain. By doing so, greater information can be retained with the same network parameter reduction as spatial pooling.

Study the pooling effects on the image below:

We see the effects of max pooling (first row), mean pooling (second row), Discrete Fourier Transform (DFT) pooling (third row), and Discrete Cosine Transform (DCT) pooling (fourth row) when the image is restored. Notice that the bird image is much better retained with the spectral methods.

This project implements pooling using the DCT instead of the DFT given the DCT’s ability to pack more information into fewer spectra. The project compares different DCT pooling and convolution implementations, which is important because the convolution process cannot be done in the DCT domain (as it can be done in the DFT domain).

This work uses a tensor-based convolutional neural network in Matlab (link at bottom of page). The CNN framework is described in the figure below:

This work was presented at the International Conference on Artificial Intelligence and Soft Computing in Zakopane, Poland (June 2018).

https://link.springer.com/chapter/10.1007/978-3-319-91253-0_23

Currently, an updated code base (written for Python Tensorflow) is being prepared for release on Github. My latest project code (Matlab) can be found at https://github.com/jamessealesmith/ConvNetDCT

Check out some more work with this project on the Deep Autoencoder page!

Publications

. Discrete Cosine Transform Spectral Pooling Layers for Convolutional Neural Networks. International Conference on Artificial Intelligence and Soft Computing (ICAISC), 2018.

PDF Project