Группа авторов

Computational Statistics in Data Science


Скачать книгу

Max Pooling and Average Pooling. The Max Pooling layer returns the maximum value from the portion of the image covered by the kernel matrix. The Average Pooling layer returns the average of all values covered by the kernel matrix. The convolution and pooling process can be repeated by adding additional convolutional and pooling layers. Deep convolutional networks have been successfully trained and used in image classification problems.

stat08316fgz002
.

      4.2 Convolutional Layer

      4.3 LeNet‐5

stat08316fgz003
.

stat08316fgz004

      LeNet‐5 of LeCun et al. [8].

      Source: Modified from LeCun et al. [8].

      The first layer (C1) is a convolutional layer, which consists of six kernel matrices of size 5 times 5 and stride 1. Each of the kernel matrices will scan over the input image and produce a feature matrix of size 28 times 28. Therefore, six different kernel matrices will produce six different feature matrices. The second layer (S2) is a Max Pooling layer, which takes the 28 times 28 matrices as input. The kernel size of this pooling layer is 2 times 2, and the stride size is 2. Therefore, the outputs of this layer are six 14 times 14 feature matrices.

      Source: LeCun et al. [8].

Indices of output matrices
1 1 5 6 7 10 11 12 13 15 16
2 1 2 6 7 8 11 12 13 14 16
3 1 2 3 7 8 9 12 14 15 16
4 2 3 4 7 8 9 10 13 15 16
5 3 4