Data Science

Max pooling creates a more compact representation of an image (or feature map) by downsampling it
A pooling window (the filter) slides over the input, and at each position max pooling outputs the maximum of the values covered by the window
The input is typically a feature map produced by a convolutional filter, where a high value tends to indicate that the filter's feature exists in that region
Taking the max of each region therefore keeps the strongest evidence for the feature in that region
This reduces the height and width of the input, while the number of channels stays the same
As a result, max pooling layers offer the following benefits:
- They tend to reduce overfitting, since they shrink the representation passed to later layers (i.e. accuracy benefit)
- They don't require parameter learning (i.e. speed benefit)
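
As a minimal sketch of the sliding-window max described above, the NumPy snippet below pools a single 2D feature map. The function name `max_pool_2d`, the example values, and the defaults `f=2` and `s=2` are assumptions chosen for illustration; deep learning frameworks ship optimized versions of this operation.

```python
import numpy as np

def max_pool_2d(x, f=2, s=2):
    """Max pool a 2D array with an f x f window and stride s (no padding).

    Hypothetical helper for illustration only.
    """
    n_h, n_w = x.shape
    out_h = (n_h - f) // s + 1
    out_w = (n_w - f) // s + 1
    out = np.empty((out_h, out_w), dtype=x.dtype)
    for i in range(out_h):
        for j in range(out_w):
            # Region of the input covered by the window at this position
            window = x[i * s:i * s + f, j * s:j * s + f]
            out[i, j] = window.max()
    return out

# Made-up feature map: larger values mean a stronger filter response
feature_map = np.array([
    [1, 3, 2, 1],
    [2, 9, 1, 1],
    [1, 3, 2, 3],
    [5, 6, 1, 2],
])

pooled = max_pool_2d(feature_map)
print(pooled)
# [[9 2]
#  [6 3]]
```

Each output entry is the largest value in its 2x2 region, so the 4x4 input shrinks to 2x2 without any learned weights.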

There are two forms of pooling:
- Max pooling
- Average pooling
Average pooling is used far less often than max pooling
It outputs the mean of each region instead of the max, and is sometimes used to reduce the dimensions of an image while attempting to best capture the image's overall properties
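
To make the difference between the two forms concrete, here is a small sketch that runs both over the same input. The helper `pool_2d` and the input values are assumptions for illustration; the only thing that changes between the two calls is the reduction applied to each window.

```python
import numpy as np

def pool_2d(x, f, s, reduce_fn):
    """Apply reduce_fn to each f x f window of x, moving with stride s (no padding).

    Hypothetical helper for illustration only.
    """
    n_h, n_w = x.shape
    out_h = (n_h - f) // s + 1
    out_w = (n_w - f) // s + 1
    out = np.empty((out_h, out_w), dtype=float)
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = reduce_fn(x[i * s:i * s + f, j * s:j * s + f])
    return out

# Made-up input values
x = np.array([
    [0, 0, 4, 4],
    [0, 8, 4, 4],
    [1, 1, 0, 0],
    [1, 1, 0, 2],
], dtype=float)

max_pooled = pool_2d(x, f=2, s=2, reduce_fn=np.max)
avg_pooled = pool_2d(x, f=2, s=2, reduce_fn=np.mean)

print(max_pooled)
# [[8. 4.]
#  [1. 2.]]
print(avg_pooled)
# [[2.  4. ]
#  [1.  0.5]]
```

In the top-left region, the single strong response of 8 survives max pooling but is diluted to 2 by averaging.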

Max pooling doesn't rely on any learnable parameters
Instead, max pooling only relies on the following hyperparameters:
- $f^{[l]}$ : The size of the filter in the $l^{th}$ layer
- $p^{[l]}$ : The amount of padding in the $l^{th}$ layer
- $s^{[l]}$ : The stride in the $l^{th}$ layer
We almost always set $p=0$
The most common choices of hyperparameters are the following:
- $f=2$ and $s=2$ (and $p=0$)
- $f=3$ and $s=2$ (and $p=0$)
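
The two common settings differ in whether neighbouring windows overlap. The sketch below lists the index range each window covers along one axis; the helper `window_ranges` and the input length of 7 are assumptions for illustration.

```python
def window_ranges(n, f, s):
    """Return the (start, end) index range of each pooling window along one axis.

    Hypothetical helper: n is the input length, f the filter size,
    s the stride, and padding is assumed to be 0.
    """
    count = (n - f) // s + 1
    return [(k * s, k * s + f) for k in range(count)]

# f = 2, s = 2: windows tile the input without overlap
print(window_ranges(7, f=2, s=2))  # [(0, 2), (2, 4), (4, 6)]

# f = 3, s = 2: each window shares one index with the next
print(window_ranges(7, f=3, s=2))  # [(0, 3), (2, 5), (4, 7)]
```

With $f=2$ and $s=2$ on a length-7 input, the last index is never covered, which is the effect of the floor in the output-size formula.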
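The input to the pooling layer will have the following dimensions:

$$
n_{h}^{[l]} \times n_{w}^{[l]} \times n_{c}^{[l]}
$$

With $p=0$, the max pooling layer will produce an output with the following dimensions:

$$
\left\lfloor \frac{n_{h}^{[l]}-f^{[l]}}{s^{[l]}} + 1 \right\rfloor \times \left\lfloor \frac{n_{w}^{[l]}-f^{[l]}}{s^{[l]}} + 1 \right\rfloor \times n_{c}^{[l]}
$$

Pooling is applied to each channel independently, so the number of channels $n_{c}^{[l]}$ is unchanged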
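
The output-size expression can be checked with a few lines of Python. This is a sketch assuming $p=0$ and an input at least as large as the filter; the function name `pool_output_shape` and the 28 x 28 x 6 example input are assumptions for illustration.

```python
def pool_output_shape(n_h, n_w, n_c, f, s):
    """Output (height, width, channels) of a pooling layer with p = 0.

    Hypothetical helper; assumes n_h >= f and n_w >= f.
    """
    out_h = (n_h - f) // s + 1  # equals floor((n_h - f) / s + 1)
    out_w = (n_w - f) // s + 1
    return out_h, out_w, n_c    # channels pass through unchanged

print(pool_output_shape(28, 28, 6, f=2, s=2))  # (14, 14, 6)
print(pool_output_shape(28, 28, 6, f=3, s=2))  # (13, 13, 6)
```

With $f=2$ and $s=2$, the height and width are halved; with $f=3$ and $s=2$, the floor drops the fractional part of $25/2$ before adding 1.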
