deeplearning.ai - Convolutional Neural Networks
Andrew Ng
Computer Vision Problems
- Image Classification
- Object Detection
- Neural Style Transfer
Vertical edge detection
- filter (usually of odd size, f × f)
- edge detection via the convolution operation, denoted by *
- a vertical edge shows up as bright pixels on the left and dark pixels on the right
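The vertical edge example can be sketched in NumPy (a minimal implementation of my own; `conv2d_valid` is not a library function):

```python
import numpy as np

def conv2d_valid(image, filt):
    """Valid 2-D 'convolution' (no flip, i.e. cross-correlation)."""
    n_h, n_w = image.shape
    f = filt.shape[0]
    out = np.zeros((n_h - f + 1, n_w - f + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + f, j:j + f] * filt)
    return out

# 6x6 toy image: bright (10) on the left half, dark (0) on the right half
image = np.array([[10, 10, 10, 0, 0, 0]] * 6, dtype=float)

# 3x3 vertical edge detection filter
vertical = np.array([[1, 0, -1],
                     [1, 0, -1],
                     [1, 0, -1]], dtype=float)

out = conv2d_valid(image, vertical)
# The 4x4 output is large (30) in its middle columns, exactly where the
# bright-to-dark vertical edge sits, and 0 elsewhere.
```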
Padding
- two downsides of plain convolution:
- the output shrinks: an n × n image convolved with an f × f filter gives an (n - f + 1) × (n - f + 1) output
- pixels in the corners are used only once, so we lose information near the edges of the image
- the fix for both problems: padding
- pad with an additional border of p pixels all around the edges
- pad with zeros by convention
- so the output becomes (n + 2p - f + 1) × (n + 2p - f + 1)
Valid Convolution: No paddings (p = 0)
Same Convolution: Pad so that output size is the same as the input size
- f is usually odd
- so that Same Convolution uses a whole-number padding, p = (f - 1)/2
- and the filter has a central pixel
Strided Convolution
- stride s: the number of positions the filter shifts at each step
- output size: ⌊(n + 2p - f)/s⌋ + 1 in each dimension
- the filter must lie entirely within the image (plus the padding region)
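The output-size rule can be checked with a one-line helper (the function name is my own):

```python
from math import floor

def conv_output_size(n, f, p=0, s=1):
    """Side length of the output: floor((n + 2p - f) / s) + 1."""
    return floor((n + 2 * p - f) / s) + 1

conv_output_size(6, 3)             # valid convolution: 4
conv_output_size(6, 3, p=1)        # same convolution:  6
conv_output_size(7, 3, p=1, s=2)   # stride 2:          4
```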
Cross-correlation vs. convolution
- in mathematics, (true) convolution first flips the filter both horizontally and vertically (a 180° rotation) before the element-wise computation
- in ML we usually skip the flipping; strictly speaking the operation is then cross-correlation, but by convention it is still called convolution
- with the flip, convolution satisfies associativity
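To illustrate the difference (a toy example of my own): true convolution rotates the filter 180° before sliding it, which is a flip along both axes:

```python
import numpy as np

k = np.array([[1, 2],
              [3, 4]])

# True (mathematical) convolution uses the filter flipped in both
# dimensions; deep-learning "convolution" uses k as-is (cross-correlation).
# With a symmetric filter the two operations coincide.
flipped = np.flip(k)   # flips along every axis: [[4, 3], [2, 1]]
```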
Convolution on RGB images
height × width × channels (depth)
- the image and the filter must have the same number of channels
- n_c: number of channels; n_c': number of filters, which becomes the number of channels of the output
- using multiple filters lets one layer detect multiple features
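A sketch of convolution over volumes, checking the shapes stated above (`conv_volume` is my own helper, not a library call):

```python
import numpy as np

def conv_volume(image, filters):
    """image: (n, n, n_c); filters: (n_f, f, f, n_c). Valid, stride 1.
    Each filter spans all input channels and produces one output channel."""
    n, _, n_c = image.shape
    n_f, f, _, f_c = filters.shape
    assert f_c == n_c, "image and filter channel counts must match"
    m = n - f + 1
    out = np.zeros((m, m, n_f))
    for k in range(n_f):
        for i in range(m):
            for j in range(m):
                out[i, j, k] = np.sum(image[i:i + f, j:j + f, :] * filters[k])
    return out

rgb = np.random.rand(6, 6, 3)          # n = 6, n_c = 3
filters = np.random.rand(2, 3, 3, 3)   # n_c' = 2 filters, each 3x3x3
conv_volume(rgb, filters).shape        # (4, 4, 2): one channel per filter
```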
Example of a layer
- add a bias to the output, then apply a non-linearity (e.g. ReLU)
- relatively few parameters, so less prone to overfitting
- the output of the previous layer is the input to this layer
- notation for layer l: f^[l] filter size, p^[l] padding, s^[l] stride, n_c^[l] number of filters
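One layer's forward pass as a sketch (convolve, add one bias per filter, apply ReLU); the sizes and names below are my own illustration. Note the parameter count is independent of the image size:

```python
import numpy as np

def conv_layer(a_prev, W, b):
    """a_prev: (n, n, n_c); W: (n_f, f, f, n_c); b: (n_f,).
    z = conv(a_prev, W) + b, then a = ReLU(z). Valid, stride 1."""
    n, _, n_c = a_prev.shape
    n_f, f = W.shape[0], W.shape[1]
    m = n - f + 1
    z = np.zeros((m, m, n_f))
    for k in range(n_f):
        for i in range(m):
            for j in range(m):
                z[i, j, k] = np.sum(a_prev[i:i + f, j:j + f, :] * W[k]) + b[k]
    return np.maximum(z, 0)   # ReLU non-linearity

W = np.random.randn(10, 3, 3, 3)   # 10 filters of size 3x3x3
b = np.random.randn(10)
a = conv_layer(np.random.randn(32, 32, 3), W, b)   # shape (30, 30, 10)

# 10 * (3*3*3 + 1) = 280 parameters, no matter how large the input is.
```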
A simple convolution network example
- Convolutional layer (Conv)
- Pooling layer (Pool)
- Fully connected layer (FC)
Pooling layer
- reduces the size of the representation to speed up computation and makes some of the detected features a bit more robust
- no parameters to learn: just a fixed function with no weights
- finally, the pooled result is flattened into a column vector
Max pooling
- break into different regions
- output the maximum of each region
- hyperparameters: f (filter size) and s (stride), often f = 2, s = 2
- usually no padding is used (p = 0)
- a feature extracted in a region is preserved in the output
- if the feature is detected anywhere in the pooling window, a high number is kept
- max pooling is computed independently on each channel, so the number of channels is unchanged
Average pooling
- output the average of each region
- max pooling is used much more often than average pooling
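Both pooling operations as a minimal sketch with the common f = 2, s = 2 setting (the `pool` helper is my own); each channel is pooled independently:

```python
import numpy as np

def pool(a, f=2, s=2, mode="max"):
    """a: (n_h, n_w, n_c). No padding; each channel pooled independently."""
    n_h, n_w, n_c = a.shape
    out_h, out_w = (n_h - f) // s + 1, (n_w - f) // s + 1
    reduce = np.max if mode == "max" else np.mean
    out = np.zeros((out_h, out_w, n_c))
    for i in range(out_h):
        for j in range(out_w):
            for c in range(n_c):
                out[i, j, c] = reduce(a[i * s:i * s + f, j * s:j * s + f, c])
    return out

a = np.arange(16, dtype=float).reshape(4, 4, 1)
pool(a, mode="max")[..., 0]   # [[ 5.,  7.], [13., 15.]]
pool(a, mode="avg")[..., 0]   # [[ 2.5,  4.5], [10.5, 12.5]]
```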
Neural network example
digit recognition
- each pooling step halves the input's height and width
- two conventions: a conv layer and a pooling layer counted together as one layer, or counted as two separate layers
- when counting a network's layers, usually only layers with weights are counted
- the flattened pooling output is fully connected to every unit of the FC layer
- don't invent your own hyperparameter settings; look at what has worked in the literature
- as the network gets deeper, height and width decrease while the number of channels increases
Why convolution
- parameter sharing and sparsity of connections
- parameter sharing: a feature detector useful in one part of the image is applied across the whole image
- sparsity of connections: each output value depends on only a small subset of the inputs
- good at capturing translation invariance
- even if the image shifts by a few pixels, it still has features similar to the original
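The arithmetic behind these advantages, for an illustrative layer mapping a 32×32×3 input to a 28×28×6 output with 5×5 filters (sizes chosen by me for illustration):

```python
# Fully connected alternative: every input unit wired to every output unit
n_in = 32 * 32 * 3                 # 3072 input units
n_out = 28 * 28 * 6                # 4704 output units
fc_params = n_in * n_out + n_out   # about 14.5 million parameters

# Convolutional layer: 6 filters of size 5x5x3, one bias each
conv_params = 6 * (5 * 5 * 3 + 1)  # 456 parameters
```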