日常吐槽

为什么明明英文论文动辄8,9页，十来页的，结果最后看人家翻译成中文，也并不多！！？？我还是看的很吃力？？

【SegNet】 A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

反(上)卷积-反(上)池化-上采样

【SegNet】 A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

反卷积 Deconvolution

如上图的（a）。反卷积操作并不能还原出卷积之前的图片，只能还原出卷积之前图片的尺寸。那么到底反卷积有什么作用呢？通过反卷积可以用来可视化卷积的过程，反卷积在GAN等领域中有着大量的应用。

【SegNet】 A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

反池化 Unpooling

【SegNet】 A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

在池化的过程中，上图中，橙色一个区进行最大池化，绿色一个区进行最大池化。记录最大值以及最大值的位置，在uppooling的过程中，原来的最大值重新出现在了它原来的位置，其他位置给0.

上采样 Unsampling

在第一个图中的b)图，在于UnSampling阶段没有使用MaxPooling时的位置信息，而是直接将内容复制来扩充Feature Map。从图中即可看到两者结果的不同。

DeConvNet

Learning Deconvolution Network for Semantic Segmentation
ICCV2015

FCN的不足

DeConvNet是针对FCN进行了改善。

下图是FCN 的结构

【SegNet】 A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

1）FCN 因为其固定尺寸receptive field只能解决单尺度的semantics ，对于过大过小的目标分割都有可能有问题
the network can handle only a single scale semantics within image due to the fixed-size receptive field. Therefore, the object that is substantially larger or smaller than the receptive field may be fragmented or mislabeled.

【SegNet】 A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

如图，a）图由于bus太大了，挡住了person,bicycle,car的分割。b)图则由于person太小了，根本检测不到2333.

2）FCN 的 deconvolution procedure 太粗糙太简单，FCN 的 deconvolution procedure输入尺寸只有16 × 16，将这个尺寸通过 bilinear interpolation 放大到输入图像尺寸。目标很多细节信息丢失

DeConNet的架构

【SegNet】 A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

DeconvNet 和 SegNet 的结构非常类似，只不过 DeconvNet 在 encoder 和 decoder 之间使用了 FC 层作为中介，用于增强类别的分类。

卷积层：使用VGG-16（去除分类层），把最后分类的全连接层去掉，在适当的层间应用Relu和Maxpooling。增加两个全连接层（1x1卷积）来强化特定类别的投影。
反卷积层：卷积层的镜像，包括一系列的 unpooling，deconvolution，Relu 层
网络输出：概率图，和输入图像大小相同，表明每个像素点属于预定义类别的概率