深度学习笔记

时间：2016-10-27 12:44:19 阅读：110 评论：0 收藏：0 [点我收藏+]

Assume the output from a layer in CNN is N × N × d dimension, which is the output of d filters for N × N spatial cells. Each spatial cell is computed from a receptive field in the input image.

The receptive fields of all the spatial cells in the input image can highly overlap with each other. The size of one receptive field can be computed layer by layer in CNN. In a convolution (pooling) layer, if the filter (pooling) size is a×a and the stride is s, then T ×T cells in the output of this layer corresponds to [s*(T ? 1) + a] × [s*(T ? 1) + a] cells in the input of this layer. For example, one cell in the CONV5 (the 5th convolutional)layer of CNN model (imagenet-vgg-m) [40] corresponds to a 139 × 139 receptive field in the 224 × 224 input image (cf. Fig. 4).

深度学习笔记

原文：http://www.cnblogs.com/linkboy1980/p/6003152.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年09月23日 (328)
2021年09月24日 (313)
2021年09月17日 (191)
2021年09月15日 (369)
2021年09月16日 (411)
2021年09月13日 (439)
2021年09月11日 (398)
2021年09月12日 (393)
2021年09月10日 (160)
2021年09月08日 (222)