Chuhui Xue——【arxiv2019】MSR_Multi-Scale Shape Regression for Scene Text Detection
Chuhui Xue, Shijian Lu, Wei Zhang
针对任意文字检测(水平、倾斜、曲文),通过网络来regress文字的边界像素点来得到text region。
整个检测的流程包括:
Fig. 1: Scene text detection using the proposed multi-scale shape regression network (MSR): For scene texts with arbitrary orientations and shapes in (a), MSR first predicts dense text boundary points (in red color) as shown in (b) and then locates texts by a polygon (in green color) that encloses all boundary points of each text instance as shown in (c).
Fig. 3: Structure of proposed multi-scale network (for two-scale case): Features extracted from layers Conv2 - Conv5 of two network channels are fused, where features of the same scale are fused by a Concat UpConv as illustrated and features from the deepest layer of the lower-scale channel are up-sampled to the scale of the previous layer for fusion.
Fig. 4: Illustration of ground-truth generation: Given a text annotation polygon in (a), triangulation is performed over the polygon vertices to locate the vertices (green points in (b)) of the central text region in blue color in (c). For each centraltext-region pixel tp (in blue color in (d)), the nearest point on the text annotation box b p in yellow color is determined as the nearest text boundary point as shown in (d), and the distance between t p and bp is used to generate ground-truth distance maps as shown in (e) and (f)
损失函数
点分类(Dice coefficient)
最近boundary point的dx、dy回归(Smooth_L1)
ICDAR13
MSRA-TD500
CTW1500
Total-Text
论文速读(Chuhui Xue——【arxiv2019】MSR_Multi-Scale Shape Regression for Scene Text Detection)
原文:https://www.cnblogs.com/lillylin/p/10408880.html