首页 > 其他 > 详细

CVPR 2016 paper reading (2)

时间:2017-03-01 00:43:02      阅读:164      评论:0      收藏:0      [点我收藏+]

1. Sketch me that shoe, Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Cheng Change Loy, in CVPR 2016.

A unique characteristic of sketches in the context of image retrieval is that they offer inherently fine-grained visual description - a sketch speaks for a ‘hundred‘ words.

技术分享

fine-grained sketch-based image retrieval (SBIR)面临三个挑战:

1) visual comparisons not only need to be fine-grained but also executed cross-domain; (sketch和photo是两个不同的domains)

2) free-hand (finger) sketches are highly abstract, making free-grained matching harder, and most importantly;

3) annotated cross-domain sketch-photo datasets required for training are scarce.

this paper introduces two instance-level SBIR datasets consisting of 1432 sketch-photo pairs in two categories (shoes and chairs), collected by asking participants to finger-sketch an object after observing a photo. Besides, a total of 32,000 ground-truth triplet ranking annotations are provided for both model development and performance evaluation.

技术分享

this paper uses the annotated triplets as supervision to train triplet CNNs. The goal is to learn a feature mapping f that maps photos and sketches to a common feature embedding space, in which photos similar to pariticular sketches are closer than those dissimilar ones.

--> Triplet loss:

技术分享

 实验做得比较周密,考虑了4个步骤的pretrain和fine-tune阶段,并加入了data augmentation(包括stroke removal和stroke deformulation),每一种改进都获得了性能提升。

CVPR 2016 paper reading (2)

原文:http://www.cnblogs.com/wm123/p/6481515.html

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!