大连理工大学主页平台管理系统戚金清 Language-aware weak supervision for salient object detection 中文主页

戚金清

副教授硕士生导师
性别：男
毕业院校：东京工业大学
学位：博士
所在单位：信息与通信工程学院
学科：通信与信息系统. 信号与信息处理
电子邮箱：

访问量：

开通时间：..

最后更新时间：..

移动版主页

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Language-aware weak supervision for salient object detection

点击次数：

发布时间：2019-11-04

论文类型：期刊论文

发表时间：2019-12-01

发表刊物：PATTERN RECOGNITION

收录刊物：SCIE、EI

卷号：96

ISSN号：0031-3203

关键字：Saliency detection; Natural language; Textual-visual pairwise; Self-supervision

摘要：Natural Language Processing has achieved remarkable performance in multitudinous computer tasks, but the potential capability of textual information has not been completely explored in visual saliency detection. In this paper, we learn to detect salient object from natural language by addressing the two essential issues: finding a semantic content matching the corresponding linguistic concept and recovering fine details without any pixel-level annotations. We first propose the Feature Matching Network (FMN) to explore the internal relation between the linguistic concept and visual image in the semantic space. The FMN simultaneously establishes the textual-visual pairwise affinities and generates a language aware coarse saliency map. to refine the coarse map, the Recurrent Fine-tune Network (RFN) is proposed to enhance its predicted performance progressively by self-supervision. Our approach only leverages the caption to provide important cues of salient object, but generates a fine-detailed foreground map at a detecting speed of 72 FPS without any post-processing. Extensive experiments demonstrate that our method takes full advantage of textual information of natural language in saliency detection, and performs favorably against state-of-the-art approaches on the most existing datasets. (C) 2019 Elsevier Ltd. All rights reserved.

上一条：Multi-attention guided feature fusion network for salient object detection

下一条：Salient Object Detection via Multiple Instance Learning