樊鑫

个人信息Personal Information

教授

博士生导师

硕士生导师

主要任职:软件学院、大连理工大学-立命馆大学国际信息与软件学院院长、党委副书记

性别:男

毕业院校:西安交通大学

学位:博士

所在单位:软件学院、国际信息与软件学院

学科:软件工程. 计算数学

电子邮箱:xin.fan@dlut.edu.cn

扫描关注

论文成果

当前位置: 樊鑫的主页 >> 科学研究 >> 论文成果

Unsupervised depth estimation from monocular videos with hybrid geometric-refined loss and contextual attention

点击次数:

论文类型:期刊论文

发表时间:2020-02-28

发表刊物:NEUROCOMPUTING

收录刊物:EI、SCIE

卷号:379

页面范围:250-261

ISSN号:0925-2312

关键字:Unsupervised; Monocular video; Attention; Hybrid geometric-refined loss

摘要:Most existing methods based on convolutional neural networks (CNNs) are supervised, which require a large amount of ground-truth data for training. Recently, some unsupervised methods utilize stereo image pairs as input by transforming depth estimation into a view synthesis problem, but need stereo camera as an additional equipment for data acquisition. Therefore, we use more available monocular videos captured from monocular camera as our input, and propose an unsupervised learning framework to predict scene depth maps from monocular video frames. First, we design a novel unsupervised hybrid geometricrefined loss, which can explicitly explore more accurate geometric relationship between the input color image and the predicted depth map, and preserve depth boundaries and fine structures in depth maps. Then, we design a contextual attention module to capture nonlocal dependencies along the spatial and channel dimensions in a dual path, which can improve the ability of feature representation and further preserve fine depth details. In addition, we also utilize an adversarial loss to discriminate synthetic or realistic color images by training a discriminator so as to produce realistic results. Experimental results demonstrate that the proposed framework achieves comparable or even better results than those trained with monocular videos or stereo image pairs. (C) 2019 Elsevier B.V. All rights reserved.