个人信息Personal Information
教授
博士生导师
硕士生导师
主要任职:软件学院、大连理工大学-立命馆大学国际信息与软件学院院长、党委副书记
性别:男
毕业院校:西安交通大学
学位:博士
所在单位:软件学院、国际信息与软件学院
学科:软件工程. 计算数学
电子邮箱:xin.fan@dlut.edu.cn
Unsupervised depth estimation from monocular videos with hybrid geometric-refined loss and contextual attention
点击次数:
论文类型:期刊论文
发表时间:2020-02-28
发表刊物:NEUROCOMPUTING
收录刊物:EI、SCIE
卷号:379
页面范围:250-261
ISSN号:0925-2312
关键字:Unsupervised; Monocular video; Attention; Hybrid geometric-refined loss
摘要:Most existing methods based on convolutional neural networks (CNNs) are supervised, which require a large amount of ground-truth data for training. Recently, some unsupervised methods utilize stereo image pairs as input by transforming depth estimation into a view synthesis problem, but need stereo camera as an additional equipment for data acquisition. Therefore, we use more available monocular videos captured from monocular camera as our input, and propose an unsupervised learning framework to predict scene depth maps from monocular video frames. First, we design a novel unsupervised hybrid geometricrefined loss, which can explicitly explore more accurate geometric relationship between the input color image and the predicted depth map, and preserve depth boundaries and fine structures in depth maps. Then, we design a contextual attention module to capture nonlocal dependencies along the spatial and channel dimensions in a dual path, which can improve the ability of feature representation and further preserve fine depth details. In addition, we also utilize an adversarial loss to discriminate synthetic or realistic color images by training a discriminator so as to produce realistic results. Experimental results demonstrate that the proposed framework achieves comparable or even better results than those trained with monocular videos or stereo image pairs. (C) 2019 Elsevier B.V. All rights reserved.