Chen Zhe (陈喆)

Personal Information

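Professor

Doctoral Supervisor

Master's Supervisor

Gender: Male

Alma Mater: Dalian University of Technology

Degree: Doctorate

Affiliation: School of Information and Communication Engineering

Disciplines: Signal and Information Processing; Communication and Information Systems

Office: Room A526, Innovation Park Building, Dalian University of Technology

Phone: 0411-84706005-3526

Email: zhechen@dlut.edu.cn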

Publications

Audio-Visual Speaker Tracking Based on Dynamic Bayesian Network

Publication date: 2008-01-01

Journal: Acta Automatica Sinica (自动化学报)

Affiliation: School of Software

Issue: 9

Pages: 1083-1089

ISSN: 0254-4156

Abstract: Multi-sensor data fusion is applied to the speaker tracking problem, and a novel audio-visual speaker tracking approach based on a dynamic Bayesian network is proposed. Exploiting the complementarity and redundancy between a speaker's speech and image, three perception methods are used to acquire tracking information: sound source localization with a microphone array, face detection based on skin-color information, and mutual information maximization based on audio-visual synchronization. Within the dynamic Bayesian network framework, particle filtering fuses the tracking information, and perception management based on information entropy theory improves tracking efficiency. Experiments on real-world data demonstrate that the proposed method tracks the speaker robustly even in the presence of perturbing factors such as high room reverberation and video occlusion.
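
The fusion step described in the abstract can be illustrated with a minimal particle filter sketch. This is not the paper's implementation: the one-dimensional speaker position, the random-walk motion model, the Gaussian likelihoods, the noise parameters, and the function names `gaussian_likelihood` and `particle_filter_step` are all assumptions made for illustration. The entropy of the particle weights is returned only as a rough stand-in for the kind of uncertainty measure that entropy-based perception management might use.

```python
import math
import random


def gaussian_likelihood(observation, predicted, sigma):
    """Unnormalized Gaussian likelihood of an observation given a predicted value."""
    return math.exp(-0.5 * ((observation - predicted) / sigma) ** 2)


def particle_filter_step(particles, audio_obs, video_obs, rng,
                         motion_sigma=0.05, audio_sigma=0.3, video_sigma=0.1):
    """One predict/update/resample step fusing an audio and a video position cue.

    A missing cue (for example, an occluded face) is passed as None and skipped.
    All models and parameters here are illustrative assumptions.
    """
    n = len(particles)

    # Predict: random-walk motion model (assumed, not taken from the paper).
    predicted = [p + rng.gauss(0.0, motion_sigma) for p in particles]

    # Update: multiply per-cue likelihoods, assuming the cues are
    # conditionally independent given the speaker position.
    weights = []
    for p in predicted:
        w = 1.0
        if audio_obs is not None:
            w *= gaussian_likelihood(audio_obs, p, audio_sigma)
        if video_obs is not None:
            w *= gaussian_likelihood(video_obs, p, video_sigma)
        weights.append(w)

    # Normalize; fall back to uniform weights if every likelihood underflowed.
    total = sum(weights)
    if total == 0.0:
        weights = [1.0 / n] * n
    else:
        weights = [w / total for w in weights]

    # Posterior mean as the position estimate.
    estimate = sum(w * p for w, p in zip(weights, predicted))

    # Shannon entropy of the weights: low when a few particles dominate.
    entropy = -sum(w * math.log(w) for w in weights if w > 0.0)

    # Resample in proportion to the weights (multinomial resampling).
    resampled = rng.choices(predicted, weights=weights, k=n)
    return resampled, estimate, entropy
```

A typical use would initialize the particles uniformly over the room, then call `particle_filter_step` once per frame with whatever cues are available, passing `None` for a cue that is missing in that frame.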

Remarks: newly added retrospective data