龚晓峰

个人信息Personal Information

教授

博士生导师

硕士生导师

主要任职:信息与通信工程学院副院长

其他任职:电子技术教研室主任

性别:男

毕业院校:北京理工大学

学位:博士

所在单位:信息与通信工程学院

学科:通信与信息系统. 信号与信息处理

办公地点:海山楼B511

联系方式:QQ:51574683

电子邮箱:xfgong@dlut.edu.cn

扫描关注

论文成果

当前位置: 龚晓峰的个人主页 >> 科学研究 >> 论文成果

An improved BLUES with adaptive threshold of condition number for separating underdetermined speech mixtures

点击次数:

论文类型:会议论文

发表时间:2012-07-15

收录刊物:EI、Scopus

页面范围:694-698

摘要:Speech separation has been studied for decades, to which one challenge is the underdetermined problem, where there are more sources than microphones. To solve this problem, Pedersen et al. proposed recently an effective algorithm called BLUES (BLind Underdetermined Extraction of Sources) by combining ICA and time-frequency masking, and it works well on instantaneous/convolutive mixtures of both speech and music. One key ingredient to BLUES is the stopping criterion of the separation process, where the condition number of the outputs is compared with a fixed threshold in the original version. However, as audio recordings are always varying in speech sources and their number, using a fixed threshold would not fit in with these changes, and then deteriorate the overall performance. As such, we propose a threshold update strategy to improve BLUES by adapting the threshold with an increasing rate to find the most suitable condition number. A new criterion based on detection of the number of the sources is then presented to stop the algorithm. The experiments are carried out by using the synthetic and real recorded underdetermined mixtures. The results show that our approach obtains improved performance compared to the original BLUES when the number of the speeches included in the underdetermined mixtures is increased. ? 2012 IEEE.