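Personal Information
Professor
Doctoral supervisor
Master's supervisor
Primary appointment: Vice Dean, School of Information and Communication Engineering
Other appointments: Director, Electronic Technology Teaching and Research Section
Gender: Male
Alma mater: Beijing Institute of Technology
Degree: Ph.D.
Affiliation: School of Information and Communication Engineering
Disciplines: Communication and Information Systems; Signal and Information Processing
Office: Haishan Building B511
Contact: QQ: 51574683
Email: xfgong@dlut.edu.cn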
Speech Separation Based on Semi-blind Kurtosis Maximization with Magnitude and Energy Distance
Paper type: Conference paper
Publication date: 2012-08-21
Indexed in: EI, CPCI-S, Scopus
Page range: 50-53
Keywords: semi-blind source separation; speech separation; complex-valued kurtosis maximization; closeness measure; reference signal
Abstract: Frequency-domain blind source separation (BSS) is efficient for separating convolutively mixed speech because it reduces time-domain convolutive mixtures to instantaneous mixtures of complex-valued speech signals at each frequency bin, but it suffers from permutation ambiguity. Since the semi-blind complex kurtosis maximization (KM) algorithm can separate complex-valued signals in a fixed order by incorporating magnitude priors about the sources as references, we apply it here to speech separation in the frequency domain. The closeness measure between the BSS estimate and the reference is vital for the semi-blind KM algorithm to extract a specific source once the reference is determined, so we examine two different closeness measures in this study. One is based on the magnitude of the reference, as originally used by the semi-blind KM algorithm, and the other is based on the energy of the reference. We define a distance between the source of interest and the other sources in terms of the closeness measure, and we compare both the distances for frequency-domain speech signals and the speech separation performance obtained with the two closeness measures. The results demonstrate that the distance under the new closeness measure is larger than that under the original one, owing to energy matching between the estimate and the reference, and that the semi-blind KM algorithm with the new closeness measure achieves better performance for frequency-domain speech separation.
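As a rough illustration of the quantities named in the abstract, the sketch below computes the kurtosis of a zero-mean complex-valued sequence and two candidate closeness measures between an estimate and a real-valued reference. The kurtosis formula is the standard one for zero-mean complex random variables. The two closeness functions are assumptions made for illustration only: the abstract does not give the paper's definitions, so the squared-difference forms and the names `complex_kurtosis`, `magnitude_closeness`, and `energy_closeness` are hypothetical.

```python
import cmath
import math


def complex_kurtosis(y):
    """Kurtosis of a zero-mean complex sequence:
    E|y|^4 - 2 (E|y|^2)^2 - |E[y^2]|^2."""
    n = len(y)
    m4 = sum(abs(v) ** 4 for v in y) / n      # fourth-order moment
    m2 = sum(abs(v) ** 2 for v in y) / n      # power
    pseudo = sum(v * v for v in y) / n        # pseudo-variance E[y^2]
    return m4 - 2.0 * m2 ** 2 - abs(pseudo) ** 2


def magnitude_closeness(y, r):
    """Assumed form: mean squared gap between |y| and the reference r."""
    return sum((abs(v) - ref) ** 2 for v, ref in zip(y, r)) / len(y)


def energy_closeness(y, r):
    """Assumed form: mean squared gap between |y|^2 and r^2."""
    return sum((abs(v) ** 2 - ref ** 2) ** 2 for v, ref in zip(y, r)) / len(y)


# Toy data: eight unit-modulus samples evenly spaced on the unit circle.
y = [cmath.exp(2j * math.pi * k / 8) for k in range(8)]
matched_ref = [abs(v) for v in y]   # reference equal to the true magnitudes
mismatched_ref = [2.0] * len(y)     # reference with twice the magnitude

print(complex_kurtosis(y))
print(magnitude_closeness(y, mismatched_ref))
print(energy_closeness(y, mismatched_ref))
```

In this toy case the energy-based gap for the mismatched reference is larger than the magnitude-based gap, which is in the spirit of the abstract's observation that energy matching widens the distance between the source of interest and the others. It says nothing about the paper's actual measurements.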