个人信息Personal Information
教授
硕士生导师
性别:女
毕业院校:大连理工大学
学位:博士
所在单位:信息与通信工程学院
学科:信号与信息处理
办公地点:创新园大厦A516房间
联系方式:maxh@dlut.edu.cn
电子邮箱:maxh@dlut.edu.cn
Audio Source Separation from a Monaural Mixture Using Convolutional Neural Network in the Time Domain
点击次数:
论文类型:会议论文
发表时间:2017-01-01
收录刊物:EI、CPCI-S
卷号:10262
页面范围:388-395
关键字:Monaural source separation; Convolutional neural network; Deep learning
摘要:Audio source separation from a monaural mixture, which is termed as monaural source separation, is an important and challenging problem for applications. In this paper, a monaural source separation method using convolutional neural network in the time domain is proposed. The proposed neural network, input and output of which are both time-domain signals, consists of three convolutional layers, each of which is followed by a max-pooling layer, and two fully-connected layers. There are two key ideas behind the time-domain convolutional network: one is learning features automatically by the convolutional layers instead of extracting features such as spectra; the other is that the phase can be recovered automatically since both the input and output are in the time domain. The proposed approach is evaluated using the TSP speech corpus for monaural source separation, and achieves around 4.31-7.77 SIR gain with respect to the deep neural network, the recurrent neural network and nonnegative matrix factorization, while maintaining better SDR and SAR.