马晓红

个人信息Personal Information

教授

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:信息与通信工程学院

学科:信号与信息处理

办公地点:创新园大厦A516房间

联系方式:maxh@dlut.edu.cn

电子邮箱:maxh@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Audio Source Separation from a Monaural Mixture Using Convolutional Neural Network in the Time Domain

点击次数:

论文类型:会议论文

发表时间:2017-01-01

收录刊物:EI、CPCI-S

卷号:10262

页面范围:388-395

关键字:Monaural source separation; Convolutional neural network; Deep learning

摘要:Audio source separation from a monaural mixture, which is termed as monaural source separation, is an important and challenging problem for applications. In this paper, a monaural source separation method using convolutional neural network in the time domain is proposed. The proposed neural network, input and output of which are both time-domain signals, consists of three convolutional layers, each of which is followed by a max-pooling layer, and two fully-connected layers. There are two key ideas behind the time-domain convolutional network: one is learning features automatically by the convolutional layers instead of extracting features such as spectra; the other is that the phase can be recovered automatically since both the input and output are in the time domain. The proposed approach is evaluated using the TSP speech corpus for monaural source separation, and achieves around 4.31-7.77 SIR gain with respect to the deep neural network, the recurrent neural network and nonnegative matrix factorization, while maintaining better SDR and SAR.