个人信息Personal Information
教授
博士生导师
硕士生导师
性别:男
毕业院校:大连理工大学
学位:博士
所在单位:信息与通信工程学院
学科:信号与信息处理. 通信与信息系统
办公地点:大连理工大学创新园大厦A526室
联系方式:0411-84706005-3526
电子邮箱:zhechen@dlut.edu.cn
Melody Extraction From Polyphonic Music Using Particle Filter and Dynamic Programming
点击次数:
论文类型:期刊论文
发表时间:2018-09-01
发表刊物:IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
收录刊物:SCIE
卷号:26
期号:9
页面范围:1620-1632
ISSN号:2329-9290
关键字:Melody extraction; Bayesian filtering; particle filter; dynamic programming; music information retrieval
摘要:Melody extraction from polyphonic music is one important but challenging task in the music information retrieval community. In this paper, a new melody extraction method based on the particle filter and dynamic programming is proposed. The constant-Q transform is first introduced for multiresolution spectral analysis of polyphonic music. Then, the melody extraction is modeled in the Bayesian filtering framework, and the particle filter is used to get a rough melody contour. Specially, the pitch transition probability of adjacent frames is approximated according to the statistical analysis based on one publicly available dataset, and the likelihood of frame-wise pitches is defined by considering pitch salience, spectral smoothness, and timbre similarity. After that, the preliminary melodic contour obtained by particle filter is smoothed to achieve the frame-wise pitch range limitation. Finally, the dynamic programming is used to accurately track the final melodic contour. The proposed method requires no prior information, and is suitable for both instrumental and vocal melodies. The experimental results show that the performances of the proposed method is robust among four publicly available datasets comparing with the state-of-the-art methods, and it achieves the highest averaged raw pitch accuracy and raw chroma accuracy performances with lower octave errors.