个人信息Personal Information
副教授
硕士生导师
性别:女
毕业院校:大连理工大学
学位:硕士
所在单位:数学科学学院
电子邮箱:yangyc@dlut.edu.cn
Markov model plus k-word distributions: a synergy that produces novel statistical measures for sequence comparison
点击次数:
论文类型:期刊论文
发表时间:2008-10-15
发表刊物:BIOINFORMATICS
收录刊物:SCIE、PubMed、Scopus
卷号:24
期号:20
页面范围:2296-2302
ISSN号:1367-4803
摘要:Motivation: Many proposed statistical measures can efficiently compare biological sequences to further infer their structures, functions and evolutionary information. They are related in spirit because all the ideas for sequence comparison try to use the information on the k-word distributions, Markov model or both. Motivated by adding k-word distributions to Markov model directly, we investigated two novel statistical measures for sequence comparison, called wre.k.r and S2.k.r..
Results: The proposed measures were tested by similarity search, evaluation on functionally related regulatory sequences and phylogenetic analysis. This offers the systematic and quantitative experimental assessment of our measures. Moreover, we compared our achievements with these based on alignment or alignment-free. We grouped our experiments into two sets. The first one, performed via ROC (receiver operating curve) analysis, aims at assessing the intrinsic ability of our statistical measures to search for similar sequences from a database and discriminate functionally related regulatory sequences from unrelated sequences. The second one aims at assessing how well our statistical measure is used for phylogenetic analysis. The experimental assessment demonstrates that our similarity measures intending to incorporate k-word distributions into Markov model are more efficient.