个人信息Personal Information
副教授
硕士生导师
性别:男
毕业院校:大连理工大学
学位:博士
所在单位:软件学院、国际信息与软件学院
办公地点:大连经济开发区大连理工大学软件学院
联系方式:15641190702
电子邮箱:piaoy@dlut.edu.cn
A hybrid method for XML clustering
点击次数:
论文类型:会议论文
发表时间:2010-12-18
收录刊物:EI、Scopus
页面范围:286-290
摘要:An effective XML cluster method called neighbor center clustering algorithm (NCC) is presented in this paper, whose similarity is obtained through both structural and content information contained in XML files. Structural similarity is measured by the idea of Longest Common Subsequence, while content similarity is achieved using TF-IDF principles. It reduces computation complexity by avoiding direct search for cluster centers. Experiments show that the NCC can obtain high purity and F-measure value and is suitable and applicable for clustering XML with both homogenous and heterogeneous structures. ? 2010 IEEE.