Indexed by:Conference paper
Date of Publication:2017-01-01
Included Journals:Scopus
Page Number:71-74
Abstract:XML documents carry both structural and semantic information, which gives XML-based data integration and deep utilization more precise description and more versatile expression. At the same time, traditional NLP and DM methods cannot be applied to them directly. This paper discusses feature dimension reduction and general similarity of XML documents based on tensor analysis. Considering the correlation between an XML document's structure and its content, a tensor-based model for describing XML documents and an MMI method for XML dimension reduction are presented. Since structure and content are not independent of each other, a tensor-based algorithm that calculates general similarity from a non-linear perspective is designed to show their relationships and their effects on performance. © 2016 IEEE.