Personal Information
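Associate Professor
Master's Supervisor
Gender: Male
Alma mater: Dalian University of Technology
Degree: Ph.D.
Affiliation: School of Software; International School of Information and Software
Office: School of Software, Dalian University of Technology, Dalian Economic Development Zone
Contact: 15641190702
E-mail: piaoy@dlut.edu.cn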
From Text to XML by Structural Information Extraction
Paper type: Conference paper
Publication date: 2015-01-01
Indexed in: CPCI-S
Pages: 448-452
Keywords: text structural information; information extraction; conditional random fields; XML expression
Abstract: Given the tremendous volume of semi-structured XML and unstructured free text, network information retrieval has become a research hotspot for handling these data more efficiently, precisely, and uniformly. Many traditional IR methods ignore text semantics, and their labeling results usually have only one level and lack context expression. This work therefore studies structure extraction from free text and its conversion to XML format, and provides a CRF-based algorithm, SIECRF. Experimental results are analyzed, showing that the algorithm extracts text structure efficiently and has good application prospects.
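The conversion step mentioned in the abstract, turning multi-level labels into XML, can be illustrated with a minimal sketch. This is not the SIECRF algorithm, whose details the abstract does not give. It assumes that some labeler (a CRF in the paper) has already assigned each text segment a nesting depth and a label; the function name `labeled_segments_to_xml`, the `(depth, label, text)` input format, and the tag names are all hypothetical.

```python
import xml.etree.ElementTree as ET


def labeled_segments_to_xml(segments, root_tag="document"):
    """Nest (depth, label, text) triples into an XML tree.

    Hypothetical input format: depth 1 means a direct child of the root,
    depth 2 a child of the most recent depth-1 element, and so on.
    """
    root = ET.Element(root_tag)
    # Stack of (depth, element) pairs forming the current open path.
    stack = [(0, root)]
    for depth, label, text in segments:
        if depth < 1:
            raise ValueError("depth must be at least 1")
        # Close every element that is not an ancestor of this segment.
        while stack[-1][0] >= depth:
            stack.pop()
        node = ET.SubElement(stack[-1][1], label)
        node.text = text
        stack.append((depth, node))
    return root


# Illustrative labels only; a real system would obtain them from a trained model.
segments = [
    (1, "section", "Introduction"),
    (2, "paragraph", "XML is semi-structured."),
    (2, "paragraph", "Free text is not."),
    (1, "section", "Method"),
    (2, "paragraph", "Label each segment."),
]
root = labeled_segments_to_xml(segments)
xml_string = ET.tostring(root, encoding="unicode")
print(xml_string)
```

The stack keeps the chain of currently open elements, so each new segment attaches to the nearest shallower one. This is one simple way to express more than one labeling level together with its context in XML.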