朴勇

个人信息Personal Information

副教授

硕士生导师

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:软件学院、国际信息与软件学院

办公地点:大连经济开发区大连理工大学软件学院

联系方式:15641190702

电子邮箱:piaoy@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

From Text to XML by Structural Information Extraction

点击次数:

论文类型:会议论文

发表时间:2015-01-01

收录刊物:CPCI-S

页面范围:448-452

关键字:text structural information; information extraction; conditional random fields; XML expression

摘要:Facing tremendous volume of semi-structured XML and non-structured free text, network information retrieval is one of the most research hotspots in dealing with these data more efficiently, precisely and uniformly. Many traditional IR methods ignore text semantics and their labeling result has usually only one level, lacking of context expression as well, therefore structure extraction from free text and its conversion to XML format are studied, with a CRF based algorithm SIECRF provided. Experiment results are analyzed, showing its efficiency to extracting text structure and has a good application future.