Li Zhongfu (李忠富)

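Personal Information

Professor

Doctoral Supervisor

Master's Supervisor

Gender: Male

Alma mater: Harbin University of Civil Engineering and Architecture (哈尔滨建筑大学)

Degree: Doctorate

Affiliation: Department of Construction Management

Disciplines: Engineering Management; Management Science and Engineering; Project Management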

Publications

Web-Based Text Mining for Extracting Relationships among Policies of Building and the Construction Industry

Paper type: Conference paper

Publication date: 2014-09-27

Indexed in: EI, Scopus

Pages: 237-245

Abstract: Web-based data mining is an emerging technology that is increasingly being applied in Decision Support Systems in many industries. The objective of this study is to develop a Web-based text mining system specific to the building and construction industry for web content gathering, handling and analysis. An authoritative website in the housing industrialization field is chosen as a case on which to carry out this study in four steps. First, a web crawler module for gathering the original web articles is constructed. Second, a web content processing module is used to parse the HTML tags and segment the text content of each page. Third, a relational database deployed on a cloud platform is used to store the processing results. Finally, the Vector Space Model and the TF-IDF algorithm are used to represent the articles and calculate the relationships among all the articles gathered by the web crawler module. As the government continuously issues news and policies online, the proposed text mining system makes it possible for people to learn the key points and trends embodied in these policies in a timely manner. © 2014 American Society of Civil Engineers.
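The second and fourth steps of the abstract (stripping HTML tags, then comparing articles with the Vector Space Model and TF-IDF) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the whitespace-and-punctuation tokenizer, the TF-IDF weighting variant (relative term frequency times the natural log of inverse document frequency), and the use of cosine similarity as the relationship measure are all assumptions made for illustration. The crawler and the cloud-hosted relational database are omitted, and real policy articles in Chinese would need a proper word segmenter in place of the simple tokenizer.

```python
import math
import re
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML fragment (hypothetical helper).

    Note: text inside <script> or <style> would also be collected;
    a production parser would need to skip those elements.
    """

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(html):
    """Return the visible text of an HTML fragment, tags removed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def tokenize(text):
    """Assumed tokenizer: lowercase alphanumeric runs only."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(texts):
    """Represent each text as a sparse {term: weight} TF-IDF vector."""
    tokenized = [tokenize(t) for t in texts]
    n_docs = len(tokenized)
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))  # count each term once per document
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero on empty text
        vectors.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_matrix(html_pages):
    """Pairwise article similarity for a list of HTML pages."""
    vectors = tfidf_vectors([strip_tags(page) for page in html_pages])
    return [[cosine(a, b) for b in vectors] for a in vectors]
```

Calling `similarity_matrix` on a list of page sources returns a square matrix in which entry `[i][j]` is higher when articles `i` and `j` share distinctive terms. Terms that occur in every article receive a weight of zero under this weighting, so they do not contribute to the score.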