个人信息Personal Information
教授
博士生导师
硕士生导师
性别:男
毕业院校:哈尔滨建筑大学
学位:博士
所在单位:建设管理系
学科:工程管理. 管理科学与工程. 项目管理
Web-Based Text Mining for Extracting Relationships among Policies of Building and the Construction Industry
点击次数:
论文类型:会议论文
发表时间:2014-09-27
收录刊物:EI、Scopus
页面范围:237-245
摘要:Web-based data mining is an emerging technology that is increasingly being applied in Decision Support Systems in many industries. The objectives of this study are to develop a Web-based text mining system specific to the building and construction industry for web content gathering, handling and analysis. An authoritative website in the housing industrialization field is chosen as a case on which to carry out this study by means of four steps, as follows. First, a web crawler module for gathering the original web articles has been constructed. Second, the web content processing module is used to parse the HTML tags and segment the text content of each page. Then, the relational database deployed in a cloud platform is used to store the processing result. Finally, the Vector Space Model and TF-IDF algorithm are used to represent articles and calculate the relationship among all the articles gained in the web crawler module. As the government issues the news and policies online continuously, it is possible for people to know the key points and trends embodied in these policies in time by way of the proposed text mining system. ? 2014 American Society of Civil Engineers.