副教授 博士生导师 硕士生导师
性别: 男
毕业院校: 大连理工大学
学位: 博士
所在单位: 建设管理系
学科: 工程管理
办公地点: 综合实验四号楼517室
电子邮箱: shjiang@dlut.edu.cn
开通时间: ..
最后更新时间: ..
点击次数:
论文类型: 会议论文
发表时间: 2014-09-27
收录刊物: EI、Scopus
页面范围: 237-245
摘要: Web-based data mining is an emerging technology that is increasingly being applied in Decision Support Systems in many industries. The objectives of this study are to develop a Web-based text mining system specific to the building and construction industry for web content gathering, handling and analysis. An authoritative website in the housing industrialization field is chosen as a case on which to carry out this study by means of four steps, as follows. First, a web crawler module for gathering the original web articles has been constructed. Second, the web content processing module is used to parse the HTML tags and segment the text content of each page. Then, the relational database deployed in a cloud platform is used to store the processing result. Finally, the Vector Space Model and TF-IDF algorithm are used to represent articles and calculate the relationship among all the articles gained in the web crawler module. As the government issues the news and policies online continuously, it is possible for people to know the key points and trends embodied in these policies in time by way of the proposed text mining system. ? 2014 American Society of Civil Engineers.