任志磊

个人信息Personal Information

教授

博士生导师

硕士生导师

任职 : 软件工程研究所副所长

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:软件学院、国际信息与软件学院

电子邮箱:zren@dlut.edu.cn

扫描关注

论文成果

当前位置: 任志磊 >> 科学研究 >> 论文成果

Source code fragment summarization with small-scale crowdsourcing based features

点击次数:

论文类型:期刊论文

发表时间:2016-06-01

发表刊物:FRONTIERS OF COMPUTER SCIENCE

收录刊物:SCIE、EI、ISTIC、CSCD

卷号:10

期号:3

页面范围:504-517

ISSN号:2095-2228

关键字:summarizing code fragments; supervised learning; crowdsourcing

摘要:Recent studies have applied different approaches for summarizing software artifacts, and yet very few efforts have been made in summarizing the source code fragments available on web. This paper investigates the feasibility of generating code fragment summaries by using supervised learning algorithms.We hire a crowd of ten individuals from the same work place to extract source code features on a corpus of 127 code fragments retrieved from Eclipse and Net- Beans Official frequently asked questions (FAQs). Human annotators suggest summary lines. Our machine learning algorithms produce better results with the precision of 82% and performstatistically better than existing code fragment classifiers. Evaluation of algorithms on several statistical measures endorses our result. This result is promising when employing mechanisms such as data-driven crowd enlistment improve the efficacy of existing code fragment classifiers.