Zhilei Ren (任志磊)

Personal Information

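Professor

Doctoral Supervisor

Master's Supervisor

Position: Deputy Director, Institute of Software Engineering

Gender: Male

Alma mater: Dalian University of Technology

Degree: Ph.D.

Affiliation: School of Software; International School of Information and Software

E-mail: zren@dlut.edu.cn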

Publications

A More Accurate Model for Finding Tutorial Segments Explaining APIs

Paper type: Conference paper

Publication date: 2016-01-01

Indexed in: CPCI-S, SCIE

Pages: 157-167

Keywords: Application Programming Interface; Text Classification; Feature Construction

Abstract: Developers often rely on third-party libraries to implement functionality, and therefore make frequent use of Application Programming Interfaces (APIs). When facing an unfamiliar API, developers tend to consult tutorials as learning resources. Unfortunately, the segments explaining a specific API are scattered across tutorials, so finding the relevant segments remains a challenging issue. In this study, we propose a more accurate model for finding the exact tutorial fragments that explain APIs. The new model consists of a text classifier with domain-specific features. More specifically, we identify two important indicators that complement traditional text-based features, namely co-occurrence APIs and knowledge-based API extensions. In addition, we incorporate Word2Vec, a semantic similarity metric, to enhance the new model. Extensive experiments on two publicly available tutorial datasets show that the new model can find up to 90% of the fragments explaining APIs and improves on the state-of-the-art model by up to 30% in terms of F-measure.
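
To make the feature idea in the abstract more concrete, the following is a minimal sketch of how one fragment and one target API might be turned into a feature vector for a classifier. It is not the paper's implementation. The function names, the way co-occurring APIs and API extensions are supplied (as plain sets of names), the averaging of word vectors, and the toy two-dimensional embeddings are all assumptions made for illustration; a real system would load trained Word2Vec vectors instead.

```python
import math
import re


def tokenize(text):
    """Split text into lowercase word tokens (a deliberately simple tokenizer)."""
    return re.findall(r"[a-z0-9_]+", text.lower())


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def mean_vector(tokens, embeddings, dim):
    """Average the embeddings of the tokens that have one; zeros if none do."""
    vecs = [embeddings[t] for t in tokens if t in embeddings]
    if not vecs:
        return [0.0] * dim
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def fragment_features(fragment, api, cooccurring_apis, extensions, embeddings, dim):
    """Build a hypothetical feature vector for one (fragment, API) pair.

    Features (all assumptions for illustration):
      0: 1.0 if the API name itself appears in the fragment, else 0.0
      1: number of known co-occurring API names found in the fragment
      2: number of known API extension names found in the fragment
      3: cosine similarity between averaged word vectors of fragment and API name
    """
    tokens = tokenize(fragment)
    token_set = set(tokens)
    mentions_api = 1.0 if api.lower() in token_set else 0.0
    cooccur = float(sum(1 for a in cooccurring_apis if a.lower() in token_set))
    extended = float(sum(1 for e in extensions if e.lower() in token_set))
    similarity = cosine(
        mean_vector(tokens, embeddings, dim),
        mean_vector(tokenize(api), embeddings, dim),
    )
    return [mentions_api, cooccur, extended, similarity]


# Toy data: the API names and vectors below are made up for the example.
toy_embeddings = {
    "datetime": [1.0, 0.0],
    "interval": [1.0, 0.0],
    "duration": [0.0, 1.0],
}
features = fragment_features(
    fragment="Use DateTime together with Interval to compute a Duration",
    api="DateTime",
    cooccurring_apis={"Interval", "Period"},
    extensions={"Duration"},
    embeddings=toy_embeddings,
    dim=2,
)
print(features)
```

A vector like this could then be passed, alongside ordinary text features, to any standard classifier that labels a fragment as relevant or irrelevant to the API.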