location: Current position: Home >> Scientific Research >> Paper Publications

Summarizing Software Artifacts: A Literature Review

Hits:

Indexed by:期刊论文

Date of Publication:2016-09-01

Journal:JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY

Included Journals:SCIE、EI、ISTIC、CSCD、Scopus

Volume:31

Issue:5

Page Number:883-909

ISSN No.:1000-9000

Key Words:mining software repositories; mining software engineering data; machine learning; summarizing software artifacts; summarizing source code

Abstract:This paper presents a literature review in the field of summarizing software artifacts, focusing on bug reports, source code, mailing lists and developer discussions artifacts. From Jan. 2010 to Apr. 2016, numerous summarization techniques, approaches, and tools have been proposed to satisfy the ongoing demand of improving software performance and quality and facilitating developers in understanding the problems at hand. Since aforementioned artifacts contain both structured and unstructured data at the same time, researchers have applied different machine learning and data mining techniques to generate summaries. Therefore, this paper first intends to provide a general perspective on the state of the art, describing the type of artifacts, approaches for summarization, as well as the common portions of experimental procedures shared among these artifacts. Moreover, we discuss the applications of summarization, i.e., what tasks at hand have been achieved through summarization. Next, this paper presents tools that are generated for summarization tasks or employed during summarization tasks. In addition, we present different summarization evaluation methods employed in selected studies as well as other important factors that are used for the evaluation of generated summaries such as adequacy and quality. Moreover, we briefly present modern communication channels and complementarities with commonalities among different software artifacts. Finally, some thoughts about the challenges applicable to the existing studies in general as well as future research directions are also discussed. The survey of existing studies will allow future researchers to have a wide and useful background knowledge on the main and important aspects of this research field.

Pre One:Analyzing Inter-objective Relationships: A Case Study of Software Upgradability

Next One:Influence analysis of Github repositories