notice
Seminar by Dr. Christoph Treude (University of São Paulo, Brazil)
Speaker: Dr. Christoph Treude
University of São Paulo, Brazil
Title: Using Natural Language Processing to Extract Task Descriptions from Software Documentation
Date: Thursday, September 17th
Time: 10AM
Place: EV11.119
ABSTRACT
Knowledge management plays a central role in many software development organizations. While much of the important technical knowledge can be captured in documentation, there often exists a gap between the information needs of software developers and the documentation structure. To help developers navigate documentation, we developed a technique for automatically extracting tasks from software documentation by conceptualizing tasks as specific programming actions that have been described in the documentation. More than 70 percent of the tasks we extracted from the documentation of two projects were judged meaningful by at least one of two developers. We present TaskNavigator, a user interface for search queries that suggests tasks extracted with our technique in an auto-complete list along with concepts, code elements, and section headers. We conducted a field study in which six professional developers used TaskNavigator for two weeks as part of their ongoing work. We found search results identified through extracted tasks to be more helpful to developers than those found through concepts, code elements, and section headers. The results indicate that task descriptions can be effectively extracted from software documentation, and that they help bridge the gap between documentation structure and the information needs of software developers.
BIO
Christoph Treude received the Diplom degree in computer science/management information systems from the University of Siegen, Germany, and the PhD degree in computer science from the University of Victoria, Canada. He is currently a postdoctoral researcher at the Institute for Mathematics and Statistics at the University of São Paulo, Brazil. His research interests include empirical software engineering, natural language processing, and social media.