Improved Automatic Summarization of Subroutines via Attention to File Context (MSR 2020 - Technical Papers)

Who

Sakib Haque, Alexander LeClair, Lingfei Wu, Collin McMillan

Track

MSR 2020 Technical Papers

When

Tue 30 Jun 2020 14:48 - 15:00 (UTC) at MSR:Zoom - ML4SE. Chair(s): Kevin Moran

Abstract

Software documentation largely consists of short, natural language summaries of the subroutines in the software. These summaries help programmers quickly understand what a subroutine does without having to read the source code themselves. The task of writing these descriptions is called “source code summarization” and has been a target of research for several years. Recently, AI-based approaches have superseded older, heuristic-based approaches. Yet, to date, these AI-based approaches assume that all the content needed to predict summaries is inside the subroutine itself. This assumption limits performance because many subroutines cannot be understood without surrounding context. In this paper, we present an approach that models the file context of subroutines (i.e., other subroutines in the same file) and uses an attention mechanism to find words and concepts to use in summaries. We show in an experiment that our approach extends and improves several recent baselines.
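
To illustrate the general idea of attending to file context, here is a minimal sketch of dot-product attention over vectors that stand in for the other subroutines in a file. It is not the authors' model: the function names, the toy vectors, and the choice of plain dot-product scoring are all assumptions made for illustration, and the abstract does not specify how the paper encodes subroutines or computes attention.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_to_file_context(query, context_vectors):
    """Weight each context vector by its similarity to the query.

    query: vector for the summary being generated (hypothetical decoder state).
    context_vectors: one vector per other subroutine in the same file
        (hypothetical encodings; how they are produced is not modeled here).
    Returns (weights, summary), where summary is the weighted average
    of the context vectors.
    """
    # Dot-product score between the query and each context vector.
    scores = [sum(q * c for q, c in zip(query, vec)) for vec in context_vectors]
    weights = softmax(scores)
    dim = len(query)
    summary = [
        sum(w * vec[i] for w, vec in zip(weights, context_vectors))
        for i in range(dim)
    ]
    return weights, summary


# Toy example: three "other subroutines" in the file, encoded as 2-d vectors.
query = [1.0, 0.0]
file_context = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
weights, summary = attend_to_file_context(query, file_context)
```

In this toy setup the first context vector is most similar to the query, so it receives the largest weight; in a real summarization model, the weighted summary would typically be combined with the encoding of the subroutine itself before predicting the next summary word.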

Link to Preprint

https://arxiv.org/abs/2004.04881

Sakib Haque

University of Notre Dame

Bangladesh

Alexander LeClair

University of Notre Dame

United States

Lingfei Wu

IBM Research

Collin McMillan