Traceability Support for Multi-Lingual Software Projects (MSR 2020 - Technical Papers)

Who

Yalin Liu, Jinfeng Lin, Jane Cleland-Huang

Track

MSR 2020 Technical Papers

When

Tue 30 Jun 2020 16:50 - 17:00 at MSR:Zoom - Developer Collaboration Chair(s): Bogdan Vasilescu

Abstract

Software traceability establishes associations between diverse software artifacts such as requirements, design, code, and test cases. Due to the non-trivial costs of manually creating and maintaining links, many researchers have proposed automated approaches based on information retrieval techniques. However, many globally distributed software projects produce software artifacts written in two or more languages. The use of intermingled languages reduces the efficacy of automated tracing solutions. In this paper, we first analyze and discuss patterns of intermingled language use across multiple projects, and then evaluate several different tracing algorithms including the Vector Space Model (VSM), Latent Semantic Indexing (LSI), Latent Dirichlet Allocation (LDA), and various models that combine mono- and cross-lingual word embeddings with the Generative Vector Space Model (GVSM). Based on an analysis of 14 Chinese-English projects, our results show that best performance is achieved using mono-lingual word embeddings integrated into GVSM with machine translation as a preprocessing step.
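To make the simplest baseline named in the abstract concrete, the following is a minimal sketch of VSM-style trace link ranking: artifacts are turned into TF-IDF vectors and candidate links are ordered by cosine similarity. It is an illustration only, not the authors' implementation. The function names (`tokenize`, `tfidf_vectors`, `cosine`, `rank_links`), the smoothed IDF weighting, and the ASCII-only tokenizer are assumptions made for this example; the tokenizer in particular assumes the artifacts are already in English, for instance after a machine translation step.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase ASCII word tokens (assumes English input)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    # Smoothed IDF, chosen here for illustration so no weight is zero.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}
    return [
        {t: tf * idf[t] for t, tf in Counter(toks).items()}
        for toks in tokenized
    ]


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_links(sources, targets):
    """Score every (source, target) pair and sort by descending similarity.

    Returns a list of (source_index, target_index, score) tuples.
    """
    vectors = tfidf_vectors(sources + targets)
    src_vecs = vectors[: len(sources)]
    tgt_vecs = vectors[len(sources):]
    links = [
        (i, j, cosine(sv, tv))
        for i, sv in enumerate(src_vecs)
        for j, tv in enumerate(tgt_vecs)
    ]
    return sorted(links, key=lambda link: link[2], reverse=True)
```

Calling `rank_links(requirements, code_snippets)` with two lists of strings returns candidate links with the most similar pairs first. A sketch like this relies purely on shared terms, which is one way to see why artifacts that mix two languages are hard for such a baseline: terms written in different languages never match unless they are translated or mapped into a shared representation.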

Yalin Liu

University of Notre Dame

Jinfeng Lin

University of Notre Dame

Jane Cleland-Huang

University of Notre Dame

United States

Session Program

Tue 30 Jun
Displayed time zone: (UTC) Coordinated Universal Time

16:00 - 17:00	Developer Collaboration
Tracks: Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything / Technical Papers
Location: MSR:Zoom
Chair(s): Bogdan Vasilescu (Carnegie Mellon University)
Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:00 10m Live Q&A		Need for tweet. How open-source developers use Twitter to talk about their GitHub work. MSR - Technical Paper, Technical Papers. Hongbo Fang, Daniel Klug, Hemank Lamba, James Herbsleb, Bogdan Vasilescu (Carnegie Mellon University)
16:10 10m Live Q&A		Can We Use SE-specific Sentiment Analysis Tools in a Cross-Platform Setting? MSR - Technical Paper, Technical Papers. Nicole Novielli (University of Bari), Fabio Calefato (University of Bari), Davide Dongiovanni (University of Bari), Daniela Girardi (University of Bari), Filippo Lanubile (University of Bari)
16:20 10m Live Q&A		GitterCom: A Dataset of Open Source Developer Communications in Gitter. MSR - Data Showcase, Data Showcase. Esteban Parra Rodriguez (Florida State University), Ashley Ellis, Sonia Haiduc (Florida State University)
16:30 10m Live Q&A		The Impact of Dynamics of Collaborative Software Engineering on Introverts: A Study Protocol. MSR - Registered Reports, Registered Reports. Ingrid Nunes (Universidade Federal do Rio Grande do Sul (UFRGS), Brazil), Christoph Treude (The University of Adelaide), Fabio Calefato (University of Bari)
16:40 10m Live Q&A		Software-related Slack Chats with Disentangled Conversations. MSR - Data Showcase, Data Showcase. Preetha Chatterjee (University of Delaware, USA), Kostadin Damevski (Virginia Commonwealth University), Nicholas A. Kraft (UserVoice), Lori Pollock
16:50 10m Live Q&A		Traceability Support for Multi-Lingual Software Projects. ACM SIGSOFT Distinguished Paper Award. MSR - Technical Paper, Technical Papers. Yalin Liu (University of Notre Dame), Jinfeng Lin (University of Notre Dame), Jane Cleland-Huang (University of Notre Dame)