What is the Vocabulary of Flaky Tests? (MSR 2020 - Technical Papers)

Who

Gustavo Pinto, Breno Miranda, Supun Dissanayake, Marcelo d'Amorim, Christoph Treude, Antonia Bertolino

Track

MSR 2020 Technical Papers

Time Zone

The program is currently displayed in (UTC) Coordinated Universal Time.

When

Tue 30 Jun 2020 14:36 - 14:48 at MSR:Zoom - ML4SE Chair(s): Kevin Moran

Abstract

Flaky tests are tests whose outcomes are non-deterministic. Despite the recent research activity on this topic, no effort has been made to understand the vocabulary of flaky tests (e.g., networking or concurrency identifiers). This work proposes to automatically classify tests as flaky or not. Classification of flaky tests is important, for example, to detect the introduction of flaky tests and to search for flaky tests after they are introduced in test suites. We evaluated the performance of various machine learning algorithms to solve this problem. We constructed a dataset of flaky and non-flaky tests by running more than 50k test cases, 100 times each. We then used machine learning techniques on the resulting dataset to predict which tests are flaky from their source code. Based on features such as counts of stemmed tokens extracted from source code identifiers, we achieved an F-measure of 0.95 for the identification of flaky tests. The best performance was achieved when using Random Forest and Support Vector Machines for the prediction. In terms of the code identifiers that are most strongly associated with test flakiness, we noted that job, action, and services are commonly associated with flaky tests.
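
As a rough illustration of the two data-preparation steps the abstract mentions (labeling tests from repeated runs, and counting tokens drawn from source code identifiers), here is a minimal Python sketch. It is not the authors' implementation: the function names `is_flaky`, `split_identifier`, and `token_counts` are hypothetical, the regex-based identifier extraction is a simplification that also picks up language keywords, and the stemming and classifier steps (Random Forest, Support Vector Machines) are left out.

```python
import re
from collections import Counter


def is_flaky(outcomes):
    """Label a test flaky when repeated runs disagree.

    `outcomes` is a sequence of per-run results such as "pass" / "fail".
    Assumption: any disagreement across runs counts as flaky.
    """
    return len(set(outcomes)) > 1


def split_identifier(identifier):
    """Split a camelCase or snake_case identifier into lowercase words.

    Acronyms such as "HTTP" in "HTTPServer" are kept as one word.
    Digits and underscores are dropped.
    """
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+", identifier)
    return [part.lower() for part in parts]


def token_counts(source):
    """Count word tokens from every identifier-like string in `source`.

    Simplification: a regex stands in for a real parser, so keywords
    (e.g. "void") are counted alongside genuine identifiers.
    """
    identifiers = re.findall(r"[A-Za-z_][A-Za-z0-9_]*", source)
    return Counter(
        token for ident in identifiers for token in split_identifier(ident)
    )


if __name__ == "__main__":
    # Hypothetical run history: 99 passes and 1 failure out of 100 runs.
    print(is_flaky(["pass"] * 99 + ["fail"]))

    # Hypothetical test body, used only to show the feature extraction.
    test_source = "void testJobService() { jobService.submitJob(action); }"
    print(token_counts(test_source).most_common(3))
```

The resulting per-test token counts are the kind of bag-of-words feature vector that could then be passed to a classifier; which stemmer and which classifier settings to use are choices this sketch deliberately leaves open.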

Link to Preprint

http://gustavopinto.org/lost+found/msr2020.pdf

Gustavo Pinto

UFPA

Brazil

Breno Miranda

Federal University of Pernambuco

Brazil

Supun Dissanayake

The University of Adelaide

Australia

Marcelo d'Amorim

Federal University of Pernambuco

Brazil

Christoph Treude

The University of Adelaide

Australia

Antonia Bertolino

CNR-ISTI

Italy

Session Program

Tue 30 Jun
Displayed time zone: (UTC) Coordinated Universal Time

14:00 - 15:00	ML4SE (Technical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything) at MSR:Zoom
Chair(s): Kevin Moran (William & Mary/George Mason University)
Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

14:00 12m Live Q&A	A Machine Learning Approach for Vulnerability Curation. ACM SIGSOFT Distinguished Paper Award. MSR - Technical Paper, Technical Papers. Chen Yang (Veracode, Inc.), Andrew Santosa (Veracode, Inc.), Ang Ming Yi, Abhishek Sharma (Singapore Management University, Singapore), Asankhaya Sharma (Veracode, Inc.), David Lo (Singapore Management University)
14:12 12m Live Q&A	Embedding Java Classes with code2vec: Improvements from Variable Obfuscation. MSR - Technical Paper, Technical Papers. Rhys Compton (University of Waikato), Eibe Frank (Department of Computer Science, University of Waikato), Panos Patros, Abigail Koay (University of Waikato)
14:24 12m Live Q&A	A Study on the Accuracy of OCR Engines for Source Code Transcription from Programming Screencasts. MSR - Technical Paper, Technical Papers. Abdulkarim Malkadi (Florida State University, USA - Jazan University, KSA), Mohammad Alahmadi (Florida State University), Sonia Haiduc (Florida State University)
14:36 12m Live Q&A	What is the Vocabulary of Flaky Tests? MSR - Technical Paper, Technical Papers. Gustavo Pinto (UFPA), Breno Miranda (Federal University of Pernambuco), Supun Dissanayake (The University of Adelaide), Marcelo d'Amorim (Federal University of Pernambuco), Christoph Treude (The University of Adelaide), Antonia Bertolino (CNR-ISTI)
14:48 12m Live Q&A	Improved Automatic Summarization of Subroutines via Attention to File Context. MSR - Technical Paper, Technical Papers. Sakib Haque (University of Notre Dame), Alexander LeClair (University of Notre Dame), Lingfei Wu (IBM Research), Collin McMillan (University of Notre Dame)