MSR 2020
Mon 29 - Tue 30 June 2020
co-located with ICSE 2020
Events (14 results)

A C/C++ Code Vulnerability Dataset with Code Changes and CVE Summaries

Data Showcase When: Tue 30 Jun 2020 11:36 - 11:48 People: Jiahao Fan, Yi Li, Shaohua Wang, Tien N. Nguyen

… code repository links, we downloaded all of the code repositories and extract … vulnerabilities spanning 91 different vulnerability types. All these code vulnerabilities are extracted from 348 Git projects. All this information has been stored …

A Large-Scale Comparative Evaluation of IR-Based Tools for Bug Localization

Technical Papers When: Mon 29 Jun 2020 15:00 - 15:10 People: Shayan Akbar, Avinash Kak

… . It is important to realize that the original authors of all these three generations … of the present paper is to present a comprehensive large-scale evaluation of all three …

A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits

Data Showcase When: Mon 29 Jun 2020 17:12 - 17:21 People: Tanner Fry, Tapajit Dey, Andrey Karnauch, Audris Mockus

… , the World of Code collection. In this paper, we propose a method that finds all … the list of all author IDs that were found to have aliases. To do this, we first …

Improved Automatic Summarization of Subroutines via Attention to File Context

Technical Papers When: Tue 30 Jun 2020 14:48 - 15:00 People: Sakib Haque, Alexander LeClair, Lingfei Wu, Collin McMillan

… -based approaches assume that all the content needed to predict summaries …

A Dataset of Dockerfiles

Data Showcase When: Mon 29 Jun 2020 11:24 - 11:36 People: Jordan Henkel, Christian Bird, Shuvendu Lahiri, Thomas Reps

… to the next are all available at: https://doi.org/10.5281/zenodo.3628771. …

20-MAD - 20 years of issues and commits of Mozilla and Apache Development

Data Showcase When: Mon 29 Jun 2020 17:21 - 17:30 People: Maëlick Claes, Mika Mäntylä

… , 3.4M commits, 2.3M issues, and 17.3M issue comments. The data contains all

Hall-of-Apps: The Top Android Apps Metadata Archive

Data Showcase When: Tue 30 Jun 2020 10:37 - 10:45 People: Laura Bello-Jiménez, Camilo Escobar-Velásquez, Anamaria Mojica-Hanke, Santiago Cortés-Fernández, Mario Linares-Vásquez

… database with all the information contained in app’s HTML files (e.g., app …

Detecting and Characterizing Bots that Commit Code

Technical Papers When: Tue 30 Jun 2020 10:45 - 10:52 People: Tapajit Dey, Sara Mousavi, Eduardo Ponce, Tanner Fry, Bogdan Vasilescu, Anna Filippova, Audris Mockus

… a shareable dataset containing detailed information about 461 bots we found (all

Capture the Feature Flag: Detecting Feature Flags in Open-Source

Technical Papers When: Tue 30 Jun 2020 10:30 - 10:37 People: Jens Meinicke, Juan Hoyos, Bogdan Vasilescu, Christian Kästner

… approach to all open-source GitHub projects, identifying 231,223 candidate feature …

SoftMon: A Tool to Compare Similar Open-source Software from a Performance Perspective

Technical Papers When: Tue 30 Jun 2020 11:36 - 11:48 People: Shubhankar Suman Singh, Smruti Ranjan Sarangi

… -fledged operating systems (OSs). In all cases, our tool was able to pinpoint a set …

RTPTorrent: An Open-source Dataset for Evaluating Regression Test Prioritization

Technical Papers When: Mon 29 Jun 2020 16:30 - 16:37 People: Toni Mattis, Patrick Rein, Falco Dürsch, Robert Hirschfeld

… provide reproducible baselines for initial comparisons and make all data …

PUMiner: Mining Security Posts from Developer Question and Answer Websites with PU Learning

Technical Papers When: Tue 30 Jun 2020 11:24 - 11:36 People: Triet Le Huynh Minh, David Hin, Roland Croft, Muhammad Ali Babar

… is effective with the validation performance of at least 85% across all model …

Mutation Testing Meets Software Analytics: A Hands-On Tutorial

Education When: Tue 30 Jun 2020 14:30 - 15:00 People: Fabio Palomba

… Software testing is an essential activity to ensure software quality. In a typical use case scenario, developers write a set of test cases and run them periodically on production code to identify defects. However, not all tests have …

AIMMX: Artificial Intelligence Model Metadata Extractor

Technical Papers When: Mon 29 Jun 2020 10:36 - 10:42 People: Jason Tsay, Alan Braz, Martin Hirzel, Avraham Shinnar, Todd Mummert

… Despite all of the power that machine learning and artificial intelligence (AI) models bring to applications, much of AI development is currently a fairly ad hoc process. Software engineering and AI development share many of the same …