MSR 2020
Mon 29 - Tue 30 June 2020
co-located with ICSE 2020

The Data Showcase provides a forum to share and discuss important data sets that underpin the work of the Mining Software Repositories community.

Dates
Tracks
You're viewing the program in a time zone which is different from your device's time zone - change time zone

Mon 29 Jun
Times are displayed in time zone: (UTC) Coordinated Universal Time change

10:30 - 11:00
Programming Languages & ModelsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Dimitris KolovosUniversity of York

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

10:30
6m
Live Q&A
An Empirical Study on the Impact of Deimplicitization on Program ComprehensionMSR - Registered Reports
Registered Reports
A: Jürgen CitoMIT, A: Jiasi ShenMassachusetts Institute of Technology, A: Martin RinardMIT
Pre-print Media Attached
10:36
6m
Live Q&A
AIMMX: Artificial Intelligence Model Metadata ExtractorMSR - Technical Paper
Technical Papers
Jason TsayIBM Research, Alan BrazIBM Research, Martin HirzelIBM Research, Avraham ShinnarIBM Research, Todd Mummert
Pre-print Media Attached
10:42
6m
Live Q&A
Using Large-Scale Anomaly Detection on Code to Improve Kotlin CompilerMSR - Technical Paper
Technical Papers
Timofey BryksinJetBrains Research, Saint Petersburg State University, Victor PetukhovJetBrains, ITMO University, Ilya Alexin, Stanislav Prikhodko, Alexey Shpilman, Vladimir KovalenkoTU Delft, Nikita PovarovJetBrains
Pre-print Media Attached
10:48
6m
Live Q&A
An Empirical Study of Method Chaining in JavaMSR - Technical Paper
Technical Papers
Tomoki NakamaruGraduate School of Information Science and Technology, The University of Tokyo, Tomomasa Matsunaga, Tetsuro YamazakiGraduate School of Information Science and Technology, The University of Tokyo, Soramichi AkiyamaDepartment of Creative Informatics, The University of Tokyo, Shigeru ChibaThe University of Tokyo
Pre-print Media Attached
10:54
6m
Live Q&A
Painting Flowers: Reasons for Using Single-State State Machines in Model-Driven EngineeringMSR - Technical Paper
Technical Papers
Nan YangEindhoven University of Technology, The Netherlands, Pieter Cuijpers, Ramon SchiffelersEindhoven University of Technology and ASML, the Netherlands, Johan Lukkien, Alexander SerebrenikEindhoven University of Technology
Media Attached
10:30 - 11:00
Refactoring & TestingTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom2
Chair(s): Maurício AnicheDelft University of Technology, Netherlands

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

10:30
7m
Live Q&A
Characterizing and Identifying Composite Refactorings: Concepts, Heuristics and PatternsMSR - Technical Paper
Technical Papers
Leonardo Da Silva SousaCarnegie Mellon University, USA, Diego CedrimPontifical Catholic University of Rio de Janeiro, Alessandro GarciaPUC-Rio, Willian OizumiPUC-Rio, Ana Carla BibianoPUC-Rio, Daniel OliveiraPUC-Rio, Miryung KimUniversity of California, Los Angeles, Anderson OliveiraPUC-Rio
Pre-print Media Attached
10:37
7m
Live Q&A
Behind the Intents: An In-depth Empirical Study on Software Refactoring in Modern Code ReviewMSR - Technical Paper
Technical Papers
Matheus PaixaoUniversity of Fortaleza, Anderson UchôaPontifical Catholic University of Rio de Janeiro (PUC-Rio), Ana Carla BibianoPUC-Rio, Daniel OliveiraPUC-Rio, Alessandro GarciaPUC-Rio, Jens KrinkeUniversity College London, Emilio Arvonio
Pre-print Media Attached
10:45
7m
Live Q&A
JTeC: A Large Collection of Java Test Classes for Test Code Analysis and ProcessingMSR - Data Showcase
Data Showcase
Federico Corò, A: Roberto VerdecchiaVrije Universiteit Amsterdam, A: Emilio Cruciani, A: Breno MirandaFederal University of Pernambuco, A: Antonia BertolinoCNR-ISTI
Pre-print Media Attached
10:52
7m
Live Q&A
TestRoutes: A Manually Curated Method Level Dataset for Test-to-Code TraceabilityMSR - Data Showcase
Data Showcase
A: András Kicsi, A: László VidácsUniversity of Szeged, Hungary, A: Tibor Gyimothy
Pre-print Media Attached
11:00 - 12:00
Build, CI, & DependenciesTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Raula Gaikovina KulaNAIST

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

11:00
12m
Live Q&A
A Tale of Docker Build Failures: A Preliminary StudyMSR - Technical Paper
Technical Papers
Yiwen WuNational University of Defense Technology, Yang ZhangNational University of Defense Technology, China, Tao WangNational University of Defense Technology, Huaimin Wang
Pre-print Media Attached
11:12
12m
Live Q&A
Using Others' Tests to Avoid Breaking UpdatesMSR - Technical Paper
Technical Papers
Suhaib MujahidConcordia University, Rabe AbdalkareemConcordia University, Montreal, Canada, Emad ShihabConcordia University, Shane McIntoshMcGill University
Pre-print Media Attached
11:24
12m
Live Q&A
A Dataset of DockerfilesMSR - Data Showcase
Data Showcase
A: Jordan HenkelUniversity of Wisconsin–Madison, A: Christian Bird, A: Shuvendu LahiriMicrosoft Research, A: Thomas RepsUniversity of Wisconsin-Madison, USA
Media Attached
11:36
12m
Live Q&A
Empirical Study of Restarted and Flaky Builds on Travis CIMSR - Technical Paper
Technical Papers
Thomas DurieuxKTH Royal Institute of Technology, Sweden, Claire Le GouesCarnegie Mellon University, Michael HiltonCarnegie Mellon University, USA, Rui AbreuInstituto Superior Técnico, U. Lisboa & INESC-ID
DOI Pre-print Media Attached
11:48
12m
Live Q&A
LogChunks: A Data Set for Build Log AnalysisMSR - Data Showcase
Data Showcase
A: Carolin BrandtDelft University of Technology, A: Annibale PanichellaDelft University of Technology, A: Andy ZaidmanTU Delft, A: Moritz BellerFacebook, USA
Pre-print Media Attached
12:00 - 13:00
Code SmellsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Alessandro GarciaPUC-Rio

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

12:00
10m
Live Q&A
Detecting Video Game-Specific Bad Smells in Unity ProjectsMSR - Technical Paper
Technical Papers
Pre-print Media Attached
12:10
10m
Live Q&A
Investigating Severity Thresholds for Test SmellsMSR - Technical Paper
Technical Papers
Davide SpadiniDelft University of Technology, Netherlands, Martin Schvarcbacher, Ana Maria Oprescu, Magiel BruntinkSoftware Improvement Group, Alberto BacchelliUniversity of Zurich
DOI Pre-print Media Attached
12:20
10m
Live Q&A
On the Prevalence, Impact, and Evolution of SQL code smells in Data-Intensive SystemsMSR - Technical Paper
Technical Papers
Biruk Asmare Muse, Masud RahmanDalhousie University, Csaba NagySoftware Institute - USI, Lugano, Anthony CleveUniversity of Namur, Foutse KhomhPolytechnique Montréal, Giuliano AntoniolPolytechnique Montréal
Pre-print Media Attached
12:30
10m
Live Q&A
Multi-language Design Smells: A Backstage PerspectiveMSR - Registered Reports
Registered Reports
A: Mouna Abidi, A: Moses Openja, A: Foutse KhomhPolytechnique Montréal
Pre-print Media Attached
12:40
10m
Live Q&A
The Scent of Deep Learning Code: An Empirical StudyMSR - Technical Paper
Technical Papers
Hadhemi Jebnoun, Masud RahmanDalhousie University, Foutse KhomhPolytechnique Montréal, Houssem Ben Braiek
Pre-print Media Attached
12:50
10m
Live Q&A
Developer-Driven Code Smell PrioritizationMSR - Technical Paper
Technical Papers
Fabiano PecorelliUniversity of Salerno, Fabio PalombaUniversity of Salerno, Foutse KhomhPolytechnique Montréal, Andrea De LuciaUniversity of Salerno
Pre-print Media Attached
14:30 - 15:30
Bugs & IssuesTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Francisco ServantVirginia Tech

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

14:30
10m
Live Q&A
Dataset of Video Game Development ProblemsMSR - Data Showcase
Data Showcase
A: Cristiano PolitowskiConcordia University, Canada, A: Fabio PetrilloUniversity of Quebec at Chicoutimi, A: Yann-Gaël GuéhéneucConcordia University and Polytechnique Montréal, A: Gabriel Cavalheiro UllmannUNIJUI - Universidade Regional do Noroeste do Estado do Rio Grande do Sul, A: Josias De Andrade Werly
Media Attached
14:40
10m
Live Q&A
On the Relationship between User Churn and Software IssuesMSR - Technical Paper
Technical Papers
Omar El Zarif, Daniel Alencar Da CostaUniversity of Otago, Safwat HassanQueens University, Kingston, Canada, Ying ZouQueen's University, Kingston, Ontario
Pre-print Media Attached
14:50
10m
Live Q&A
A Soft Alignment Model for Bug DeduplicationMSR - Technical Paper
Technical Papers
Pre-print Media Attached
15:00
10m
Live Q&A
A Large-Scale Comparative Evaluation of IR-Based Tools for Bug LocalizationMSR - Technical Paper
Technical Papers
Media Attached
15:10
10m
Live Q&A
How Often Do Single-Statement Bugs Occur? The ManySStuBs4J DatasetMSR - Data Showcase
Data Showcase
A: Rafael-Michael KarampatsisThe University of Edinburgh, A: Charles SuttonGoogle Research
Pre-print Media Attached
15:20
10m
Live Q&A
Large-Scale Manual Validation of Bugfixing ChangesMSR - Registered Reports
Registered Reports
A: Steffen HerboldUniversity of Göttingen, A: Alexander TrautschUniversity of Göttingen, A: Benjamin Ledel
Pre-print Media Attached
16:30 - 17:30
Github & OSS DatasetsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Olga BaysalCarleton University

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:30
8m
Live Q&A
A New Dataset for Pull Request AcceptanceMSR - Data Showcase
Data Showcase
A: Xunhui ZhangNational University of Defense Technology, China, A: Ayushi RastogiPostdoctoral researcher at TU Delft, A: Yue YuCollege of Computer, National University of Defense Technology, Changsha 410073, China
Pre-print Media Attached
16:38
8m
Live Q&A
A Mixed Graph-Relational Dataset of Socio-technicalInteractions in Open Source SystemsMSR - Data Showcase
Data Showcase
A: Usman Ashraf, A: Christoph Mayr-DornJohannes Kepler University Linz, A: Alexander EgyedJohannes Kepler University, Linz, A: Sebastiano Panichella
Media Attached
16:47
8m
Live Q&A
A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared CommitsMSR - Data Showcase
Data Showcase
A: Audris Mockus, A: Zoe KottiAthens University of Economics and Business, A: Diomidis SpinellisAthens University of Economics and Business, A: Gabriel Dusing
Media Attached
16:55
8m
Live Q&A
A Dataset of Enterprise-Driven Open Source SoftwareMSR - Data Showcase
Data Showcase
A: Diomidis SpinellisAthens University of Economics and Business, A: Zoe KottiAthens University of Economics and Business, A: Konstantinos Kravvaritis, A: Georgios Theodorou, A: Panos LouridasAthens University of Economics and Business
DOI Pre-print Media Attached
17:04
8m
Live Q&A
A Dataset for GitHub Repository DeduplicationMSR - Data Showcase
Data Showcase
A: Diomidis SpinellisAthens University of Economics and Business, A: Zoe KottiAthens University of Economics and Business, A: Audris Mockus
DOI Pre-print Media Attached
17:12
8m
Live Q&A
A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git CommitsMSR - Data Showcase
Data Showcase
A: Tanner Fry, A: Tapajit Dey, A: Andrey KarnauchUniversity of Tennessee Knoxville, A: Audris Mockus
Pre-print Media Attached
17:21
8m
Live Q&A
20-MAD - 20 years of issues and commits of Mozilla and Apache DevelopmentMSR - Data Showcase
Data Showcase
A: Maëlick Claes University of Oulu, A: Mika MäntyläUniversity of Oulu
Media Attached
16:30 - 17:00
Platforms & DatasetsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom2
Chair(s): Moritz BellerFacebook, USA

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:30
7m
Live Q&A
RTPTorrent: An Open-source Dataset for Evaluating Regression Test PrioritizationMSR - Technical Paper
Technical Papers
Toni MattisHasso Plattner Institute, University of Potsdam, Patrick ReinHasso Plattner Institute, Falco Dürsch, Robert HirschfeldHasso-Plattner-Institut (HPI), Germany
DOI Pre-print Media Attached
16:37
7m
Live Q&A
Polyglot and Distributed Software Repository Mining with CROSSFLOWMSR - Technical Paper
Technical Papers
Konstantinos Barmpis , Patrick NeubauerUniversity of York, UK, Jonathan Co, Dimitris KolovosUniversity of York, Nicholas Matragkas, Richard PaigeMcMaster University
Media Attached
16:45
7m
Live Q&A
Boa Views: Easy Modularization and Sharing of MSR AnalysesMSR - Technical Paper
Technical Papers
Che Shian Hung, Robert DyerUniversity of Nebraska - Lincoln
Pre-print Media Attached
16:52
7m
Live Q&A
Determining the Intrinsic Structure of Public Software Development HistoryMSR - Registered Reports
Registered Reports
A: Antoine PietriInria, A: Guillaume RousseauUniversité de Paris and Inria, A: Stefano ZacchiroliUniversité de Paris and Inria
Pre-print Media Attached

Tue 30 Jun
Times are displayed in time zone: (UTC) Coordinated Universal Time change

10:30 - 11:00
EvolutionTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Jürgen CitoMIT

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

10:30
7m
Live Q&A
Capture the Feature Flag: Detecting Feature Flags in Open-SourceMSR - Technical Paper
Technical Papers
Jens MeinickeCarnegie Mellon University, Juan HoyosUniversidad Nacional de Colombia, Bogdan VasilescuCarnegie Mellon University, Christian KaestnerCarnegie Mellon University
Pre-print Media Attached
10:37
7m
Live Q&A
Forking Without Clicking: on How to Identify Software Repository ForksMSR - Technical Paper
Technical Papers
Antoine PietriInria, Guillaume RousseauUniversité de Paris and Inria, Stefano ZacchiroliUniversité de Paris and Inria
Pre-print Media Attached
10:45
7m
Live Q&A
Visualization of Methods Changeability Based on VCS DataMSR - Technical Paper
Technical Papers
Sergey Svitkov, Timofey BryksinJetBrains Research, Saint Petersburg State University
Pre-print Media Attached
10:52
7m
Live Q&A
Employing Contribution and Quality Metrics for Quantifying the Software Development ProcessMSR - Data Showcase
Data Showcase
A: Themistoklis DiamantopoulosElectrical and Computer Engineering Dept, Aristotle University of Thessaloniki, A: Michail Papamichail , A: Thomas Karanikiotis, A: Kyriakos Chatzidimitriou Aristotle University of Thessaloniki, A: Andreas SymeonidisAristotle University of Thessaloniki
Pre-print Media Attached
10:30 - 11:00
Apps & BotsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom2
Chair(s): Ivano MalavoltaVrije Universiteit Amsterdam

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

10:30
7m
Live Q&A
AndroZooOpen: Collecting Large-scale Open Source Android Apps for the Research CommunityMSR - Data Showcase
Data Showcase
A: Pei Liu, A: Li LiMonash University, Australia, A: Yanjie Zhao, A: Xiaoyu Sun, A: John GrundyMonash University
Media Attached
10:37
7m
Live Q&A
Hall-of-Apps: The Top Android Apps Metadata ArchiveMSR - Data Showcase
Data Showcase
Media Attached
10:45
7m
Live Q&A
Detecting and Characterizing Bots that Commit CodeMSR - Technical Paper
Technical Papers
Tapajit Dey, Sara Mousavi, Eduardo PonceUniversity of Tennessee - Knoxville, Tanner Fry, Bogdan VasilescuCarnegie Mellon University, Anna Filippova, Audris MockusUniversity of Tennessee - Knoxville
Pre-print Media Attached
10:52
7m
Live Q&A
Challenges in Chatbot Development: A Study of Stack Overflow PostsMSR - Technical Paper
Technical Papers
Ahmad AbdellatifConcordia University, Diego CostaConcordia University, Canada, Khaled BadranConcordia University, Rabe AbdalkareemConcordia University, Montreal, Canada, Emad ShihabConcordia University
Pre-print Media Attached
11:00 - 12:00
QualityTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Jens KrinkeUniversity College London

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

11:00
12m
Live Q&A
Beyond the Code: Mining Self-Admitted Technical Debt in Issue Tracker SystemsMSR - Technical Paper
Technical Papers
Laerte XavierUniversidade Federal de Minas Gerais (UFMG), Fabio da Silva Ferreira, Rodrigo Brito, Marco Tulio ValenteFederal University of Minas Gerais, Brazil
Pre-print Media Attached
11:12
12m
Live Q&A
An Empirical Study on Regular Expression BugsMSR - Technical Paper
Technical Papers
Peipei WangNorth Carolina State University, USA, Chris BrownNorth Carolina State University, Jamie JenningsNorth Carolina State University, Kathryn StoleeNorth Carolina State University
Pre-print Media Attached
11:24
12m
Live Q&A
Do Explicit Review Strategies Improve Code Review Performance?MSR - Registered Reports
Registered Reports
A: Pavlína Wurzel Gonçalves, A: Enrico Fregnan, A: Tobias Baum, A: Kurt SchneiderLeibniz Universität Hannover, Software Engineering Group, A: Alberto BacchelliUniversity of Zurich
Pre-print Media Attached
11:36
12m
Live Q&A
SoftMon: A Tool to Compare Similar Open-source Software from a Performance PerspectiveMSR - Technical Paper
Technical Papers
Pre-print Media Attached
11:48
12m
Live Q&A
A Study of Potential Code Borrowing and License Violations in Java Projects on GitHubMSR - Technical Paper
Technical Papers
Yaroslav GolubevJetBrains Research, ITMO University, Maria Eliseeva, Nikita PovarovJetBrains, Timofey BryksinJetBrains Research, Saint Petersburg State University
Pre-print Media Attached
11:00 - 12:00
SecurityTechnical Papers / Data Showcase at MSR:Zoom2
Chair(s): Dimitris MitropoulosAthens University of Economics and Business

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

11:00
12m
Live Q&A
Did You Remember To Test Your Tokens?MSR - Technical Paper
Technical Papers
Danielle GonzalezRochester Institute of Technology, USA, Michael RathTechnische Universität Ilmenau, Mehdi MirakhorliRochester Institute of Technology
DOI Pre-print Media Attached
11:12
12m
Live Q&A
Automatically Granted Permissions in Android appsMSR - Technical Paper
Technical Papers
Paolo Calciati IMDEA Software Institute, Konstantin KuznetsovSaarland University, CISPA, Alessandra GorlaIMDEA Software Institute, Andreas ZellerCISPA Helmholtz Center for Information Security
Media Attached
11:24
12m
Live Q&A
PUMiner: Mining Security Posts from Developer Question and Answer Websites with PU LearningMSR - Technical Paper
Technical Papers
Triet Le Huynh MinhThe University of Adelaide, David Hin, Roland Croft, Muhammad Ali BabarThe University of Adelaide
DOI Pre-print Media Attached
11:36
12m
Live Q&A
A C/C++ Code Vulnerability Dataset with Code Changes and CVE SummariesMSR - Data Showcase
Data Showcase
A: Jiahao FanNew Jersey Institute of Technology, USA, A: Yi LiNew Jersey Institute of Technology, USA, A: Shaohua WangNew Jersey Institute of Technology, USA, A: Tien N. NguyenUniversity of Texas at Dallas
Media Attached
11:48
12m
Live Q&A
The Impact of a Major Security Event on an Open Source Project: The Case of OpenSSLMSR - Technical Paper
Technical Papers
James WaldenNorthern Kentucky University
Pre-print Media Attached
14:00 - 15:00
ML4SETechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Kevin MoranWilliam & Mary/George Mason University

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

14:00
12m
Live Q&A
A Machine Learning Approach for Vulnerability CurationACM SIGSOFT Distinguished Paper AwardMSR - Technical Paper
Technical Papers
Chen YangVeracode, Inc., Andrew SantosaVeracode, Inc., Ang Ming Yi, Abhishek Sharma Singapore Management University, Singapore, Asankhaya SharmaVeracode, Inc., David LoSingapore Management University
Pre-print Media Attached
14:12
12m
Live Q&A
Embedding Java Classes with code2vec: Improvements from Variable ObfuscationMSR - Technical Paper
Technical Papers
Rhys ComptonUniversity of Waikato, Eibe FrankDepartment of Computer Science, University of Waikato, Panos Patros, Abigail KoayUniversity of Waikato
DOI Pre-print Media Attached
14:24
12m
Live Q&A
A Study on the Accuracy of OCR Engines for Source Code Transcription from Programming ScreencastsMSR - Technical Paper
Technical Papers
Abdulkarim MalkadiFlorida State University, USA - Jazan University, KSA, Mohammad AlahmadiFlorida State University, Sonia HaiducFlorida State University
Pre-print Media Attached
14:36
12m
Live Q&A
What is the Vocabulary of Flaky Tests?MSR - Technical Paper
Technical Papers
Gustavo PintoUFPA, Breno MirandaFederal University of Pernambuco, Supun DissanayakeThe University of Adelaide, Marcelo d'AmorimFederal University of Pernambuco, Christoph TreudeThe University of Adelaide, Antonia BertolinoCNR-ISTI
Pre-print Media Attached
14:48
12m
Live Q&A
Improved Automatic Summarization of Subroutines via Attention to File ContextMSR - Technical Paper
Technical Papers
Sakib HaqueUniversity of Notre Dame, Alexander LeClairUniversity Of Notre Dame, Lingfei WuIBM Research, Collin McMillanUniversity of Notre Dame
Pre-print Media Attached
16:00 - 17:00
Developer CollaborationTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Bogdan VasilescuCarnegie Mellon University

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:00
10m
Live Q&A
Need for tweet. How open-source developers use Twitter to talk about their GitHub workMSR - Technical Paper
Technical Papers
Pre-print Media Attached
16:10
10m
Live Q&A
Can We Use SE-specific Sentiment Analysis Tools in a Cross-Platform Setting?MSR - Technical Paper
Technical Papers
Nicole NovielliUniversity of Bari, Fabio CalefatoUniversity of Bari, Davide DongiovanniUniversity of Bari, Daniela GirardiUniversity of Bari, Filippo LanubileUniversity of Bari
DOI Pre-print Media Attached
16:20
10m
Live Q&A
GitterCom: A Dataset of Open Source Developer Communications in GitterMSR - Data Showcase
Data Showcase
A: Esteban ParraFlorida State University, A: Ashley Ellis, A: Sonia HaiducFlorida State University
Pre-print Media Attached
16:30
10m
Live Q&A
The Impact of Dynamics of Collaborative Software Engineering on Introverts: A Study ProtocolMSR - Registered Reports
Registered Reports
A: Ingrid NunesUniversidade Federal do Rio Grande do Sul (UFRGS), Brazil, A: Christoph TreudeThe University of Adelaide, A: Fabio CalefatoUniversity of Bari
Pre-print Media Attached
16:40
10m
Live Q&A
Software-related Slack Chats with Disentangled ConversationsMSR - Data Showcase
Data Showcase
A: Preetha ChatterjeeUniversity of Delaware, USA, A: Kostadin DamevskiVirginia Commonwealth University, A: Nicholas A. KraftUserVoice, A: Lori Pollock
Pre-print Media Attached
16:50
10m
Live Q&A
Traceability Support for Multi-Lingual Software ProjectsACM SIGSOFT Distinguished Paper AwardMSR - Technical Paper
Technical Papers
Yalin LiuUniversity of Notre Dame, Jinfeng LinUniversity of Notre Dame, Jane Cleland-HuangUniversity of Notre Dame
Media Attached
16:00 - 17:00
Visions & ReflectionsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom2
Chair(s): Venera ArnaoudovaWashington State University

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:00
15m
Live Q&A
The State of the ML-universe: 10 Years of Artificial Intelligence & Machine Learning Software Development on GitHubMSR - Technical Paper
Technical Papers
Danielle GonzalezRochester Institute of Technology, USA, Thomas ZimmermannMicrosoft Research, Nachiappan NagappanMicrosoft Research
DOI Pre-print Media Attached
16:15
15m
Live Q&A
Ethical Mining – A Case Study on MSR Mining ChallengesACM SIGSOFT Distinguished Paper AwardMSR - Technical Paper
Technical Papers
Nicolas GoldUniversity College London, Jens KrinkeUniversity College London
DOI Pre-print Media Attached
16:30
15m
Live Q&A
From Innovations to Prospects: What Is Hidden Behind Cryptocurrencies?MSR - Technical Paper
Technical Papers
Ang JiaXi'an Jiaotong University, Ming FanXi'an Jiaotong University, Xi Xu, Di CuiXi'an Jiaotong University, Wenying Wei, Zijiang YangWestern Michigan University, Kai Ye, Ting LiuXi'an Jiaotong University
DOI Pre-print Media Attached
16:45
15m
Live Q&A
What constitutes Software? An Empirical, Descriptive Study of ArtifactsMSR - Technical Paper
Technical Papers
Pre-print Media Attached

Call for Papers

Data Showcase papers should describe data sets that are curated by their authors and made available to use by others. Ideally, these data sets should be of value to others in the community, should be preprocessed or filtered in some way, and should provide an easy-to-understand schema. Data showcase papers are expected to include:

  • a description of the data source,
  • a description of the methodology used to gather the data (including provenance and the tool used to create/generate/gather the data, if any),
  • a description of the storage mechanism, including a schema if applicable,
  • if the data has been used by the authors or others, a description of how this was done including references to previously published papers,
  • a description of the originality of the data set (that is, even if the data set has been used in a published paper, its complete description must be unpublished),
  • ideas for future research questions that could be answered using the data set,
  • ideas for further improvements that could be made to the data set, and
  • any limitations and/or challenges in creating or using the data set.

The data set should be made available at the time of submission of the paper for review, but will be considered confidential until publication of the paper. At the latest upon publication of the paper the authors should archive the data on a persistent repository that can provide a digital object identifier (DOI) such as zenodo.org, figshare.com, Archive.org, or institutional repositories. In this way the data will become citable; the DOI-based citation of the data set should be included in the camera-ready version of the paper.

Data showcase papers are not:

  • empirical studies
  • tool demos
  • or data sets that are
    • based on poorly explained or untrustworthy heuristics for data collection, or
    • result of trivial application of generic tools.

If custom tools have been used to create the data set, we expect the paper to be accompanied by the source code of the tools, along with clear documentation on how to run the tools to recreate the data set. The tools should be open source, accompanied by an appropriate license; the source code should be citable, i.e., refer to a specific release and have a DOI. GItHub provides an easy way to make source code citable. If you cannot provide the source code or the source code clause is not applicable (e.g., because the data set consists of qualitative data), please provide a short explanation of why this is not possible.

Submission

Submit your data paper (maximum 4 pages, plus 1 additional page of references) to EasyChair on or before February 6th, 2020 (abstract due January 30th).

Submitted papers will undergo single-blind peer review. We opt for single-blind peer review (as opposed to the double-blind peer review of the main track) due to the requirement above to describe the ways how data has been used in the previous studies, including the bibliographic reference to those studies. Such reference is likely to disclose the authors’ identity.

To make research data sets and research software accessible and citable, we further encourage authors to attend to the FAIR rules, i.e., data should be: Findable, Accessible, Interoperable, and Reusable.

The submission must conform to the ACM Conference Proceedings Formatting Guidelines (https://www.acm.org/publications/proceedings-template). LaTeX users must use the provided acmart.cls and ACM-Reference-Format.bst without modification, enable the conference format in the preamble of the document (i.e., \documentclass[sigconf,review]{acmart}), and use the ACM reference format for the bibliography (i.e., \bibliographystyle{ACM-Reference-Format}). The review option adds line numbers, thereby allowing referees to refer to specific lines in their comments.

Papers submitted for consideration should not have been published elsewhere and should not be under review or submitted for review elsewhere for the duration of consideration. ACM plagiarism policies and procedures shall be followed for cases of double submission. The submission must also comply with the IEEE Policy on Authorship. Please read the ACM Policy and Procedures on Plagiarism (https://www.acm.org/publications/policies/plagiarism) and the IEEE Plagiarism FAQ (https://www.ieee.org/publications/rights/plagiarism/plagiarism-faq.html) before submitting.

To submit please use the EasyChair link.

Upon notification of acceptance, all authors of accepted papers will be asked to complete a copyright form and will receive further instructions for preparing their camera ready versions. At least one author of each paper is expected to register and present the results at the MSR 2020 conference. All accepted contributions will be published in the conference electronic proceedings.

A selection of the best papers will be invited to EMSE Special Issue.

Important Dates

Abstracts Due: January 30, 2020, 23:59 AOE

Papers Due: February 6, 2020, 23:59 AOE

Author Notification: March 2, 2020

Camera Ready: March 16, 2020, 23:59 AOE

Organization

Olga Baysal, Carleton University, Canada

Bogdan Vasilescu, Carnegie Mellon University, USA

Accepted Papers

Title
20-MAD - 20 years of issues and commits of Mozilla and Apache DevelopmentMSR - Data Showcase
Data Showcase
Media Attached
A C/C++ Code Vulnerability Dataset with Code Changes and CVE SummariesMSR - Data Showcase
Data Showcase
Media Attached
A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared CommitsMSR - Data Showcase
Data Showcase
Media Attached
A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git CommitsMSR - Data Showcase
Data Showcase
Pre-print Media Attached
A Dataset for GitHub Repository DeduplicationMSR - Data Showcase
Data Showcase
DOI Pre-print Media Attached
A Dataset of DockerfilesMSR - Data Showcase
Data Showcase
Media Attached
A Dataset of Enterprise-Driven Open Source SoftwareMSR - Data Showcase
Data Showcase
DOI Pre-print Media Attached
A Mixed Graph-Relational Dataset of Socio-technicalInteractions in Open Source SystemsMSR - Data Showcase
Data Showcase
Media Attached
A New Dataset for Pull Request AcceptanceMSR - Data Showcase
Data Showcase
Pre-print Media Attached
AndroZooOpen: Collecting Large-scale Open Source Android Apps for the Research CommunityMSR - Data Showcase
Data Showcase
Media Attached
Dataset of Video Game Development ProblemsMSR - Data Showcase
Data Showcase
Media Attached
Employing Contribution and Quality Metrics for Quantifying the Software Development ProcessMSR - Data Showcase
Data Showcase
Pre-print Media Attached
GitterCom: A Dataset of Open Source Developer Communications in GitterMSR - Data Showcase
Data Showcase
Pre-print Media Attached
Hall-of-Apps: The Top Android Apps Metadata ArchiveMSR - Data Showcase
Data Showcase
Media Attached
How Often Do Single-Statement Bugs Occur? The ManySStuBs4J DatasetMSR - Data Showcase
Data Showcase
Pre-print Media Attached
JTeC: A Large Collection of Java Test Classes for Test Code Analysis and ProcessingMSR - Data Showcase
Data Showcase
Pre-print Media Attached
LogChunks: A Data Set for Build Log AnalysisMSR - Data Showcase
Data Showcase
Pre-print Media Attached
Software-related Slack Chats with Disentangled ConversationsMSR - Data Showcase
Data Showcase
Pre-print Media Attached
TestRoutes: A Manually Curated Method Level Dataset for Test-to-Code TraceabilityMSR - Data Showcase
Data Showcase
Pre-print Media Attached