MSR 2020
Mon 29 - Tue 30 June 2020
co-located with ICSE 2020
Mon 29 Jun 2020 16:30 - 16:38 at MSR:Zoom - Github & OSS Datasets Chair(s): Olga Baysal

Pull-based development is a widely adopted paradigm for collaboration in distributed software development, attracting attention from both academia and industry. To better study the pull-based development model, this paper presents a new dataset containing 96 features collected from 11,230 projects and 3,347,937 pull requests. We describe the creation process and explain the features in detail. To the best of our knowledge, our dataset is the largest and most comprehensive one available, offering a more complete picture for pull-based development research.

Mon 29 Jun

Displayed time zone: (UTC) Coordinated Universal Time

16:30 - 17:30
Github & OSS Datasets
Technical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Olga Baysal Carleton University

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:30
8m
Live Q&A
A New Dataset for Pull Request Acceptance
MSR - Data Showcase
A: Xunhui Zhang National University of Defense Technology, China, A: Ayushi Rastogi University of Groningen, The Netherlands, A: Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China
Pre-print Media Attached
16:38
8m
Live Q&A
A Mixed Graph-Relational Dataset of Socio-technical Interactions in Open Source Systems
MSR - Data Showcase
A: Usman Ashraf, A: Christoph Mayr-Dorn Johannes Kepler University Linz, A: Alexander Egyed Johannes Kepler University, Linz, A: Sebastiano Panichella
Media Attached
16:47
8m
Live Q&A
A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared Commits
MSR - Data Showcase
A: Audris Mockus, A: Zoe Kotti Athens University of Economics and Business, A: Diomidis Spinellis Athens University of Economics and Business, A: Gabriel Dusing
Media Attached
16:55
8m
Live Q&A
A Dataset of Enterprise-Driven Open Source Software
MSR - Data Showcase
A: Diomidis Spinellis Athens University of Economics and Business, A: Zoe Kotti Athens University of Economics and Business, A: Konstantinos Kravvaritis, A: Georgios Theodorou, A: Panos Louridas Athens University of Economics and Business
DOI Pre-print Media Attached
17:04
8m
Live Q&A
A Dataset for GitHub Repository Deduplication
MSR - Data Showcase
A: Diomidis Spinellis Athens University of Economics and Business, A: Zoe Kotti Athens University of Economics and Business, A: Audris Mockus
DOI Pre-print Media Attached
17:12
8m
Live Q&A
A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits
MSR - Data Showcase
A: Tanner Fry, A: Tapajit Dey, A: Andrey Karnauch University of Tennessee Knoxville, A: Audris Mockus
Pre-print Media Attached
17:21
8m
Live Q&A
20-MAD - 20 years of issues and commits of Mozilla and Apache Development
MSR - Data Showcase
A: Maëlick Claes University of Oulu, A: Mika Mäntylä University of Oulu
Media Attached