MSR 2020
Mon 29 - Tue 30 June 2020
co-located with ICSE 2020
Mon 29 Jun 2020 16:38 - 16:47 at MSR:Zoom - Github & OSS Datasets Chair(s): Olga Baysal

Several researchers have studied that developers contributing to open source systems tend to self-organize in “emerging” teams. The structure of these latent teams has a significant impact on software quality, with development teams structure somewhat reflected in the way developers communicate and contribute in the subsystems of a system. Therefore, in order to study socio-technical interactions as well as the software evolution dynamics of open source systems, in this paper, we present a novel dataset, gathered from 20 open source projects, which report the developers’ activities in the scope of commits and issues at the level of subsystems. Thus, the new, generated dataset comprises of emerging and explicit links among developers, commits, issues, and source code artifacts, with data grouped around the subsystems point of view, which can be used to better study the system dynamics behind the extracted socio-technical interactions.

Mon 29 Jun

Displayed time zone: (UTC) Coordinated Universal Time change

16:30 - 17:30
Github & OSS DatasetsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom
Chair(s): Olga Baysal Carleton University

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:30
8m
Live Q&A
A New Dataset for Pull Request AcceptanceMSR - Data Showcase
Data Showcase
A: Xunhui Zhang National University of Defense Technology, China, A: Ayushi Rastogi University of Groningen, The Netherlands, A: Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China
Pre-print Media Attached
16:38
8m
Live Q&A
A Mixed Graph-Relational Dataset of Socio-technicalInteractions in Open Source SystemsMSR - Data Showcase
Data Showcase
A: Usman Ashraf , A: Christoph Mayr-Dorn Johannes Kepler University Linz, A: Alexander Egyed Johannes Kepler University, Linz, A: Sebastiano Panichella
Media Attached
16:47
8m
Live Q&A
A Complete Set of Related Git Repositories Identified via Community Detection Approaches Based on Shared CommitsMSR - Data Showcase
Data Showcase
A: Audris Mockus , A: Zoe Kotti Athens University of Economics and Business, A: Diomidis Spinellis Athens University of Economics and Business, A: Gabriel Dusing
Media Attached
16:55
8m
Live Q&A
A Dataset of Enterprise-Driven Open Source SoftwareMSR - Data Showcase
Data Showcase
A: Diomidis Spinellis Athens University of Economics and Business, A: Zoe Kotti Athens University of Economics and Business, A: Konstantinos Kravvaritis , A: Georgios Theodorou , A: Panos Louridas Athens University of Economics and Business
DOI Pre-print Media Attached
17:04
8m
Live Q&A
A Dataset for GitHub Repository DeduplicationMSR - Data Showcase
Data Showcase
A: Diomidis Spinellis Athens University of Economics and Business, A: Zoe Kotti Athens University of Economics and Business, A: Audris Mockus
DOI Pre-print Media Attached
17:12
8m
Live Q&A
A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git CommitsMSR - Data Showcase
Data Showcase
A: Tanner Fry , A: Tapajit Dey , A: Andrey Karnauch University of Tennessee Knoxville, A: Audris Mockus
Pre-print Media Attached
17:21
8m
Live Q&A
20-MAD - 20 years of issues and commits of Mozilla and Apache DevelopmentMSR - Data Showcase
Data Showcase
A: Maëlick Claes University of Oulu, A: Mika Mäntylä University of Oulu
Media Attached