MSR 2020
Mon 29 - Tue 30 June 2020
co-located with ICSE 2020
Tue 30 Jun 2020 16:00 - 16:15 at MSR:Zoom2 - Visions & Reflections Chair(s): Venera Arnaoudova

In the last few years, artificial intelligence (AI) and machine learning (ML) have become ubiquitous terms. These powerful techniques have escaped obscurity in academic communities with the recent onslaught of AI & ML tools, frameworks, and libraries that make these techniques accessible to a wider audience of developers. As a result, applying AI & ML to solve existing and emergent problems is an increasingly popular practice. However, little is known about this domain from the software engineering perspective. Many AI & ML tools and applications are open source, and hosted on platforms such as GitHub that provide rich tools for large-scale distributed software development. Despite widespread use and popularity, these repositories have never been examined as a community to identify unique properties, development patterns, and trends.

In this paper, we conducted a large-scale empirical study of AI & ML Tool (700) and Application (4,524) repositories hosted on GitHub to develop such a characterization. To compare this community to the wider population of repositories, we compare our analyses to 4,101 unrelated repositories. We enhance this characterization with an elaborate study of developer workflow that measures collaboration and autonomy within a repository. We’ve captured key insights of this community’s 10 year history such as it’s primary language (Python) and most popular repositories (Tensorflow, Tesseract). Our findings show the AI & ML community has unique characteristics that should be accounted for in future research.

Tue 30 Jun

Displayed time zone: (UTC) Coordinated Universal Time change

16:00 - 17:00
Visions & ReflectionsTechnical Papers / Registered Reports / Keynote / MSR Awards / FOSS Award / Education / Data Showcase / Mining Challenge / MSR Challenge Proposals / Ask Me Anything at MSR:Zoom2
Chair(s): Venera Arnaoudova Washington State University

Q/A & Discussion of Session Papers over Zoom (Joining info available on Slack)

16:00
15m
Live Q&A
The State of the ML-universe: 10 Years of Artificial Intelligence & Machine Learning Software Development on GitHubMSR - Technical Paper
Technical Papers
Danielle Gonzalez Rochester Institute of Technology, USA, Thomas Zimmermann Microsoft Research, Nachiappan Nagappan Microsoft Research
DOI Pre-print Media Attached
16:15
15m
Live Q&A
Ethical Mining – A Case Study on MSR Mining ChallengesACM SIGSOFT Distinguished Paper AwardMSR - Technical Paper
Technical Papers
Nicolas Gold University College London, Jens Krinke University College London
DOI Pre-print Media Attached
16:30
15m
Live Q&A
From Innovations to Prospects: What Is Hidden Behind Cryptocurrencies?MSR - Technical Paper
Technical Papers
Ang Jia Xi'an Jiaotong University, Ming Fan Xi'an Jiaotong University, Xi Xu , Di Cui Xi'an Jiaotong University, Wenying Wei , Zijiang Yang Western Michigan University, Kai Ye , Ting Liu Xi'an Jiaotong University
DOI Pre-print Media Attached
16:45
15m
Live Q&A
What constitutes Software? An Empirical, Descriptive Study of ArtifactsMSR - Technical Paper
Technical Papers
Pre-print Media Attached