Using Large-Scale Anomaly Detection on Code to Improve Kotlin Compiler (MSR 2020 - Technical Papers)

Who

Timofey Bryksin, Victor Petukhov, Ilya Alexin, Stanislav Prikhodko, Alexey Shpilman, Vladimir Kovalenko, Nikita Povarov

Track

MSR 2020 Technical Papers

When

Mon 29 Jun 2020 10:42 - 10:48 (UTC) at MSR:Zoom - Programming Languages & Models. Chair(s): Dimitris Kolovos

Abstract

In this work, we apply anomaly detection to source code and bytecode to facilitate the development of a programming language and its compiler. We define an anomaly as a code fragment that is different from typical code written in a particular programming language. Identifying such code fragments is beneficial to both language developers and end users, since anomalies may indicate potential issues with the compiler or with runtime performance. Moreover, anomalies could correspond to problems in language design. For this study, we choose Kotlin as the target programming language. We outline and discuss approaches to obtaining vector representations of source code and bytecode and to detecting anomalies across vectorized code snippets. The paper presents a method that aims to detect two types of anomalies: syntax tree anomalies and so-called compiler-induced anomalies that arise only in the compiled bytecode. We describe several experiments that employ different combinations of vectorization and anomaly detection techniques, and discuss the types of detected anomalies and their usefulness for language developers. We demonstrate that the extracted anomalies and the underlying extraction technique provide additional value for language development.
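The general idea in the abstract (vectorize code fragments, then flag the ones that sit far from typical code) can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not name the vectorization or detection techniques used, so the sketch assumes a simple bag of syntax-tree node types and a distance-from-centroid score. The node-type names, the function names, and the threshold parameter `k` are all hypothetical.

```python
from collections import Counter
from math import sqrt
from statistics import mean, pstdev


def vectorize(snippets):
    """Turn each snippet (a list of node-type names) into a count vector.

    Assumption: snippets were already parsed into node-type names elsewhere.
    """
    vocab = sorted({name for s in snippets for name in s})
    vectors = []
    for s in snippets:
        counts = Counter(s)
        vectors.append([counts[name] for name in vocab])
    return vocab, vectors


def anomaly_scores(vectors):
    """Score each vector by its Euclidean distance from the centroid."""
    dims = len(vectors[0])
    centroid = [mean(v[i] for v in vectors) for i in range(dims)]
    return [
        sqrt(sum((v[i] - centroid[i]) ** 2 for i in range(dims)))
        for v in vectors
    ]


def find_anomalies(snippets, k=2.0):
    """Return indices of snippets whose score exceeds mean + k * std."""
    _, vectors = vectorize(snippets)
    scores = anomaly_scores(vectors)
    threshold = mean(scores) + k * pstdev(scores)
    return [i for i, s in enumerate(scores) if s > threshold]


# Hypothetical data: nine ordinary functions and one with 40 'WHEN' nodes.
typical = [["FUN", "CALL", "RETURN"] for _ in range(9)]
unusual = [["FUN"] + ["WHEN"] * 40]
flagged = find_anomalies(typical + unusual)
```

On this toy input, only the last snippet is flagged. A real pipeline would need a Kotlin parser or bytecode reader to produce the node-type lists, and likely a more robust detector than a single global threshold.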

Link to Preprint

https://arxiv.org/abs/2004.01618

Timofey Bryksin

JetBrains Research, Saint Petersburg State University

Russia

Victor Petukhov

JetBrains, ITMO University

Russia

Ilya Alexin

Stanislav Prikhodko

Alexey Shpilman

Vladimir Kovalenko

TU Delft

Netherlands

Nikita Povarov

JetBrains
