Clone Detection and Benchmarking in Big Code (NJR 2018)

Sun 4 - Fri 9 November 2018 Boston, Massachusetts, United States

Track

NJR 2018

Time Zone

The program is currently displayed in (GMT-05:00) Guadalajara, Mexico City, Monterrey.

Use conference time zone: (GMT-05:00) Guadalajara, Mexico City, MonterreySelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 6 Nov 2018 16:30 - 17:00 at Newbury - IV

Abstract

Copying a code fragment and then reusing it by pasting and adapting (e.g., adding/modifying/deleting statements) is a common practice in software development, resulted in a significant amount of duplicated code in software systems. On the other hand, duplicated code poses a number of threats to the maintenance of software systems such as clones are the #1 “bad smell” in Fowler’s refactoring list. Software clones are thus considered to be one of the major contributors to the high software maintenance cost, which could be up to 80% of the total software development cost. The era of Big Data has introduced new applications for clone detection. For example, clone detection has been used to find similar mobile applications, to intelligently tag code snippets, to identify code examples, and so on from large inter-project repositories. The dual role of clones in software development and maintenance, along with these many emerging new applications of clone detection, has led to a great many clone detection tools and analysis frameworks. In this talk, I will outline our experience in developing clone detection tools from large-scale inter-projects code repositories using even a desktop machine with standard hardware configurations. I will then also talk about how do we evaluate such large-scale clone detection tools using our BigCloneBench, a clone benchmark of more than eight million manually validated clone pairs in 25 thousand Java projects.

Time Zone

The program is currently displayed in (GMT-05:00) Guadalajara, Mexico City, Monterrey.

Use conference time zone: (GMT-05:00) Guadalajara, Mexico City, MonterreySelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 6 Nov
Displayed time zone: Guadalajara, Mexico City, Monterrey change

15:30 - 17:00	IVNJR at Newbury

15:30 30m Talk		Decompiling Ethereum Bytecode and Detecting Gas-Focused Vulnerabilities NJR Yannis Smaragdakis University of Athens
16:00 30m Talk		SWAN: A Program Analysis Framework for Swift NJR Karim Ali University of Alberta
16:30 30m Talk		Clone Detection and Benchmarking in Big Code NJR Chanchal K. Roy University of Saskatchewan

Clone Detection and Benchmarking in Big Code

Tue 6 Nov
Displayed time zone: Guadalajara, Mexico City, Monterrey change

Chanchal K. Roy

University of Saskatchewan

Tracks

Co-hosted Conferences

Workshops

Co-hosted Symposia

Clone Detection and Benchmarking in Big Code

Program Display Configuration

Program Display Configuration

Tue 6 NovDisplayed time zone: Guadalajara, Mexico City, Monterrey change

Chanchal K. Roy

University of Saskatchewan

Tue 6 Nov
Displayed time zone: Guadalajara, Mexico City, Monterrey change