We are working on creating a Normalized Java Resource (NJR) that will speed up innovation in the area of software tools. Those tools include security enhancers, bug finders, and code synthesizers, all of which can benefit greatly from access to Big Code. Our vision is a diverse collection of 100,000 normalized Java projects that is executable, scriptable, and searchable. The Java projects stem from the Sourcerer collection and we normalize their representation to enable large-scale processing with reproducible results. Such processing includes execution, static and dynamic analysis, scriptable interaction, and search for projects with specific dynamic characteristics. For each search of the collection, NJR returns both a file with Java projects and a container for a cloud service such as Amazon EC2. Thus, a researcher can run tools on those projects both locally and on a cloud service. Researchers will be both beneficiaries and contributors to NJR. They benefit from searching for Java projects that fit their need, and once their tools run on NJR, they contribute to an ever-increasing collection of measurements. Notice the powerful network effect: the more people run tools on NJR, the more data we get for search, and the more data we get for search, the more people will want to search and run on NJR.
Invited Talks
Call for Presentations
We welcome presentations on tools that may benefit from NJR, on efforts to create large collections of programs, and on experiences with existing collections.
Tue 6 NovDisplayed time zone: Guadalajara, Mexico City, Monterrey change
08:30 - 10:00 | |||
08:30 30mTalk | The DaCapo Benchmark Suite: A Methodological, Engineering, and Social Journey NJR Steve Blackburn Australian National University | ||
09:00 30mTalk | NJR: 100,000 Executable, Scriptable, and Searchable Java Programs NJR Jens Palsberg University of California, Los Angeles | ||
09:30 30mTalk | NJR discussion; Needs, Requirements, Wishes NJR Jens Palsberg University of California, Los Angeles |
10:30 - 12:00 | |||
10:30 30mTalk | Analyzing Code Repositories: A Case Study in Reproducibility NJR Jan Vitek Northeastern University | ||
11:00 30mTalk | Declarative Program Analysis and Big Code: Challenges and Opportunities NJR Mayur Naik University of Pennsylvania | ||
11:30 30mTalk | Towards Automating Precision Studies of Clone Detectors NJR Farima Farmahinifarahani University of California at Irvine, USA |
13:30 - 15:00 | |||
13:30 30mTalk | Delphi: Connecting Researchers to Enable Comparable Large-Scale Experiments in Program Analysis NJR Ben Hermann University of Paderborn Media Attached File Attached | ||
14:00 30mTalk | Statically Checking the Inconsistencies of Security Assumptions/Measures in Android Apps and Systems NJR Zhiyun Qian UC Riverside | ||
14:30 30mTalk | Qualitas Corpus Analysis NJR Craig Anslow Victoria University of Wellington |
15:30 - 17:00 | |||
15:30 30mTalk | Decompiling Ethereum Bytecode and Detecting Gas-Focused Vulnerabilities NJR Yannis Smaragdakis University of Athens | ||
16:00 30mTalk | SWAN: A Program Analysis Framework for Swift NJR Karim Ali University of Alberta | ||
16:30 30mTalk | Clone Detection and Benchmarking in Big Code NJR Chanchal K. Roy University of Saskatchewan |