Paper page - MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8

We introduce 🤗 MigrationBench dataset, a benchmark dataset tailored for repository-level code migration, specifically targeting java 8 to 17 or other long-term support versions.

1. Dataset

MigrationBench comprises a large-scale collection of GitHub repositories, organized into three subsets:

🤗 AmazonScience/migration-bench-java-full contains 5,102 repos
Each repo has a test directory or at least one test case

🤗 AmazonScience/migration-bench-java-selected with 300 repos
A curated subset of 🤗 migration-bench-java-full

🤗 AmazonScience/migration-bench-java-utg has 4,814 repos
The unit test generation (utg) dataset, disjoint with 🤗 migration-bench-java-full

2. Evaluation Framework

To enable standardized and rigorous evaluation of LLM performance on this complex task, we provide a comprehensive open-source evaluation framework, available at: https://github.com/amazon-science/MigrationBench.

3. Baseline: Code Migration with LLMs

Inspired by Teaching Large Language Models to Self-Debug, we introduce SD-Feedback and demonstrate that LLMs can effectively tackle repository-level code migration from java 8 to 17.

On the selected subset using Claude-3.5-Sonnet-v2, SD-Feedback achieves 62.33% and 27.33% success rate (pass@1) for minimal and maximal migration respectively.

Source link

What's Hot

Paper page – EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Stability AI is working on a licensing marketplace for creators

Alibaba’s Qwen-MT Promises Smarter, Cheaper Translations Across 92 Languages

Paper page – MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8

Paper page – EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Paper page – Hierarchical Budget Policy Optimization for Adaptive Reasoning

Paper page – DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis

Artist Loses Final Appeal in Case of Apologising for ‘Fishrot Scandal’

US Appeals Court Overturns $8.8 M. Trademark Judgement For Yuga Labs

Old Masters ‘Making a Comeback’ in London: Morning Links

Bill Proposed To Apply Anti-Money Laundering Regulations to Art Market

Paper page – EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Stability AI is working on a licensing marketplace for creators

Alibaba’s Qwen-MT Promises Smarter, Cheaper Translations Across 92 Languages

What's Hot

Paper page – MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8

1. Dataset

2. Evaluation Framework

3. Baseline: Code Migration with LLMs

Related Posts

Subscribe to Updates