SkillAgentSearch skills...

Repairbench

Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/

Install / Use

/learn @ASSERT-KTH/Repairbench
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

RepairBench

Leaderboard of frontier models for program repair.

If you use RepairBench, please cite:

@inproceedings{repairbench,
  title={RepairBench: Leaderboard of Frontier Models for Program Repair}, 
  author={André Silva and Martin Monperrus},
  booktitle = {IEEE/ACM International Workshop on Large Language Models for Code (LLM4Code)},
  year={2025},
  url={https://arxiv.org/abs/2409.18952}, 
  doi = {10.1109/LLM4Code66737.2025.00006}
}

For the code to reproduce the benchmark, please refer to https://github.com/ASSERT-KTH/repairbench-framework

Structure

  • results includes all prompts, patches, and evaluation results
  • scripts contains scripts used to parse results into other formats
  • website contains the leaderboard's website code
View on GitHub
GitHub Stars11
CategoryDevelopment
Updated5mo ago
Forks2

Languages

Jupyter Notebook

Security Score

72/100

Audited on Oct 26, 2025

No findings