HLCE
(EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam
Install / Use
/learn @Humanity-s-Last-Code-Exam/HLCEREADME
Why Do We Need HLCE?
With the increasing capabilities of LLMs, many benchmarks have become too easy!

News
August 21, 2025 🎉Congratulations on our paper HLCE being accepted by EMNLP 2025.
🛠️Dataset Usage
- You can download dataset via this link: https://huggingface.co/HumanLastCodeExam
🔮Dataset Evaluation
Prerequisites
- Python 3.8 or higher
- Git
Setup Instructions
-
Clone the repository:
git clone git@github.com:Humanity-s-Last-Code-Exam/HLCE.git cd HLCE -
Install the package and its dependencies:
pip install -e .
-
For IOI, kindly follow these instructions to obtain the definitive evaluation results.
-
For ICPC-World-Finals,kindly follow these instructions to obtain the definitive evaluation results.
📊 Leaderboard
- If you wish to submit your model to the leaderboard, please follow the instructions.
💾Citation
@misc{li2025humanityscodeexamadvanced,
title={Humanity's Last Code Exam: Can Advanced LLMs Conquer Human's Hardest Code Competition?},
author={Xiangyang Li and Xiaopeng Li and Kuicai Dong and Quanhu Zhang and Rongju Ruan and Xinyi Dai and Xiaoshuang Liu and Shengchun Xu and Yasheng Wang and Ruiming Tang},
year={2025},
eprint={2506.12713},
archivePrefix={arXiv},
primaryClass={cs.SE},
url={https://arxiv.org/abs/2506.12713},
}
📄 License
Usage and License Notices: The data and code are intended and licensed for research use only.
Related Skills
node-connect
343.1kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
90.0kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
343.1kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
343.1kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
