<div id="top"></div>
<!--
*** Thanks for checking out the Best-README-Template. If you have a suggestion
*** that would make this better, please fork the repo and create a pull request
*** or simply open an issue with the tag "enhancement".
*** Don't forget to give the project a star!
*** Thanks again! Now go create something AMAZING! :D
-->
<!-- PROJECT SHIELDS -->
<!--
*** I'm using markdown "reference style" links for readability.
*** Reference links are enclosed in brackets [ ] instead of parentheses ( ).
*** See the bottom of this document for the declaration of the reference variables
*** for contributors-url, forks-url, etc. This is an optional, concise syntax you may use.
*** https://www.markdownguide.org/basic-syntax/#reference-style-links
-->
<!--
***[![MIT License][license-shield]][license-url]
-->
<!-- PROJECT LOGO -->
<br />
<div align="center">
<a href="https://github.com/openreasoner/openr/">
<img src="figure/openr_logo.png" alt="Logo" width="200">
</a>
<h1 align="center" style="font-size: 30px;"><strong><em>OpenR</em></strong>: An Open Source Framework for Advanced Reasoning with Large Language Models</h1>
<p align="center">
<a href="https://arxiv.org/abs/2410.09671">Paper</a>
·
<a href="https://github.com/openreasoner/openr/blob/main/reports/Tutorial-LLM-Reasoning-Wang.pdf">Tutorial</a>
·
<a href="https://github.com/openreasoner/openr">Code</a>
·
<a href="https://openreasoner.github.io/">Docs</a>
·
<a href="https://huggingface.co/datasets/openreasoner/MATH-APS">Data</a>
·
<a href="https://huggingface.co/openreasoner/Math-psa">Model</a>
·
<a href="https://github.com/openreasoner/openr/issues">Issue</a>
·
<a href="https://www.modelscope.cn/studios/modelscope/OpenR_Inference">Demo</a>
</p>
<p align="center">
[ <a href="https://github.com/openreasoner/openr/blob/main/README.md">English</a> ][ <a href="https://github.com/openreasoner/openr/blob/main/README_zh.md">中文</a> ]
</p>
</div>
[![Contributors][contributors-shield]][contributors-url]
[![Issues][issues-shield]][issues-url]
[![Forks][forks-shield]][forks-url]
[![Stars][stars-shield]][stars-url]

<!-- TABLE OF CONTENTS -->
<details>
<summary><span style="font-size: 1.5em;"><strong>Table of Contents</strong> 📖 </span></summary>
<ol>
<li><a href="#news-and-updates">News and Updates</a></li>
<li><a href="#features">Features</a></li>
<li><a href="#todo">TODO</a></li>
<li><a href="#benchmark">Benchmark</a></li>
<li><a href="#plots">Plots</a></li>
<li><a href="#provided-datasets-and-models">Datasets and Models</a></li>
<li>
<a href="#getting-started">Getting Started</a>
<ul>
<li><a href="#installation">Installation</a></li>
<li><a href="#quickstart">Quick Start</a></li>
</ul>
</li>
<li><a href="#usage">Usage</a></li>
<li><a href="#join-us">Join Us</a></li>
<li><a href="#contact">Contact</a></li>
<li><a href="#response-examples">Response Examples</a></li>
<li><a href="#community">Community</a></li>
<li><a href="#reference">Reference</a></li>
</ol>
</details>
<!-- News and Updates -->
## News and Updates

- [29/11/2024] We have added a demo page on ModelScope. Many thanks to @wangxingjun778!
- [24/10/2024] OpenR now supports MCTS reasoning (#24)! 🌲
- [15/10/2024] Our report is on arXiv!
- [12/10/2024] OpenR has been released! 🚀
## Features
<p align="center">
<img src="./figure/logo_text.png" alt="Description" style="width: 300px; margin-left: 50px; float: right;">
</p>
| Feature | Contents |
|---------|----------|
| ✅ Process-supervision Data Generation | - OmegaPRM: Improve Mathematical Reasoning in Language Models by Automated Process Supervision |
| ✅ Online Policy Training | - RL Training: APPO, GRPO, TPPO |
| ✅ Generative and Discriminative PRM Training | - PRM Training: Supervised Training for PRMs<br>- Generative RM Training: Direct GenRM |
| ✅ Multiple Search Strategies | - Greedy Search<br>- Best-of-N<br>- Beam Search<br>- MCTS<br>- rStar: Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers<br>- Critic-MCTS: Under Review |
| ✅ Test-time Computation and Scaling Law | TBA, see Benchmark |
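As a toy illustration of the search strategies above, the simplest one, Best-of-N, samples several candidate solutions and keeps the one a reward model scores highest. The sketch below uses placeholder `generate` and `reward` functions, not OpenR's actual API; in OpenR the scoring role is played by a trained PRM.

```python
# Minimal Best-of-N sketch. `generate` and `reward` are hypothetical
# stand-ins for an LLM sampling call and a process/outcome reward model.
import random


def generate(prompt: str) -> str:
    """Stand-in for sampling one candidate solution from an LLM."""
    return f"{prompt} -> answer {random.randint(0, 9)}"


def reward(solution: str) -> float:
    """Stand-in for a reward-model score (toy: the final digit)."""
    return float(solution.split()[-1])


def best_of_n(prompt: str, n: int = 8) -> str:
    """Sample n candidates and return the highest-scoring one."""
    candidates = [generate(prompt) for _ in range(n)]
    return max(candidates, key=reward)


print(best_of_n("What is 2+3?"))
```

Beam search and MCTS refine the same idea by scoring partial reasoning steps rather than only complete solutions, which is where process reward models become essential.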
## TODO

| Feature | TODO (<span style="color:red;">High Priority</span>, we value your contribution!) |
|---------|----------|
| 👨‍💻 Data | - Re-implement Journey Learning |
| 👨‍💻 RL Training | - Distributed Training<br/>- Reinforcement Fine-Tuning (RFT) #80 |
| 👨‍💻 PRM | - Larger-scale training<br>- GenRM-CoT implementation<br/>- Soft-label training #57 |
| 👨‍💻 Reasoning | - Optimize code structure #53<br>- More reasoning tasks (AIME, etc.) #53<br>- Multi-modal reasoning #82<br>- Reasoning in code generation #68<br/>- Dots #75<br/>- Consistency check<br/>- Benchmarking |
## Benchmark

See Benchmark!
## Plots
<p align