29 skills found
OpenRLHF / OpenRLHFAn Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
BytedTsinghua-SIA / DAPOAn Open-source RL System from ByteDance Seed and Tsinghua AIR
WangJingyao07 / Awesome GRPOCodebase of GRPO: Implementations and Resources of GRPO and Its Variants
opendilab / LightRFTLightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework
saikiranrallabandi / InframindInfraMind: Fine-tuning toolkit for training SLMs on Infrastructure-as-Code using GRPO/DAPO. Achieves 97.3% accuracy on IaC generation.
Ruijian-Zha / FinRL DAPO SR🚀 A New DAPO Algorithm for Stock Trading (arXiv:2505.06408) Implementation of our IEEE IDS 2025 accepted algorithm combining Dynamic Sampling Policy Optimization (DAPO), Group Relative Policy Optimization (GRPO), and LLM-driven risk/sentiment signals for efficient and profitable stock trading on the NASDAQ-100 index.
komi22 / DAPOZZero Trust Integrated Security Solution
mbzuai-oryx / MediX R1Open Ended Medical Reinforcement Learning
lns / DapoSource code for the paper "Divergence-Augmented Policy Optimization"
Polygon Painter for Low-Poly style 3D Models. Plugin for Unity.
MystenLabs / DapolDAPOL+ Proof of Liabilities using Bulletproofs and Sparse Merkle trees
KulunuOS / 6DAPose6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation
egin10 / Dapodikscraping data sekolah dari web dapodik (Data Refrensi) : https://referensi.data.kemdikbud.go.id/index11.php
TeenLucifer / Dapo ReproduceNo description available
Yinghui-Li-New / DAPoinTrNo description available
DarkAngel7 / UINavigationController DAPowerfulCustomizationA category to expand UINavigationController, UINavigationItem and UIViewController. You can customization UINavigationBar for each view controller and enjoy your life.
boschresearch / Defect Aware Prompt OptimizationAccompanying code for paper "DAPO: Defect-aware Hybrid Prompt Optimization via Progressive Tuning for Zero-Shot Multi-type Anomaly Detection and Segmentation"
putradimas / Dapodik SDKUnofficial Dapodik SDK for PHP
dapodix / DapodikSDK python untuk aplikasi dapodik.
myaser / DAPOSDialectal Arabic Part Of Speech Tagger