karpathy / Autoresearch: AI agents running research on single-GPU nanochat training automatically
Unity-Technologies / Ml Agents: The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
OpenPipe / ART: Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!
activeloopai / Deeplake: Deeplake is an AI data runtime for agents. It provides serverless Postgres with a multimodal data lake, enabling scalable retrieval and training.
tensortrade-org / Tensortrade: An open source reinforcement learning framework for training, evaluating, and deploying robust trading agents.
DLR-RM / Rl Baselines3 Zoo: A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Yvictor / TradingGym: A trading and backtesting environment for training reinforcement learning agents or simple rule-based algorithms.
langfengQ / Verl Agent: verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".
openai / Neural Mmo: Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
microsoft / TextWorld: TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
opendilab / DI Star: An artificial intelligence platform for StarCraft II with large-scale distributed training and grandmaster-level agents.
AgentR1 / Agent R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
araffin / Rl Baselines Zoo: A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
huawei-noah / SMARTS: Scalable Multi-Agent RL Training School for Autonomous Driving
JudgmentLabs / Judgeval: The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
BytedTsinghua-SIA / MemAgent: A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
pat-jj / S3: [EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
OpenBMB / AgentCPM: An End-to-End Infrastructure for Training and Evaluating Various LLM Agents
mila-iqia / Babyai: BabyAI platform. A testbed for training agents to understand and execute language commands.
pat-jj / DeepRetrieval: [COLM'25] DeepRetrieval - 🔥 Training Search Agent by RLVR with Retrieval Outcome