AirGym

A high-performance drone deep reinforcement learning platform built upon IsaacGym.

Generate Convert Improve

Install / Use

/learn @emNavi/AirGym

About this skill

Quality Score

0/100

README

GitHub License GitHub Issues GitHub Release <img src="https://img.shields.io/badge/PyTorch-2.0%2B-EE4C2C?logo=pytorch">

<div align="center"> <img src="doc/logo2.png" width="300" style="margin-bottom: 20px;"> A High-Performance Quadrotor Deep Reinforcement Learning Platform Built on NVIDIA IsaacGym </div>

Changes Reminder

v0.5.1 released on 2025.09.27: Fix bugs: planning task resets at start when loading trained model planning_cnn_rate.pth.

v0.5.0 released on 2025.07.21: Update VAE model.

v0.4.0 released on 2025.04.21: Update reward functions for planning. Upload pretrained model for task planning.

v0.3.0 released on 2025.04.16: Bugs fixed on environments registry, and tests had done on every tasks.

~~v0.2.0 released on 2025.04.05~~: This version has bugs! Re-inplement the asset manager part to make it clear and efficient. Split the algorithm outside the /airgym.

v0.1.0dev released: first release.

<details> <summary style="display: list-item; cursor: pointer; text-decoration: none"> <h2 style="display: inline; border-bottom: none">Table of Contents</h2> </summary>

AirGym
Features
Environments

Hovering
Tracking
Balloon (Target Reaching)
Avoid (Dynamic Obstacle)
Planning
Customized Env

Installation

Requirements
One-shot Install
Test the Installation

Usage

Directory Structure Description
Training from Scratch
Playing and Testing
Customize a New Task

TODO
FAQ
License
Acknowledgements
Citation

</details>

AirGym

AirGym is an open souce Python quadrotor simulator based on IsaacGym, a part of AirGym series Sim-to-Real workding flow. It provides a high-fidelity dynamics and Deep Reinforcement Learning (DRL) framework for quadrotor robot learning research. Furthermore, we also provide toolkits for transferring policy from AirGym simulator to the real quadrotor emNavi-X152b, making Sim-to-Real possible. The documentation website is at https://emnavi.tech/AirGym/.

The Sim-to-Real working flow of AirGym series is broken down into four parts:

AirGym: the quadrotor simulation platform described in this repository, providing environments and basic training algorithm.
rlPx4Controller: a flight geometric controller that maintains strict control logic alignment with the open-source PX4, for a better Sim-to-Real.
AirGym-Real: an onboard Sim-to-Real module compatible with AirGym, enabling direct loading of pretrained models, supporting onboard visual-inertial pose estimation, ROS topic publishing, and one-shot scripted deployment.
control_for_gym: a middleware layer based on MAVROS for forwarding control commands at various levels to PX4 autopilot. It includes a finite state machine to facilitate switching between DRL models and traditional control algorithms.

Features

AirGym is more lightweight and has a clearer file structure compared to other simulators, because it was designed from the beginning with the goal of achieving Sim-to-Real transfer.

Lightweight & Customizable: AirGym is extremely lightweight yet highly extensible, allowing you to quickly set up your own customized training task.
Strict Alignment with PX4 Logic: Flight control in AirGym is supported by rlPx4Controller. It maintains strict control logic alignment with the open-source PX4, for a better Sim-to-Real.
Multiple Control Modes: AirGym provides various control modes including PY (position & yaw), LV (linear velocity & yaw), CTA (collective thrust & attitude angle), CTBR (collective thrust & body rate), SRT (single-rotor thrust).
Sim-to-Real Toolkits: AirGym series have complete flow of robot learning Sim-to-Real and provide a potential to transfer well-trained policies to a physical device.

Environments

AirGym provides four basic tasks Hovering, Balloon, Tracking, Avoid, and a higher level task Planning. All tasks are implemented on X152b quadrotor frame since this is our Sim-to-Real device.

Hovering

Task Hovering: the quadrotor is expected to be initialized randomly inside a cube with a side length of 2 meters, then converge into the center and hover until the end of the episode. Also, this task can be used as "go to waypoint" task if specify a target waypoint.

Balloon

Task Balloon: also called target reaching. It is essentially a variation of the hovering task, but with a key difference: the quadrotor moves at a higher speed, rapidly dashing toward the balloon (the target).

Tracking

Task Tracking: tracking a sequance of waypoints which is played as a trajectory. The tracking speed is effected by the trajectory playing speed.

Avoid

Task Avoid: hover and try to avoid a cube or a ball with random throwing velocity and angle. This task provides depth image as input.

Planning

Task Planning: a drone navigates and flies through a random generated woods like area, using only depth information as input. No global information is utilized in this task which means a better adaptation in a GNSS-deny environment and without a VIO.

Customized Environments

Furthermore, you can build a customized environment and even task upon AirGym. Here is a simple demo of random assets generation.

Installation

Requirements

Ubuntu 20.04
Conda or Miniconda

Note: this repository has been tested on Ubuntu 20.04 PyTorch 2.0.0 + CUDA11.8 with RTX4090.

One-shot Install

Run configuration.sh to download and install NVIDIA Isaac Gym Preview 4 and rlPx4Controller at ~, and create a new conda environment named airgym. Note that sudo is required to install apt package:

chmod +x configuration.sh
./configuration.sh

Test the Installation

You can run the example script which is a quadrotor position control illustration of 4096 parallel environments:

conda activate airgym
python airgym/scripts/example.py --task hovering --ctl_mode pos --num_envs 4

The default ctl_mode is position control.

Usage

Directory Structure Parsing

AirGym/
│
├── airgym/                                      # source code
│   ├── envs/                                    # envs definition
│   │    ├── base/                               # base envs
│   │    └── task/                               # tasks
│   │          ├── planning.py                   # task env file
│   │          ├── planning_config.py            # task env config file
│   │          ├── ...
│   │           ...
│   │
│   ├── assets/                                  # assets
│   │     ├── env_assets/                        # env assets
│   │     └── robots/                            # robot assets
│   ├── scripts/                                 # debug scripts
│   └── utils/                                   # others
│
├── doc/                                         # doc assets
│
├── lib/                                         # RL algorithm
│
├── scripts/                                     # program scipts
│   ├── config/                                  # config file for training
│   ├── runner.py                                # program entry
│   └── ...                                      
│
└── README.md                                    # introduction

Training from Scratch

We train the model by a rl_games liked customed PPO (rl_games was discarded after the version 0.0.1beta because of its complexity for use). The training algorithm is at airgym/lib modified from rl_games-1.6.1. Of course you can use any other RL libs for training.

Training from scratch can be down within minites. The

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

last30days-skill

16.5k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

sec-edgar-agentkit

AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.

emNavi

View profile

View on GitHub

GitHub Stars140

CategoryEducation

Updated3d ago

Forks12

emNavi/AirGym

Languages

Python

Security Score

100/100

Audited on Mar 27, 2026

No findings

AirGym

Install / Use

README

Changes Reminder

AirGym

Features

Environments

<font face='courier new'>Hovering</font>

<font face='courier new'>Balloon</font>

<font face='courier new'>Tracking</font>

<font face='courier new'>Avoid</font>

<font face='courier new'>Planning</font>

Customized Environments

Installation

Requirements

One-shot Install

Test the Installation

Usage

Directory Structure Parsing

Training from Scratch

Related Skills