HTransRL
Hybrid Transformer based Multi-agent Reinforcement Learning (HTransRL) is a method for drone coordination in air corridors. It addresses state inputs whose dimensions and types change dynamically, which cannot be handled by traditional MARL.
Install / Use
/learn @SECNetLabUNM/HTransRL

README
Hybrid Transformer based Multi-agent Reinforcement Learning for Multiple Unmanned Aerial Vehicle Coordination in Air Corridors
Modeling
Air Corridor, Cylinder and Torus
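As an illustration of this geometry, membership in a cylinder or torus corridor segment can be tested with simple distance checks. This is a minimal pure-Python sketch, not the repository's model: the function names and the origin-centered, z-aligned placement are assumptions.

```python
import math

def in_cylinder(p, radius, length):
    """Is point p = (x, y, z) inside a cylinder centered at the origin,
    with its axis along z, given radius, and given length? (assumed placement)"""
    x, y, z = p
    return math.hypot(x, y) <= radius and abs(z) <= length / 2

def in_torus(p, major_r, minor_r):
    """Is point p inside a torus centered at the origin in the xy-plane,
    with major radius major_r (ring of tube centers) and tube radius minor_r?"""
    x, y, z = p
    ring_dist = math.hypot(x, y) - major_r  # distance from the tube-center ring, in-plane
    return math.hypot(ring_dist, z) <= minor_r
```

A corridor sequence such as cttc would then be a chain of such segments, with transfers occurring where a cylinder meets a torus.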

Animation
cttc, 1-transfer
4 air corridors (cylinder-torus-torus-cylinder); 12 UAVs; 4 static and 3 mobile

cttcttcttc, 3-transfer
10 air corridors (cylinder-torus-torus-cylinder-torus-torus-cylinder-torus-torus-cylinder); 12 UAVs; 4 static and 3 mobile

RL Training
Network Structure
- The embedding network normalizes input values and standardizes input dimensions.
- The Transformer processes information from a dynamically changing set of neighbors using encoders and decoders.
- The actor-critic network outputs the estimated state value and a stochastic action in spherical coordinates.
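The three stages above can be sketched end to end. This is an illustrative NumPy forward pass under stated assumptions, not the repository's implementation: the feature sizes, weight shapes, and squashing functions are all hypothetical, and the real model presumably uses learned PyTorch modules.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 16  # shared embedding dimension (assumption)

def embed(x, w):
    """Embedding stage: normalize a raw feature vector and project it to D
    dims, so inputs of different lengths and types share one dimension."""
    x = (x - x.mean()) / (x.std() + 1e-8)
    return x @ w  # w has shape (len(x), D)

def attention(q, K, V):
    """Scaled dot-product attention over a variable number of neighbor rows,
    standing in for the Transformer encoder/decoder stage."""
    scores = K @ q / np.sqrt(D)
    w = np.exp(scores - scores.max())
    w /= w.sum()
    return w @ V

# Hypothetical feature sizes: own state has 6 features, each neighbor 4.
w_self = rng.normal(size=(6, D))
w_nbr = rng.normal(size=(4, D))
own = embed(rng.normal(size=6), w_self)
neighbors = np.stack([embed(rng.normal(size=4), w_nbr) for _ in range(3)])
ctx = attention(own, neighbors, neighbors)       # 3 neighbors...
ctx_one = attention(own, neighbors[:1], neighbors[:1])  # ...or 1: same code

# Actor-critic heads on the concatenated representation.
h = np.concatenate([own, ctx])
w_v = rng.normal(size=(2 * D,))
w_a = rng.normal(size=(2 * D, 3))
value = h @ w_v   # critic: scalar state-value estimate
a = h @ w_a       # actor head (pre-squash action parameters)

# Squash into spherical coordinates: radius >= 0, polar in [0, pi],
# azimuth in [-pi, pi] (choice of squashing functions is an assumption).
r = np.log1p(np.exp(a[0]))            # softplus
polar = np.pi / (1 + np.exp(-a[1]))   # pi * sigmoid
azimuth = np.pi * np.tanh(a[2])
```

Because attention reduces any number of neighbor rows to one fixed-size context vector, the downstream actor-critic heads never see a change in input dimension.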

Training Files
Train a single parameter set: main.py
Train a batch via parameter grid search: batched_grid_search.sh
Actor and critic models are saved every 0.25 million steps. Training progress is visualized via terminal logs and TensorBoard.
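The checkpoint cadence can be sketched as follows. This is a minimal illustration: SAVE_EVERY, maybe_save, and save_fn are hypothetical names, and save_fn stands in for whatever actually serializes the actor and critic (e.g. a torch.save call in the real code).

```python
SAVE_EVERY = 250_000  # 0.25 million steps, per the text

def maybe_save(step, save_fn):
    """Invoke save_fn(step) every SAVE_EVERY training steps.

    save_fn is a callback that persists the actor/critic models;
    returns True when a checkpoint was written at this step.
    """
    if step > 0 and step % SAVE_EVERY == 0:
        save_fn(step)
        return True
    return False
```

Called once per training step inside the loop, this yields checkpoints at steps 250k, 500k, 750k, and so on.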
