SkillAgentSearch skills...

Monoforce

[IROS 2024] [ICML 2024 Workshop Differentiable Almost Everything] MonoForce: Learnable Image-conditioned Physics Engine

Install / Use

/learn @ctu-vras/Monoforce

README

MonoForce: Learnable Image-conditioned Physics Engine

[!Note] An updated version is available at ctu-vras/fusionforce.

IROS-2024 Arxiv ICML-2024-Diff-XYZ

Video Video Video Poster Data

<img src="./monoforce/docs/imgs/qualitative_results_v2.jpg" width="800"/>

Robot-terrain interaction prediction from RGB camera images as input:

  • predicted trajectory,
  • terrain shape and properties,
  • interaction forces and contacts.

<img src="./monoforce/docs/imgs/examples/ramp_success.png" width="200"/> <img src="./monoforce/docs/imgs/examples/high_grass2.png" width="200"/> <img src="./monoforce/docs/imgs/examples/wall3.png" width="200"/> <img src="./monoforce/docs/imgs/examples/snow.png" width="200"/>

Examples of predicted trajectories and autonomous traversal through vegetation:

<p> <a href="https://www.youtube.com/watch?v=JGi-OzTBG1k"> <img src="./monoforce/docs/imgs/demo_oru.png" alt="video link" width="300"> </a> <a href="https://drive.google.com/file/d/1TTNTyqZnObtdE_PdCc2GprszphnE3hxS/view?usp=drive_link"> <img src="./monoforce/docs/imgs/park_navigation_video_teaser.jpg" alt="video link" width="500"> </a> </p>

Table of Contents

Running

<img src="./monoforce/docs/imgs/pipeline.png" width="800"/>

The MonoForce pipeline consists of the Terrain Encoder and the Physics Engine. Given input RGB images and cameras calibration the Terrain Encoder predicts terrain properties. Then the differentiable Physics Engine simulates robot trajectory and interaction forces on the predicted terrain for a provided control sequence. Refer to the monoforce/examples folder for implementation details.

Please run the following command to explore the MonoForce pipeline:

cd monoforce/
python scripts/run.py --img-paths IMG1_PATH IMG2_PATH ... IMGN_PATH --cameras CAM1 CAM2 ... CAMN --calibration-path CALIB_PATH

For example if you want to test the model with the provided images from the ROUGH dataset:

cd monoforce/scripts/
./run.sh

Please, refer to the installation instructions to download the pre-trained model weights.

ROS Integration

<img src="./monoforce/docs/imgs/monoforce.gif" width="800"/>

We provide a ROS nodes for both the trained Terrain Encoder model and the Differentiable Physics module. They are integrated into the launch file:

roslaunch monoforce monoforce.launch

Training

The following terrain properties are predicted by the model:

  • Elevation: the terrain shape.
  • Friction: the friction coefficient between the robot and the terrain.
  • Stiffness: the terrain stiffness.
  • Damping: the terrain damping.
<img src="./monoforce/docs/imgs/training.jpg" width="800"/>

An example of the predicted elevation and friction maps (projected to camera images):

<p> <a href="https://drive.google.com/file/d/15Uo82hwE_OiRHsuGd0-9qcvrYOXsosn0/view?usp=drive_link"> <img src="./monoforce/docs/imgs/friction_prediction_tradr.png" alt="video link" width="800"> </a> </p> One can see that the model predicts the friction map with higher values for road areas and with the smaller value for grass where the robot could have less traction.

To train the model, please run:

cd monoforce/scripts/
python train.py

Please refer to the train_friction_head_with_pretrained_terrain_encoder.ipynb notebook for the example of the terrain properties learning with the pretrained Terrain Encoder model and differentiable physics loss.

Navigation

Navigation method with MonoForce predicting terrain properties and possible robot trajectories from RGB images and control inputs. The package is used as robot-terrain interaction and path planning pipeline.

<p> <a href="https://drive.google.com/file/d/1mqKEh_3VHZo4kDcJXP572SD1BVw37hSf/view?usp=drive_link"> <img src="monoforce/docs/imgs/forest_navigation_video_teaser.png" alt="video link" width="800"> </a> </p>

We provide the differentiable physics model for robot-terrain interaction prediction:

  • Pytorch: The model is implemented in Pytorch. Please refer to the diff_physics.ipynb notebook for the example of the trajectory prediction.

Navigation consists of the following stages:

  • Terrain prediction: The Terrain Encoder is used to estimate terrain properties.
  • Trajectories simulation: The Physics Engine is used to shoot the robot trajectories.
  • Trajectory selection: The trajectory with the smallest cost based on robot-terrain interaction forces is selected.
  • Control: The robot is controlled to follow the selected trajectory.

Citation

Consider citing the papers if you find the work relevant to your research:

@inproceedings{agishev2024monoforce,
    title={MonoForce: Self-supervised Learning of Physics-informed Model for Predicting Robot-terrain Interaction},
    author={Ruslan Agishev and Karel Zimmermann and Vladimír Kubelka and Martin Pecka and Tomáš Svoboda},
    booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems - IROS},
    year={2024},
    eprint={2309.09007},
    archivePrefix={arXiv},
    primaryClass={cs.RO},
    url={https://arxiv.org/abs/2309.09007},
    doi={10.1109/IROS58592.2024.10801353},
}
@inproceedings{agishev2024endtoend,
    title={End-to-end Differentiable Model of Robot-terrain Interactions},
    author={Ruslan Agishev and Vladim{\'\i}r Kubelka and Martin Pecka and Tomas Svoboda and Karel Zimmermann},
    booktitle={ICML 2024 Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators},
    year={2024},
    url={https://openreview.net/forum?id=XuVysF8Aon}
}

Related Skills

View on GitHub
GitHub Stars90
CategoryEducation
Updated7d ago
Forks8

Languages

Jupyter Notebook

Security Score

100/100

Audited on Mar 25, 2026

No findings