Monoforce
[IROS 2024] [ICML 2024 Workshop Differentiable Almost Everything] MonoForce: Learnable Image-conditioned Physics Engine
Install / Use
/learn @ctu-vras/MonoforceREADME
MonoForce: Learnable Image-conditioned Physics Engine
<img src="./monoforce/docs/imgs/qualitative_results_v2.jpg" width="800"/>[!Note] An updated version is available at ctu-vras/fusionforce.
Robot-terrain interaction prediction from RGB camera images as input:
- predicted trajectory,
- terrain shape and properties,
- interaction forces and contacts.
<img src="./monoforce/docs/imgs/examples/ramp_success.png" width="200"/> <img src="./monoforce/docs/imgs/examples/high_grass2.png" width="200"/> <img src="./monoforce/docs/imgs/examples/wall3.png" width="200"/> <img src="./monoforce/docs/imgs/examples/snow.png" width="200"/>
Examples of predicted trajectories and autonomous traversal through vegetation:
<p> <a href="https://www.youtube.com/watch?v=JGi-OzTBG1k"> <img src="./monoforce/docs/imgs/demo_oru.png" alt="video link" width="300"> </a> <a href="https://drive.google.com/file/d/1TTNTyqZnObtdE_PdCc2GprszphnE3hxS/view?usp=drive_link"> <img src="./monoforce/docs/imgs/park_navigation_video_teaser.jpg" alt="video link" width="500"> </a> </p>Table of Contents
- Installation Instructions
- Data
- Terrain Encoder
- Differentiable Physics Engine
- Running
- Examples
- ROS Integration
- Training
- Navigation
- Citation
Running
<img src="./monoforce/docs/imgs/pipeline.png" width="800"/>The MonoForce pipeline consists of the Terrain Encoder and the Physics Engine. Given input RGB images and cameras calibration the Terrain Encoder predicts terrain properties. Then the differentiable Physics Engine simulates robot trajectory and interaction forces on the predicted terrain for a provided control sequence. Refer to the monoforce/examples folder for implementation details.
Please run the following command to explore the MonoForce pipeline:
cd monoforce/
python scripts/run.py --img-paths IMG1_PATH IMG2_PATH ... IMGN_PATH --cameras CAM1 CAM2 ... CAMN --calibration-path CALIB_PATH
For example if you want to test the model with the provided images from the ROUGH dataset:
cd monoforce/scripts/
./run.sh
Please, refer to the installation instructions to download the pre-trained model weights.
ROS Integration
<img src="./monoforce/docs/imgs/monoforce.gif" width="800"/>We provide a ROS nodes for both the trained Terrain Encoder model and the Differentiable Physics module. They are integrated into the launch file:
roslaunch monoforce monoforce.launch
Training
The following terrain properties are predicted by the model:
- Elevation: the terrain shape.
- Friction: the friction coefficient between the robot and the terrain.
- Stiffness: the terrain stiffness.
- Damping: the terrain damping.
An example of the predicted elevation and friction maps (projected to camera images):
<p> <a href="https://drive.google.com/file/d/15Uo82hwE_OiRHsuGd0-9qcvrYOXsosn0/view?usp=drive_link"> <img src="./monoforce/docs/imgs/friction_prediction_tradr.png" alt="video link" width="800"> </a> </p> One can see that the model predicts the friction map with higher values for road areas and with the smaller value for grass where the robot could have less traction.To train the model, please run:
cd monoforce/scripts/
python train.py
Please refer to the train_friction_head_with_pretrained_terrain_encoder.ipynb notebook for the example of the terrain properties learning with the pretrained Terrain Encoder model and differentiable physics loss.
Navigation
Navigation method with MonoForce predicting terrain properties and possible robot trajectories from RGB images and control inputs. The package is used as robot-terrain interaction and path planning pipeline.
<p> <a href="https://drive.google.com/file/d/1mqKEh_3VHZo4kDcJXP572SD1BVw37hSf/view?usp=drive_link"> <img src="monoforce/docs/imgs/forest_navigation_video_teaser.png" alt="video link" width="800"> </a> </p>We provide the differentiable physics model for robot-terrain interaction prediction:
- Pytorch: The model is implemented in Pytorch. Please refer to the diff_physics.ipynb notebook for the example of the trajectory prediction.
Navigation consists of the following stages:
- Terrain prediction: The Terrain Encoder is used to estimate terrain properties.
- Trajectories simulation: The Physics Engine is used to shoot the robot trajectories.
- Trajectory selection: The trajectory with the smallest cost based on robot-terrain interaction forces is selected.
- Control: The robot is controlled to follow the selected trajectory.
Citation
Consider citing the papers if you find the work relevant to your research:
@inproceedings{agishev2024monoforce,
title={MonoForce: Self-supervised Learning of Physics-informed Model for Predicting Robot-terrain Interaction},
author={Ruslan Agishev and Karel Zimmermann and Vladimír Kubelka and Martin Pecka and Tomáš Svoboda},
booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems - IROS},
year={2024},
eprint={2309.09007},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2309.09007},
doi={10.1109/IROS58592.2024.10801353},
}
@inproceedings{agishev2024endtoend,
title={End-to-end Differentiable Model of Robot-terrain Interactions},
author={Ruslan Agishev and Vladim{\'\i}r Kubelka and Martin Pecka and Tomas Svoboda and Karel Zimmermann},
booktitle={ICML 2024 Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators},
year={2024},
url={https://openreview.net/forum?id=XuVysF8Aon}
}
Related Skills
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
isf-agent
a repo for an agent that helps researchers apply for isf funding
last30days-skill
17.2kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
