Monoforce

[IROS 2024] [ICML 2024 Workshop Differentiable Almost Everything] MonoForce: Learnable Image-conditioned Physics Engine

Generate Convert Improve

Install / Use

/learn @ctu-vras/Monoforce

About this skill

Quality Score

0/100

README

MonoForce: Learnable Image-conditioned Physics Engine

[!Note] An updated version is available at ctu-vras/fusionforce.

Robot-terrain interaction prediction from RGB camera images as input:

predicted trajectory,
terrain shape and properties,
interaction forces and contacts.

Examples of predicted trajectories and autonomous traversal through vegetation:

Running

The MonoForce pipeline consists of the Terrain Encoder and the Physics Engine. Given input RGB images and cameras calibration the Terrain Encoder predicts terrain properties. Then the differentiable Physics Engine simulates robot trajectory and interaction forces on the predicted terrain for a provided control sequence. Refer to the monoforce/examples folder for implementation details.

Please run the following command to explore the MonoForce pipeline:

cd monoforce/
python scripts/run.py --img-paths IMG1_PATH IMG2_PATH ... IMGN_PATH --cameras CAM1 CAM2 ... CAMN --calibration-path CALIB_PATH

For example if you want to test the model with the provided images from the ROUGH dataset:

cd monoforce/scripts/
./run.sh

Please, refer to the installation instructions to download the pre-trained model weights.

ROS Integration

We provide a ROS nodes for both the trained Terrain Encoder model and the Differentiable Physics module. They are integrated into the launch file:

roslaunch monoforce monoforce.launch

Training

The following terrain properties are predicted by the model:

Elevation: the terrain shape.
Friction: the friction coefficient between the robot and the terrain.
Stiffness: the terrain stiffness.
Damping: the terrain damping.

An example of the predicted elevation and friction maps (projected to camera images):

<a href="https://drive.google.com/file/d/15Uo82hwE_OiRHsuGd0-9qcvrYOXsosn0/view?usp=drive_link"> <img src="./monoforce/docs/imgs/friction_prediction_tradr.png" alt="video link" width="800"> </a> One can see that the model predicts the friction map with higher values for road areas and with the smaller value for grass where the robot could have less traction.

To train the model, please run:

cd monoforce/scripts/
python train.py

Please refer to the train_friction_head_with_pretrained_terrain_encoder.ipynb notebook for the example of the terrain properties learning with the pretrained Terrain Encoder model and differentiable physics loss.

Navigation

Navigation method with MonoForce predicting terrain properties and possible robot trajectories from RGB images and control inputs. The package is used as robot-terrain interaction and path planning pipeline.

We provide the differentiable physics model for robot-terrain interaction prediction:

Pytorch: The model is implemented in Pytorch. Please refer to the diff_physics.ipynb notebook for the example of the trajectory prediction.

Navigation consists of the following stages:

Terrain prediction: The Terrain Encoder is used to estimate terrain properties.
Trajectories simulation: The Physics Engine is used to shoot the robot trajectories.
Trajectory selection: The trajectory with the smallest cost based on robot-terrain interaction forces is selected.
Control: The robot is controlled to follow the selected trajectory.

Citation

Consider citing the papers if you find the work relevant to your research:

@inproceedings{agishev2024monoforce,
    title={MonoForce: Self-supervised Learning of Physics-informed Model for Predicting Robot-terrain Interaction},
    author={Ruslan Agishev and Karel Zimmermann and Vladimír Kubelka and Martin Pecka and Tomáš Svoboda},
    booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems - IROS},
    year={2024},
    eprint={2309.09007},
    archivePrefix={arXiv},
    primaryClass={cs.RO},
    url={https://arxiv.org/abs/2309.09007},
    doi={10.1109/IROS58592.2024.10801353},
}

@inproceedings{agishev2024endtoend,
    title={End-to-end Differentiable Model of Robot-terrain Interactions},
    author={Ruslan Agishev and Vladim{\'\i}r Kubelka and Martin Pecka and Tomas Svoboda and Karel Zimmermann},
    booktitle={ICML 2024 Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators},
    year={2024},
    url={https://openreview.net/forum?id=XuVysF8Aon}
}

Related Skills

best-practices-researcher

The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

isf-agent

a repo for an agent that helps researchers apply for isf funding

last30days-skill

17.2k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

ctu-vras

View profile

View on GitHub

GitHub Stars90

CategoryEducation

Updated7d ago

Forks8

ctu-vras/monoforce

Languages

Jupyter Notebook

Security Score

100/100

Audited on Mar 25, 2026

No findings