MT3
Code for the paper "Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning" (https://arxiv.org/abs/2104.00734).
Install / Use
/learn @JulianoLagana/MT3README
Introduction
This repository contains the code for the paper "Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning" (https://arxiv.org/abs/2104.00734). The code for MT3 was developed as joint effort by Juliano Pinto, Georg Hess, William Ljungbergh, Yuxuan Xia, Lennart Svensson and Henk Wymeersch, and was partially based on the code available at the repositories for DETR and Deformable-DETR.
Setting up
In order to set up a conda environment with all the necessary dependencies, run the command:
conda env create -f conda-env/environment-<gpu/cpu>.yml
Running experiments
Run an experiment using the train.py script. Example usage:
src/training.py -tp configs/tasks/task1.yaml -mp configs/models/mt3.yaml
Training hyperparameters such as batch size, learning rate, checkpoint interval, etc, are found in the file configs/models/mt3.yaml.
Evaluating experiments
After an experiment has generated checkpoints, you can evaluate its average GOSPA score using the eval.py script. The evaluation hyperparameters can be found inside configs/eval/default.yaml. Example usage:
src/eval.py -rp src/results/experiment_name -tp configs/tasks/task1.yaml
Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
last30days-skill
16.5kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
sec-edgar-agentkit
10AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.
