JTMDet
No description available
Install / Use
/learn @LiC2023/JTMDetREADME
Joint Transformer and Mamba Fusion for Multispectral Object Detection
Intro
Official Code for Joint Transformer and Mamba Fusion for Multispectral Object Detection.
Installation
Install libraries including numpy, pytorch, timm, mamba-ssm, etc. according to requirement.txt
Training
- change the pretrain-weight and data cfg.
- change the img2path and filter label
-
python train.py
Test
- change the weights and data cfg
- change the img2path and filter label
-
python val.py
Visualize
- change the weights and data cfg
- change the img2path and filter label
-
python result_vis.py
Dataset
The datasets and annotations used in this repo:
-FLIR [BaiDu Drive] (code:jtm6)
-LLVIP [BaiDu Drive] (code:jtm6)
-M<sup>3</sup>FD [BaiDu Drive](code:jtm6)
Weight
-FLIR [BaiDu Drive] (code:jtm6)
-LLVIP [BaiDu Drive] (code:jtm6)
-M<sup>3</sup>FD [BaiDu Drive] (code:jtm6)
Pretrain_YOLOv5_Weight
-YOLOv5 [BaiDu Drive] (code:jtm6)
Cite
If you find our model/method/dataset useful, please cite our work:
@article{li2025joint,
title={Joint Transformer and Mamba fusion for multispectral object detection},
author={Li, Chao and Peng, Xiaoming},
journal={Image and Vision Computing},
pages={105468},
year={2025},
publisher={Elsevier}
}
Related Skills
node-connect
351.4kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
110.7kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
351.4kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
351.4kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
