SimOTM

By converting single-channel grayscale images into multi-channel images through various data enhancement techniques, SimOTM enhances the detection capabilities of object detection models without complex manual design or combination requirements.

Generate Convert Improve

Install / Use

/learn @wandahangFY/SimOTM

About this skill

Quality Score

0/100

README

SimOTM: Simplified One-to-Many Preprocessing Method for Object Detection in Grayscale Images

Introduction

Gray-scale images are widely used in applications such as low-light imaging, medical diagnostics, and industrial inspection due to their simplicity and reduced computational requirements. However, their single-channel nature introduces challenges in object detection, including low object differentiation, noise, and luminance inequality. Traditional preprocessing methods aim to enhance detectability by removing irrelevant information and restoring useful details. Yet, these methods often rely on design for specific scenarios, lack universality, and can even degrade detection results if improperly applied. Consequently, many object detection algorithms avoid extensive preprocessing during training. Additionally, current methods underutilize the potential of single-channel gray-scale images. To address these issues, this paper proposes a simple and general preprocessing algorithm named one to many (OTM) for gray-scale object detection. By converting single-channel gray-scale images into multi-channel formats through image preprocessing and feeding them into the detection model, the algorithm improves detection performance without complex manual design. For validation, a simplified OTM method (SimOTM) is introduced to demonstrate its effectiveness. In this paper, the SimOTM method was incorporated into various object detection frameworks for improving the detection effect of models, and it was tested on four gray object detection datasets from distinct fields. In scenarios where speed remains comparable, the detection performance has been significantly enhanced. Specifically, the mean Average Precision (mAP) of YOLOv5 has improved by 0.43% to 1.37%, YOLOX-s has seen an increase of 0.33% to 3.88%, and YOLOv12 has boosted by 0.69% to 2.29%.

Contributions

A novel preprocessing method, OTM-Fusion, is proposed for grayscale object detection.
SimOTM, a simplified version of OTM-Fusion, is introduced for efficient deployment.
The method is integrated and validated across YOLOv3-YOLOv12 models.
Extensive validation on four open-source datasets proves its robustness and generality.

Quick Start Guide YOLOv11 or YOLOv11-RGBT

1. Clone the Project

git clone https://github.com/ultralytics/ultralytics.git 
cd ultralytics

git clone https://github.com/wandahangFY/YOLOv11-RGBT.git 
cd YOLOv11-RGBT

2. Modify the file

(1) Replace base.py under YOLOv11 with base.py of this project. (ultralytics/data/base.py)
(2) Specify the use_simotm parameter of the BaseDataset (use_simotm="SimOTM")

3. Prepare the Dataset

Configure your dataset directory or TXT file .

4. Install Dependencies

pip install -r requirements.txt

5. Run the Program

python train.py --data your_dataset_config.yaml

6. Testing

Run the test script to verify if the data loading is correct:

python val.py

Quick Start Guide YOLOv7

1. Clone the Project

git clone https://github.com/WongKinYiu/yolov7.git 
cd yolov7

2. Modify the file

(1) Replace base.py under YOLOv11 with datasets.py of this project. (utils/datasets.py)
(2) Specify the load_image (use SimOTM)

3. Prepare the Dataset

Configure your dataset directory or TXT file .

4. Install Dependencies

pip install -r requirements.txt

5. Run the Program

python train.py

6. Testing

Run the test script to verify if the data loading is correct:

python val.py

Implemented in C++ or CUDA

For specific implementation, please refer to Function.cpp.

Citation Format

Wan, Dahang & Lu, Rongsheng & Hu, Bingtao & Shen, Siyuan & Xu, Ting & Lang, Xianli. (2023). Otm-Fusion: An Image Preprocessing Method for Object Detection in Grayscale Image. 10.2139/ssrn.4532335.

Reference Links

Closing Remarks

Thank you for your interest and support in this project. The authors strive to provide the best quality and service, but there is still much room for improvement. If you encounter any issues or have any suggestions, please let us know. Furthermore, this project is currently maintained by the author personally, so there may be some oversights and errors. If you find any issues, feel free to provide feedback and suggestions.

Other Open-Source Projects

Other open-source projects are being organized and released gradually. Please check the author's homepage for downloads in the future. Homepage

Related Skills

diffs

343.1k

Use the diffs tool to produce real, shareable diffs (viewer URL, file artifact, or both) instead of manual edit summaries.

openpencil

1.9k

The world's first open-source AI-native vector design tool and the first to feature concurrent Agent Teams. Design-as-Code. Turn prompts into UI directly on the live canvas. A modern alternative to Pencil.

HappyColorBlend

HappyColorBlendVibe Project Guidelines Project Overview HappyColorBlendVibe is a Figma plugin for color palette generation with advanced tint/shade blending capabilities. It allows designers to

Flyaro-waffle-app

Waffle Delight - Full Stack MERN Application Rules & Documentation Project Overview A comprehensive waffle delivery application built with MERN stack featuring premium UI/UX, admin management, a

wandahangFY

View profile

View on GitHub

GitHub Stars30

CategoryDesign

Updated7mo ago

Forks7

wandahangFY/SimOTM

Languages

Python

Security Score

82/100

Audited on Sep 2, 2025

No findings