Alturos.ImageAnnotation
A collaborative tool for labeling image data for YOLO

This project manages training data for neural networks. The images are stored in an object store, for example Amazon S3. In our case, we have image data from different runs that we want to annotate together. You can upload a folder as a package, and for every package you can set your own tags. This information is stored in a database, for example Amazon DynamoDB.

Features
- Collaborative annotation of images
- Verification of image annotation data
- Export for YOLO (train.txt, test.txt, obj.names) with filters
- No requirement for a custom server
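To illustrate the export files named in the feature list, here is a minimal sketch of their usual Darknet/YOLO layout: train.txt and test.txt hold one image path per line, and obj.names holds one class name per line, where the line position is the class id. The function name `build_yolo_export`, the `test_fraction` parameter, and the random split are assumptions made for this example; how Alturos.ImageAnnotation selects and filters images on export is not described here.

```python
import random


def _lines(items):
    """Join items into newline-terminated text (empty input gives an empty string)."""
    return "".join(f"{item}\n" for item in items)


def build_yolo_export(image_paths, class_names, test_fraction=0.2, seed=0):
    """Return the text content of train.txt, test.txt and obj.names as a dict.

    Hypothetical helper: it only illustrates the file layout, not the tool's own export.
    """
    if not 0.0 <= test_fraction <= 1.0:
        raise ValueError("test_fraction must be between 0 and 1")

    # Sort before shuffling so the split is reproducible regardless of input order.
    paths = sorted(image_paths)
    random.Random(seed).shuffle(paths)

    test_count = round(len(paths) * test_fraction)
    test_paths = sorted(paths[:test_count])
    train_paths = sorted(paths[test_count:])

    return {
        "train.txt": _lines(train_paths),
        "test.txt": _lines(test_paths),
        # The position of a name in obj.names is the class id used in label files.
        "obj.names": _lines(class_names),
    }
```

Each value in the returned dict can be written to a file of the same name, for example with `pathlib.Path(name).write_text(text)`.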
Installation
You can either use AWS or set up your own environment.
Keyboard Shortcuts
| Shortcut | Description |
| --- | --- |
| <kbd>↓</kbd> | Next image |
| <kbd>↑</kbd> | Previous image |
| <kbd>→</kbd> | Next object class |
| <kbd>←</kbd> | Previous object class |
| <kbd>0</kbd>-<kbd>9</kbd> | Select object class |
| <kbd>W</kbd><kbd>A</kbd><kbd>S</kbd><kbd>D</kbd><br>+<kbd>Shift</kbd><br>+<kbd>Ctrl</kbd><br>+<kbd>Alt</kbd> | Move bounding box<br>Resize<br>Quick<br>Invert |
Data preparation
How many images are required
Preferably, you should have 2000 or more different images for each class.
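One way to check a dataset against this guideline is to count how many distinct images contain each class. The sketch below assumes YOLO-style label text, where each line starts with an integer class id followed by the normalized box coordinates. The function names and the in-memory `labels_by_image` mapping are hypothetical and only serve this example.

```python
from collections import Counter


def images_per_class(labels_by_image, class_names):
    """Count the distinct images that contain at least one box of each class.

    labels_by_image maps an image name to the text of its YOLO label file.
    """
    counts = Counter()
    for label_text in labels_by_image.values():
        class_ids = set()
        for line in label_text.splitlines():
            fields = line.split()
            if not fields:
                continue  # skip blank lines
            class_ids.add(int(fields[0]))  # first field is the class id
        # An image counts once per class, however many boxes it has.
        counts.update(class_ids)
    return {name: counts.get(i, 0) for i, name in enumerate(class_names)}


def classes_below_target(per_class, target=2000):
    """Return the class names that have fewer than `target` images."""
    return sorted(name for name, n in per_class.items() if n < target)
```

Calling `classes_below_target(images_per_class(labels, names))` then lists the classes that still need more images.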
Extract images from a video
If you have a video file and need the individual frames, you can use ffmpeg to extract them. The following command exports every 10th frame of the video. The output directory (here 1_every_10) must already exist, because ffmpeg does not create it.
ffmpeg -i input.mp4 -vf "select=not(mod(n\,10))" -vsync vfr 1_every_10/img_%03d.jpg
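If you want to script this step, the command above can be assembled and run from Python. This is a minimal sketch: the helper names `build_ffmpeg_args` and `extract_frames` are hypothetical, the flags are the same ones used in the command above, and ffmpeg is assumed to be on the PATH.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_args(input_path, output_pattern, every_n=10):
    """Build the argument list that exports every `every_n`-th frame."""
    if every_n < 1:
        raise ValueError("every_n must be at least 1")
    # The comma inside mod(...) is escaped with a backslash for ffmpeg's
    # filter parser, exactly as in the shell command.
    select_filter = f"select=not(mod(n\\,{every_n}))"
    return [
        "ffmpeg",
        "-i", str(input_path),
        "-vf", select_filter,
        "-vsync", "vfr",
        str(output_pattern),
    ]


def extract_frames(input_path, output_pattern, every_n=10):
    """Create the output directory if needed, then run ffmpeg."""
    Path(output_pattern).parent.mkdir(parents=True, exist_ok=True)
    # Passing a list avoids shell quoting issues; check=True raises on failure.
    subprocess.run(build_ffmpeg_args(input_path, output_pattern, every_n), check=True)
```

For example, `extract_frames("input.mp4", "1_every_10/img_%03d.jpg", every_n=10)` runs the same extraction as the command above and creates the 1_every_10 directory first.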
Articles of interest
Credits
This program uses icons from the Silk icon set created by Mark James. The icon set is licensed under a CC BY 3.0 license. Some changes were made to the icons.
Other Image Annotation Tools
