Cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Install / Use
/learn @cvat-ai/CvatREADME
Computer Vision Annotation Tool (CVAT)
[![CI][ci-img]][ci-url] [![Gitter chat][gitter-img]][gitter-url] [![Discord][discord-img]][discord-url] [![Coverage Status][coverage-img]][coverage-url] [![server pulls][docker-server-pulls-img]][docker-server-image-url] [![ui pulls][docker-ui-pulls-img]][docker-ui-image-url] [![DOI][doi-img]][doi-url] [![Status][status-img]][status-url]
CVAT is an interactive video and image annotation tool for computer vision. It is used by tens of thousands of users and companies around the world. Our mission is to help developers, companies, and organizations around the world to solve real problems using the Data-centric AI approach.
Start using CVAT online: cvat.ai. You can use it for free, or subscribe to get unlimited data, organizations, autoannotations, and Roboflow and HuggingFace integration.
Or set CVAT up as a self-hosted solution: Self-hosted Installation Guide. We provide Enterprise support for self-hosted installations with premium features: SSO, LDAP, Roboflow and HuggingFace integrations, and advanced analytics (coming soon). We also do trainings and a dedicated support with 24 hour SLA.
Quick start ⚡
- Installation guide
- Manual
- Contributing
- Datumaro dataset framework
- Server API
- Python SDK
- Command line tool
- XML annotation format
- AWS Deployment Guide
- Frequently asked questions
- Where to ask questions
Partners ❤️
CVAT is used by teams all over the world. In the list, you can find key companies which help us support the product or an essential part of our ecosystem. If you use us, please drop us a line at contact@cvat.ai.
- Human Protocol uses CVAT as a way of adding annotation service to the Human Protocol.
- FiftyOne is an open-source dataset curation and model analysis tool for visualizing, exploring, and improving computer vision datasets and models that are tightly integrated with CVAT for annotation and label refinement.
Public datasets
ATLANTIS, an open-source dataset for semantic segmentation of waterbody images, developed by iWERS group in the Department of Civil and Environmental Engineering at the University of South Carolina is using CVAT.
For developing a semantic segmentation dataset using CVAT, see:
CVAT online: cvat.ai
This is an online version of CVAT. It's free, efficient, and easy to use.
cvat.ai runs the latest version of the tool. You can create up to 10 tasks there and upload up to 500Mb of data to annotate. It will only be visible to you or the people you assign to it.
For now, it does not have analytics features like management and monitoring the data annotation team. It also does not allow exporting images, just the annotations.
We plan to enhance cvat.ai with new powerful features. Stay tuned!
Prebuilt Docker images 🐳
Prebuilt docker images are the easiest way to start using CVAT locally. They are available on Docker Hub:
The images have been downloaded more than 1M times so far.
Screencasts 🎦
Here are some screencasts showing how to use CVAT.
<!--lint disable maximum-line-length-->Computer Vision Annotation Course: we introduce our course series designed to help you annotate data faster and better using CVAT. This course is about CVAT deployment and integrations, it includes presentations and covers the following topics:
- Speeding up your data annotation process: introduction to CVAT and Datumaro. What problems do CVAT and Datumaro solve, and how they can speed up your model training process. Some resources you can use to learn more about how to use them.
- Deployment and use CVAT. Use the app online at app.cvat.ai. A local deployment. A containerized local deployment with Docker Compose (for regular use), and a local cluster deployment with Kubernetes (for enterprise users). A 2-minute tour of the interface, a breakdown of CVAT’s internals, and a demonstration of how to deploy CVAT using Docker Compose.
Product tour: in this course, we show how to use CVAT, and help to get familiar with CVAT functionality and interfaces. This course does not cover integrations and is dedicated solely to CVAT. It covers the following topics:
- Pipeline. In this video, we show how to use app.cvat.ai: how to sign up, upload your data, annotate it, and download it.
For feedback, please see Contact us
API
SDK
- Install with
pip install cvat-sdk - PyPI package homepage
- Documentation
CLI
- Install with
pip install cvat-cli - PyPI package homepage
- Documentation
Supported annotation formats
CVAT supports multiple annotation formats. You can select the format after clicking the Upload annotation and Dump annotation buttons. Datumaro dataset framework allows additional dataset transformations with its command line tool and Python library.
For more information about the supported formats, see: Annotation Formats.
<!--lint disable maximum-line-length-->| Annotation format | Import | Export | | ------------------------------------------------------------------------------------------------ | ------ | ------ | | CVAT for images | ✔️ | ✔️ | | CVAT for a video | ✔️ | ✔️ | | Datumaro | ✔️ | ✔️ | | PASCAL VOC | ✔️ | ✔️ | | Segmentation masks from PASCAL VOC | ✔️ | ✔️ | | YOLO | ✔️ | ✔️ | | MS COCO Object Detection | ✔️ | ✔️ | | MS COCO Keypoints Detection | ✔️ | ✔️ | | MOT | ✔️ | ✔️ | | MOTS PNG | ✔️ | ✔️ | | LabelMe 3.0 | ✔️ | ✔️ | | ImageNet | ✔️ | ✔️ | | CamVid | ✔️ | ✔️ | | WIDER Face | ✔️ | ✔️ | | VGGFace2 | ✔️ | ✔️ | | Market-1501 | ✔️ | ✔️ | | ICDAR13/15 | ✔️ | ✔️ | | Open Images V6 | ✔️ | ✔️ | | Cityscapes | ✔️ | ✔️ | | KITTI | ✔️ | ✔️ | | Kitti Raw Format | ✔️ | ✔️ | | LFW
Related Skills
docs-writer
98.5k`docs-writer` skill instructions As an expert technical writer and editor for the Gemini CLI project, you produce accurate, clear, and consistent documentation. When asked to write, edit, or revie
model-usage
326.5kUse CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
arscontexta
2.8kClaude Code plugin that generates individualized knowledge systems from conversation. You describe how you think and work, have a conversation and get a complete second brain as markdown files you own.
skills
N-API Template - Code Generation Reference This document serves as a comprehensive reference for generating API code following the N-API Template standards. Use this guide to ensure consistency and
