qapyq

(CapPic) An image viewer and AI-assisted editing tool that helps with curating datasets for generative AI models, finetunes and LoRA.

Screenshot of qapyq with its 5 windows open.

Features

Image Viewer: Display and navigate images
- Quick-starting desktop application built with Qt
- Runs smoothly with a million images
- Modular interface that lets you place windows on different monitors
- Open multiple tabs
- Zoom/pan and fullscreen mode
- Gallery with thumbnails and optional captions
- Semantic image sorting with text prompts
- Compare two images
- Measure size, area and pixel distances
- Slideshow
Image/Mask Editor: Prepare images for training
- Crop and save parts of images
- Scale images, optionally using AI upscale models
- Dynamic save paths with template variables
- Manually edit masks with multiple layers
- Generate masks with AI models
- Record masking operations into macros
- VAE-encode images and check their latent representation
Captioning: Describe images with text
- Edit captions manually with drag-and-drop support
- Save multiple captions in a JSON file per image
- Multi-Edit Mode: Edit captions of multiple images simultaneously
- Focus Mode: Add the same tags to many files quickly
- Tag grouping, merging, sorting, filtering and replacement rules
- Colored text highlighting
- Autocomplete with tags from your groups and CSV files
- CLIP Token Counter
- Automated captioning with support for grounding
- Dynamic prompts with templates and text transformations
- Multi-turn conversations with VLMs
- Further refinement with LLMs
Stats/Filters: Summarize your data and get an overview
- List all tags, image resolutions, masked regions, or sizes of concept folders
- Filter images and create subsets
- Combine and chain filters
- Export the summaries as CSV
Batch Processing: Process whole folders at once
- Flexible batch captioning, tagging and transformation
- Batch scaling of images
- Batch masking with user-defined macros
- Batch cropping of images using your macros
- Copy, move and rename files, create symlinks, ZIP captions for backups
AI Assistance:
- Support for state-of-the-art captioning and masking models
- Model and sampling settings, GPU acceleration with CPU offload support
- On-the-fly NF4 and INT8 quantization
- Run inference locally and/or on multiple remote machines over SSH
- Separate inference subprocess isolates potential crashes and allows complete VRAM cleanup
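
To illustrate the "multiple captions in a JSON file per image" idea from the Captioning list, here is a minimal Python sketch. It is not qapyq's actual code or file format: the sidecar naming (same file stem with a `.json` extension), the flat key/value layout, and the function names `sidecar_path`, `load_captions` and `save_caption` are all assumptions made for illustration.

```python
import json
from pathlib import Path


def sidecar_path(image_path):
    """Return the JSON file next to the image (assumed naming: same stem, .json)."""
    return Path(image_path).with_suffix(".json")


def load_captions(image_path):
    """Load all captions stored for an image, or an empty dict if none exist yet."""
    path = sidecar_path(image_path)
    if not path.exists():
        return {}
    with path.open(encoding="utf-8") as f:
        return json.load(f)


def save_caption(image_path, key, text):
    """Store one caption under a key (hypothetical keys, e.g. 'tags' or 'caption')."""
    captions = load_captions(image_path)
    captions[key] = text
    with sidecar_path(image_path).open("w", encoding="utf-8") as f:
        json.dump(captions, f, ensure_ascii=False, indent=2)
    return captions
```

With a layout like this, a keyword caption and a full-sentence caption for the same image can live side by side, for example `save_caption("cat.png", "tags", "cat, sofa")` followed by `save_caption("cat.png", "caption", "A cat sits on a sofa.")`.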

Supported Models

These are the supported architectures with links to the original models. Find more specialized finetuned models on huggingface.co.

Tagging Models for generating keyword captions for images.
- JoyTag
- PixAI Tagger (onnx)
- WD (onnx) (eva02 recommended)
Captioning Models for generating complete-sentence captions for images.
- Florence-2
- Gemma3 (GGUF)
- InternVL2, InternVL2.5, [InternVL2.5-MPO](https://huggingface.co/collecti
