Significant-Gravitas / AutoGPT: AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
oobabooga / Text Generation Webui: The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.
roboflow / Supervision: We write your reusable computer vision tools. 💜
microsoft / OmniParser: A simple screen parsing tool towards pure vision based GUI agent
crmne / Ruby Llm: One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, Azure, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, xAI, GPUStack & OpenAI compatible APIs. Agents, Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.
automeris-io / WebPlotDigitizer: Computer vision assisted tool to extract numerical data from plot images.
Turbo1123 / Roubao: Android Automation Tool Based on Vision-Language Models
qingchencloud / Clawpanel: 🦞 OpenClaw visual management panel with built-in AI assistant (tool calling + vision + multimodal + i18n(11)), one-click install
szczyglis-dev / Py Gpt: Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants, and more. Linux, Windows, Mac
unrealcv / Synthetic Computer Vision: A list of synthetic datasets and tools for computer vision
quietvoid / Dovi Tool: dovi_tool is a CLI tool combining multiple utilities for working with Dolby Vision.
Acly / Krita Vision Tools: Krita plugin which adds selection tools to mask objects with a single click, or by drawing a bounding box.
waybarrios / Vllm Mlx: OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.
open-edge-platform / Datumaro: Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
day8 / Re Frame 10x: A debugging dashboard for re-frame. X-ray vision as tooling.
cvhciKIT / Sloth: Sloth is a tool for labeling image and video data for computer vision research.
OvidijusParsiunas / Myvision: Computer vision based ML training data generation tool 🚀
Tessellate-Imaging / Monk V1: Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.
DIYer22 / Boxx: Tool-box for efficient build and debug in Python. Especially for Scientific Computing and Computer Vision.
PV-Bhat / Vibe Check MCP Server: Vibe Check is a tool that provides mentor-like feedback to AI Agents, preventing tunnel-vision, over-engineering and reasoning lock-in for complex and long-horizon agent workflows. KISS your over-eager AI Agents goodbye! Effective for: Coding, Ambiguous Tasks, High-Risk tasks