Results for "vision-ai"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

1,086 skills found · Page 1 of 37

Significant-Gravitas / AutoGPT

183.1k

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

claude codeclaude desktop

agentic-aiagentsai+8

100

Updated 16m ago

mudler / LocalAI

44.8k

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

zedclaude code+1

agentsaiapi+15

Updated 6m ago

ashishpatel26 / 500 AI Machine Learning Deep Learning Computer Vision NLP Projects With Code

32.6k

500 AI Machine learning Deep learning Computer vision NLP Projects with code

universal

artificial-intelligenceartificial-intelligence-projectsawesome+10

Updated 1h ago

jacobgil / Pytorch Grad Cam

12.7k

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

universal

class-activation-mapscomputer-visiondeep-learning+14

Updated 11h ago

web-infra-dev / Midscene

12.5k

AI-powered, vision-driven UI automation for every platform.

universal

aiai-testbrowser-use+5

Updated 3h ago

kornia / Kornia

11.1k

🐍 Geometric Computer Vision Library for Spatial AI

universal

artificial-intelligencecomputer-visiondeep-learning+8

Updated 1d ago

dusty-nv / Jetson Inference

8.8k

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

universal

caffecomputer-visiondeep-learning+17

Updated 8h ago

GetStream / Vision Agents

7.6k

Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

universal

agentic-aiagentsai+8

Updated 22m ago

facebookresearch / Mmf

5.6k

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

universal

captioningdeep-learningdialog+7

Updated 1d ago

TarrySingh / Artificial Intelligence Deep Learning Machine Learning Tutorials

4.0k

A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

universal

artificial-intelligenceawscapsule-network+17

Updated 8h ago

NVlabs / VILA

3.8k

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

universal

Updated 23h ago

SkyworkAI / Skywork R1V

3.2k

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

universal

deepseek-r1grpollm+8

Updated 4d ago

jonyzhang2023 / Awesome Embodied Vla Va Vln

2.9k

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

universal

Updated 2h ago

roflcoopter / Viseron

2.7k

Self-hosted, local only NVR and AI Computer Vision software. With features such as object detection, motion detection, face recognition and more, it gives you the power to keep an eye on your home, office or any other place you want to monitor.

universal

coralcudadarknet+17

Updated 5h ago

enpeizhao / CVprojects

2.6k

computer vision projects | 计算机视觉相关好玩的AI项目（Python、C++、embedded system）

universal

computer-visioncppcuda+5

Updated 5h ago

icereed / Paperless Gpt

2.2k

Use LLMs and LLM Vision (OCR) to handle paperless-ngx - Document Digitalization powered by AI

universal

aichatgptllm+5

Updated 23h ago

qingchencloud / Clawpanel

2.0k

🦞 OpenClaw 可视化管理面板 — 内置 AI 助手（工具调用 + 图片识别 + 多模态），一键安装 | Visual management panel with built-in AI assistant (tool calling + vision + multimodal + i18n(11))

universal

admin-panelai-agentai-assistant+17

Updated 7m ago

Intent-Lab / VisionClaw

2.0k

Real-time AI assistant for Meta Ray-Ban smart glasses -- voice + vision + agentic actions via Gemini Live and OpenClaw

gemini cli

Updated 29m ago

szczyglis-dev / Py Gpt

1.7k

Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants,and more. Linux, Windows, Mac

claude codeclaude desktop+2

aiai-assistantartificial-intelligence+17

Updated 1d ago

cvzone / Cvzone

1.3k

This is a Computer vision package that makes its easy to run Image processing and AI functions. At the core it uses OpenCV and Mediapipe libraries.

universal

computervisionmediapipeopencv+1

Updated 18m ago