:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

🔌 MCP Serverclaude codeclaude desktop+1

mcpapivoice+1

Updated 1h ago

SamurAIGPT / Generative-Media-Skills

3.0k

Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.

🤖 CLAUDE.mdclaude codecursor+1

claudeapimcp+1

Updated 1h ago

SamurAIGPT / Generative-Media-Skills

3.0k

Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.

🔌 MCP Serverclaude codeclaude desktop+2

mcpapiimage

Updated 1h ago

pinkpixel-dev / MCPollinations

A Model Context Protocol (MCP) server that enables AI assistants to generate images, text, and audio through the Pollinations APIs. Supports customizable parameters, image saving, and multiple model options.

🔌 MCP Serverclaude codeclaude desktop

mcpapivoice+1

Updated 1mo ago

AgriciDaniel / claude-shorts

Interactive longform-to-shortform video creator — Claude Code skill with Remotion-rendered animated captions, AI segment scoring, cursor tracking, and audio-aware boundary snapping

📄 SKILL.mdclaude codecursor

skill

Updated 1d ago

wells1137 / media-skills

A collection of open-source Agent Skills for content creation — images, audio, and video.

📄 SKILL.mdclaude code

skillimage

Updated 10d ago

WeberG619 / cadre-ai

Voice-driven AI professional agent. Real-time conversations powered by Gemini Live API, native audio streaming, and multimodal intelligence. BIM/Revit, financial analysis, and web search tools.

🔌 MCP Serverclaude codeclaude desktop+1

mcpapiautomation+2

Updated 3d ago

agrathwohl / carla-mcp-server

An MCP server for controlling the Carla audio plugin host

🔌 MCP Serverclaude codeclaude desktop

mcp

Updated 5d ago

jftuga / transcript-critic

Claude Code skill that transcribes audio/video with whisper.cpp to get structured critical analysis including timestamped summaries, evidence notes, logical fallacies, and underdeveloped areas

📄 SKILL.mdclaude code

skill

Updated 3d ago

samson-art / transcriptor-mcp

An MCP server (stdio + HTTP/SSE) that fetches video transcripts/subtitles via yt-dlp, with pagination for large responses. Supports YouTube, Twitter/X, Instagram, TikTok, Twitch, Vimeo, Facebook, Bilibili, VK, Dailymotion. Whisper fallback — transcribes audio when subtitles are unavailable (local or OpenAI API). Works with Cursor and other MCP host

🔌 MCP Serverclaude codeclaude desktop+1

mcpapidocker

Updated 4d ago