Autoshow
End-to-end workflow to automatically generate show notes from audio/video transcripts
Install / Use
/learn @autoshow/Autoshow
About this skill
Quality Score
0/100
Category
Content & Media
Supported Platforms
Claude Code
Claude Desktop
Gemini CLI
README
autoshow-bun
Overview
Full-stack application for automatically processing videos and audio with AI transcription, summarization, text-to-speech, image generation, and music generation.
Features
- Audio Processing: Extract audio from YouTube videos, streaming platforms, or direct file uploads
- AI Transcription: Groq Whisper, DeepInfra, Fal, Gladia, ElevenLabs, Rev, AssemblyAI, Deepgram, Soniox, or HappyScribe
- LLM Summarization: OpenAI GPT, Claude, or Google Gemini models for generating episode descriptions, summaries, and chapters
- Text-to-Speech: Convert summaries to audio with OpenAI or ElevenLabs voices
- Image Generation: Create cover images from AI-generated prompts
- Music Generation: Generate background music with ElevenLabs in multiple genres
- Docker Support: Containerized deployment with analysis and optimization tools
- Build Analysis: Analyze and optimize SolidStart bundle size and performance
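The features above suggest a sequential pipeline: extract audio, transcribe it, summarize the transcript, then optionally produce speech, images, or music. The following is a minimal sketch of that idea, not Autoshow's actual implementation. Every name here (`StepName`, `PipelineContext`, `Step`, `runPipeline`, `stubSteps`) is hypothetical, and the stub steps stand in for real provider calls.

```typescript
// Hypothetical sketch: none of these names come from the Autoshow codebase.
type StepName = "download" | "transcribe" | "summarize" | "speech" | "image" | "music";

interface PipelineContext {
  source: string;                    // URL or file path of the input media
  completed: StepName[];             // steps that have finished, in order
  artifacts: Record<string, string>; // step output keyed by step name
}

interface Step {
  name: StepName;
  run: (ctx: PipelineContext) => Promise<string>;
}

// Runs the steps one after another; each step can read earlier artifacts.
async function runPipeline(source: string, steps: Step[]): Promise<PipelineContext> {
  const ctx: PipelineContext = { source, completed: [], artifacts: {} };
  for (const step of steps) {
    ctx.artifacts[step.name] = await step.run(ctx);
    ctx.completed.push(step.name);
  }
  return ctx;
}

// Stub steps standing in for real transcription and LLM provider calls.
const stubSteps: Step[] = [
  { name: "download", run: async (ctx) => `audio extracted from ${ctx.source}` },
  { name: "transcribe", run: async () => "transcript text" },
  { name: "summarize", run: async (ctx) => `summary of: ${ctx.artifacts["transcribe"]}` },
];
```

Keeping each step behind the same small interface is one way to make providers swappable (for example, trading one transcription service for another) without touching the runner.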
Quick Start
Test URLs
https://www.youtube.com/watch?v=nXtaETBZ29g
https://ajc.pics/audio/fsjam-short.mp3
https://ajc.pics/autoshow/textract-3.pdf
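The three test URLs cover three kinds of input: a YouTube video, a direct audio file, and a PDF document. As an illustration only, the sketch below shows one way such inputs could be told apart before processing. The function `classifyInput`, the `InputKind` type, and the list of audio extensions are assumptions made for this example, not part of Autoshow.

```typescript
// Hypothetical helper: Autoshow's real input detection may work differently.
type InputKind = "youtube" | "audio" | "document" | "unknown";

function classifyInput(rawUrl: string): InputKind {
  let url: URL;
  try {
    url = new URL(rawUrl);
  } catch {
    // Not a parseable absolute URL.
    return "unknown";
  }
  const host = url.hostname.replace(/^www\./, "");
  if (host === "youtube.com" || host === "youtu.be") return "youtube";

  const path = url.pathname.toLowerCase();
  // The extension list is an assumption chosen for illustration.
  if (/\.(mp3|wav|m4a)$/.test(path)) return "audio";
  if (path.endsWith(".pdf")) return "document";
  return "unknown";
}
```

For example, `classifyInput("https://ajc.pics/audio/fsjam-short.mp3")` returns `"audio"` under these assumptions.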
Install Dependencies
bun install
Run Type Check
bun check
Start Local Development Server
bun dev
Visit http://localhost:4321 to generate show notes through the web interface.
Start Docker Container (Optional)
bun up
Documentation
CLI Commands
- Overview - CLI commands and usage
- Configuration - Environment setup and API keys
- Docker - Container management
- Build Report - Bundle analysis and optimization
- E2E Testing - End-to-end test suite
- Fetch Models - Update model definitions
API Documentation
- Overview - API endpoints and usage
- Health - Health check endpoint
- Jobs - Job management
- Process - Multi-step processing pipeline
- Download - File upload endpoints
- Media - Audio, image, and video processing
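As a rough illustration of how a client might use the health check endpoint listed above, here is a small sketch. The path `/api/health` is a placeholder assumption (the real route is defined in the Health documentation), and `checkHealth` and `FetchLike` are hypothetical names. The fetch function is passed in as a parameter so the helper can be exercised without a running server.

```typescript
// Minimal shape of the fetch function this sketch needs.
type FetchLike = (url: string) => Promise<{ ok: boolean; status: number }>;

// Hypothetical helper: resolves to true when the server answers with a 2xx status.
async function checkHealth(baseUrl: string, fetchImpl: FetchLike): Promise<boolean> {
  // ASSUMPTION: "/api/health" is a placeholder, not a confirmed Autoshow route.
  const url = new URL("/api/health", baseUrl).toString();
  try {
    const res = await fetchImpl(url);
    return res.ok;
  } catch {
    // Network errors (for example, server not started) count as unhealthy.
    return false;
  }
}

// Possible usage against the local dev server from the Quick Start:
//   const healthy = await checkHealth("http://localhost:4321", fetch);
```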
Testing
- Overview - Testing strategy
- Verify Tests - Model verification
- E2E Tests - End-to-end tests
- API Tests - API endpoint tests
- Playwright Tests - Browser tests
Architecture
- Models & Configuration - All models, pricing, performance, and env vars
- Steps Architecture - Processing pipeline
- Progress Tracking - Progress system
