Snippets
Automatically extracts, processes, and indexes code snippets from GitHub repositories into a searchable vector database.
Install / Use
/learn @cheolwanpark/SnippetsREADME
SNIPPETS
Snippets is an intelligent code repository system that automatically extracts, processes, and indexes code snippets from GitHub repositories into a searchable vector database. It combines AI-powered code analysis with semantic search capabilities to help developers quickly find relevant code examples and patterns.
Key Features
- 🚀 Use Your Claude Subscription: Access this tool with your Claude subscription plan. no extra cost!
- 🤖 Automated Repository Processing: Extract meaningful code snippets from any GitHub repository
- 🔍 Semantic Search: Find code by meaning, not just keywords, using vector embeddings
- 🔧 MCP Integration: Seamless integration with Claude Code through Model Context Protocol
- ⚡ Background Processing: Efficient queue-based processing for large repositories
- 🐳 Docker Ready: Complete containerized setup for easy deployment
How It Works
- Repository Ingestion: Add GitHub repositories through the web interface
- AI Processing: Claude Code agents analyze and extract meaningful code snippets
- Vector Embedding: Code snippets are converted to semantic vectors using state-of-the-art models
- Smart Storage: Snippets are indexed in Qdrant vector database for fast similarity search
- Easy Discovery: Search and explore code through the web UI or integrate with Claude Code via MCP
Quick Start
Prerequisites
- Docker & Docker Compose: For containerized deployment
- API Keys:
- Claude Code OAuth Token (from you subscription,
claude setup-token) - Google Gemini API key (for embeddings, you can claim Free API Key)
- Claude Code OAuth Token (from you subscription,
1. Clone and Setup
git clone https://github.com/cheolwanpark/snippets
cd snippets
2. Environment Configuration
Create your environment file:
cp docker/.env.example docker/.env
Edit docker/.env with your API keys:
# Required: Claude API for code analysis
CLAUDE_CODE_OAUTH_TOKEN=your_claude_token_here
# Required: Gemini API for embeddings
EMBEDDING_API_KEY=your_gemini_api_key_here
# Optional: GitHub PAT, required for PRIVATE repository access
GITHUB_TOKEN=your_github_pat_here
# Optional: Cohere API key, required for reranking
COHERE_API_KEY=your_cohere_api_key_here
# Optional: Customize ports
FRONT_PORT=3000
MCP_PORT=8080
3. Launch with Docker
cd docker
docker-compose up -d
This starts all services:
- Frontend: http://localhost:3000
- API with MCP server: http://localhost:8000 (api), http://localhost:8080/mcp (mcp)
- Worker: controlled by RQ
- Qdrant: http://localhost:6333
- Redis: localhost:6379
4. First Repository
- Open http://localhost:3000 in your browser
- Enter a GitHub repository URL (e.g.,
https://github.com/user/repo) - Click 'Embed'
- Monitor progress in the dashboard
- Search your snippets in 'Query' tab once processing completes!
5. Connect MCP server to claude
claude mcp add --transport http snippets http://localhost:8080/mcp
This enables Claude Code to search your processed snippets directly during development.
Configuration
Essential Environment Variables
The system requires minimal configuration for most use cases:
Required API Keys
# Claude API token for AI-powered code analysis
CLAUDE_CODE_OAUTH_TOKEN=your_claude_api_token
# Google Gemini API key for generating embeddings
EMBEDDING_API_KEY=your_gemini_api_key
Optional Customization
# Service Ports
FRONT_PORT=3000 # Frontend web interface
MCP_PORT=8080 # MCP server port
# Database Configuration (use defaults unless you have existing instances)
QDRANT_URL=http://qdrant:6333
REDIS_URL=redis://redis:6379
# Processing Settings
EMBEDDING_MODEL=gemini-embedding-001 # Embedding model to use
PIPELINE_MAX_CONCURRENCY=10 # Concurrency setup - how many files will be processed in parallel
Getting API Keys
- Claude API Token: type
claude setup-tokenin your terminal - Gemini API Key: Get one from Google AI Studio
Usage
Web Interface
The web interface at http://localhost:3000 provides a complete repository management experience:
Repository Management
-
Add Repository:
- Go to 'Embed' Tab
- Enter GitHub URL (private repositories supported with your PAT token)
- Optional: Configure processing options (file filters, size limits)
- Start processing
-
Monitor Progress:
- Real-time processing status
- Progress indicators for extraction phases
- Error reporting for failed repositories
-
Repository Settings:
- Edit processing configuration
- Re-process with different settings
- Remove repositories and their snippets
Snippet Search
-
Semantic Search:
- Go to 'Query' Tab
- Enter natural language queries ("authentication middleware", "error handling")
- Use technical terms ("async function", "React hooks")
- Search by programming concepts
- Optional: Configure search options (repository name, language)
-
Explore Snippets:
- View code with syntax highlighting
- See file context and repository source
- Copy snippets for use in your projects
MCP Integration with Claude Code
The Snippets MCP server enables seamless integration with Claude Code for enhanced development workflows.
Setup MCP Connection
# in your project directory
claude mcp add --transport http snippets http://localhost:8080/mcp
# OR configure the mcp server for user scope
claude mcp add -s user --transport http snippets http://localhost:8080/mcp
# $CODEX_HOME/config.toml
[mcp_servers.snippets]
url = "http://localhost:8080/mcp"
Using the Search Tool
Once connected, Claude Code gains access to the search tool:
# Find error handling patterns
search error handling patterns in Python. use snippets.
# Find authentication patterns
search JWT authentication middleware. use snippets.
# Language-specific search
search async database queries. use snippets.
# Repository-specific search
search React component patterns. use snippets.
Search Parameters (will be configured automatically by claude code):
query: Natural language description of what you're looking forlimit: Number of results (default: 10, max: 50)repo_name: Filter to specific repositorylanguage: Filter by programming language
Workflow Integration
Use the MCP integration for:
- Code Discovery: Find examples before implementing new features
- Pattern Research: Explore different approaches to common problems
- Learning: Understand how concepts are implemented across repositories
- Code Review: Find similar implementations for comparison
Architecture
System Overview
Snippets is built as a microservices architecture with clear separation of concerns:
┌─────────────────┐ ┌───────────────────┐ ┌─────────────────┐
│ Frontend │ │ API/MCP Server │ │ Worker Pool │
│ (Next.js) │◄──►│ (FastAPI/FastMCP) │◄──►│ (RQ/Redis) │
└─────────────────┘ └───────────────────┘ └─────────────────┘
│ │
▼ ▼
┌──────────────────┐ ┌─────────────────┐
│ Vector DB │ │ Message Queue │
│ (Qdrant) │ │ (Redis) │
└──────────────────┘ └─────────────────┘
Components
Frontend (Next.js)
- Location:
front/ - Purpose: Web interface for repository management and snippet search
- Technology: Next.js 14, React, TypeScript, Tailwind CSS
- Features:
- Repository CRUD operations
- Real-time processing status
- Semantic search interface
- Responsive design with dark/light themes
API Server (FastAPI)
- Location:
src/api/ - Purpose: REST API for all backend operations
- Technology: FastAPI, Python 3.12, Pydantic
- Endpoints:
- Repository management (
/repo) - Snippet search (
/snippets)
- Repository management (
Worker System (RQ/Redis)
- Location:
src/worker/ - Purpose: Background processing of repositories
- Technology: RQ (Redis Queue), Redis
- Responsibilities:
- Repository cloning and analysis
- Code snippet extraction using AI agents
- Vector embedding generation
- Database storage operations
Vector Database (Qdrant)
- Purpose: Store and search code snippet embeddings
- Technology: Qdrant vector database
- Features:
- High-performance similarity search
- Metadata filtering (language, repository)
- Scalable vector storage
- Built-in clustering and indexing
MCP Server (FastMCP)
- Location:
src/mcpserver/ - Purpose: Model Context Protocol integration
- Technology: FastMCP, mounted on main API
- Tools Provided:
search: Semantic snippet search for Claude Code
Data Flow
Repository Processing Flow
- User Input: Repository URL submitted via web interface
- API Validation: URL validation and repository metadata extraction
- Queue Job: Processing job added to Redis queue
- Worker Processing:
- Clone repository to temporary storage
- Filter files by type and size
- Extract meaningful code snippets using Claude Code Agents
- Generate vector embeddings for each snippet
- Store in Qdrant with metadata
- Status Updates: Real-time status updates via polling
- Completion: Repository marked as processed, snippets available for search
Search Flow
- Query Input: User enters search query (Web UI or MCP)
- Query Embedding: Convert query to vector using same embedding model
- Vector Search: Qdrant performs similarity search with filters
- Result Ranking: Results reranked by cohere
Related Skills
Hook Development
111.6kThis skill should be used when the user asks to "create a hook", "add a PreToolUse/PostToolUse/Stop hook", "validate tool use", "implement prompt-based hooks", "use ${CLAUDE_PLUGIN_ROOT}", "set up event-driven automation", "block dangerous commands", or mentions hook events (PreToolUse, PostToolUse, Stop, SubagentStop, SessionStart, SessionEnd, UserPromptSubmit, PreCompact, Notification). Provides comprehensive guidance for creating and implementing Claude Code plugin hooks with focus on advanced prompt-based hooks API.
MCP Integration
111.6kThis skill should be used when the user asks to "add MCP server", "integrate MCP", "configure MCP in plugin", "use .mcp.json", "set up Model Context Protocol", "connect external service", mentions "${CLAUDE_PLUGIN_ROOT} with MCP", or discusses MCP server types (SSE, stdio, HTTP, WebSocket). Provides comprehensive guidance for integrating Model Context Protocol servers into Claude Code plugins for external tool and service integration.
Plugin Structure
111.6kThis skill should be used when the user asks to "create a plugin", "scaffold a plugin", "understand plugin structure", "organize plugin components", "set up plugin.json", "use ${CLAUDE_PLUGIN_ROOT}", "add commands/agents/skills/hooks", "configure auto-discovery", or needs guidance on plugin directory layout, manifest configuration, component organization, file naming conventions, or Claude Code plugin architecture best practices.
Skill Development
111.6kThis skill should be used when the user wants to "create a skill", "add a skill to plugin", "write a new skill", "improve skill description", "organize skill content", or needs guidance on skill structure, progressive disclosure, or skill development best practices for Claude Code plugins.
