Subsearch
semantic search for any subreddit. indexes posts as vectors, finds what you're actually looking for instead of matching keywords
Install / Use
/learn @rohhann12/SubsearchREADME
SubSearch
semantic search engine for reddit. type a concept, get back relevant posts — no keyword matching.
how it works
scraping — hits the reddit API for a given subreddit, pulls recent posts + metadata.
vectorizing — a small docker sidecar we built (./embedding-service): FastAPI + all-MiniLM-L6-v2. hit POST /embed with text, get back a vector. runs on port 8001, starts before the backend.
storage — embeddings go into ChromaDB, a lightweight vector database running in its own container.
search — your query gets embedded the same way, then ChromaDB finds the closest matches by cosine similarity.
backend — Express server wires it all together. exposes endpoints to index a subreddit and search across it.
frontend — Next.js UI, dark theme, nothing fancy. just a search box that hits the backend.
stack
chromadb— vector store, port 8000embedding-service— sentence-transformers (all-MiniLM-L6-v2), port 8001backend— Express/TypeScript, port 3004ui— Next.js 16 + Tailwind v4, port 3000
running it
docker-compose up
everything starts together. hit localhost:3000 when it's up.
indexing a subreddit
GET /index?subreddit=sideprojects&limit=100
then search:
POST /search
{ "query": "indie hackers building in public", "limit": 10 }
Related Skills
node-connect
345.9kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
106.4kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
345.9kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
345.9kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
