88 skills found · Page 1 of 3
isaacus-dev / SemchunkA fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
benbrandt / Text SplitterSplit text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
mirth / ChonkyFully neural approach for text chunking
chonkie-inc / Chunk🚀 The Fastest Chunker in the West 🇺🇸 Upto 1TB/s "semantic" chunking, quick and easy!
andrea9293 / MCP Documentation ServerMCP Documentation Server - Bridge the AI Knowledge Gap. ✨ Features: Document management • Gemini integration • AI-powered semantic search • File uploads • Smart chunking • Multilingual support • Zero-setup 🎯 Perfect for: New frameworks • API docs • Internal guides
rango-ramesh / Advanced ChunkerSemantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text.
zeroentropy-ai / ZchunkA new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
Ayanami0730 / AragA-RAG: Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces. State-of-the-art RAG framework with keyword, semantic, and chunk read tools for multi-hop QA.
biplab-iitb / PractNLPToolsPractical Natural Language Processing Tools for Humans. Dependency Parsing, Syntactic Constituent Parsing, Semantic Role Labeling, Named Entity Recognisation, Shallow chunking, Part of Speech Tagging, all in Python.
jparkerweb / Semantic Chunking🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
curiousily / RagbaseCompletely local RAG. Chat with your PDF documents (with open LLM) and UI to that uses LangChain, Streamlit, Ollama (Llama 3.1), Qdrant and advanced methods like reranking and semantic chunking.
revelrylabs / Text Chunker ExA library for semantically coherent text chunking
agamm / Semantic SplitA Python library to chunk/group your texts based on semantic similarity.
baryhuang / MCP Server Any OpenapiA MCP server that enables Claude to discover and call any API endpoint through semantic search. Intelligently chunks OpenAPI specifications to handle large API documentation, with built-in request execution capabilities. Perfect for integrating private APIs with Claude Desktop.
mburaksayici / RAG BoilerplateRAG boilerplate with semantic/propositional chunking, hybrid search (BM25 + dense), LLM reranking, query enhancement agents, CrewAI orchestration, Qdrant vector search, Redis/Mongo sessioning, Celery ingestion pipeline, Gradio UI, and an evaluation suite (Hit-Rate, MRR, hybrid configs).
wangxj03 / Code SplitterSplit code into semantic chunks
bobmatnyc / AI Code ReviewAI-powered code review CLI with multiple providers (Gemini, Claude, OpenAI). Features 95%+ token reduction via semantic chunking, 7 review types (security/performance/evaluation), multi-language support, interactive fixes, and developer skill assessment.
Ambuj123-lab / Agentic Rag Financial ParserEnterprise RAG ecosystem managing 15,000+ semantic chunks. Features hybrid parsing (LlamaParse/PyMuPDF) and 256-dim MRL embeddings for 512MB RAM environments
lesteroliver911 / Contextual Chunking Graphpowered RagAn advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs to improve accuracy. It ensures precise query responses with LLM-based answer checking and graph visualization.
Lokesh-Chimakurthi / Reliable RAGReliable RAG setup that uses Semantic Double Merging Chunking from llamaindex, Qdrant Hybrid Search, colBERT for reranking and Google Gemini as LLM.