Results for "llm-document-processing"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

51 skills found · Page 1 of 2

lotus-data / Lotus

1.6k

AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that's as simple as writing Pandas code

universal

ai-data-processingdatallm+7

Updated 7h ago

swiss-ai / Mmore

199

Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!

universal

Updated 15h ago

Dicklesworthstone / Ultimate MCP Server

144

Comprehensive MCP server exposing dozens of capabilities to AI agents: multi-provider LLM delegation, browser automation, document processing, vector ops, and cognitive memory systems

claude codecursor

agentllmmcp+1

Updated 5d ago

HaxyMoly / Vicuna LangChain

A simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Supports both Chinese and English, and can process PDF, HTML, and DOCX formats of documents as knowledge base.

universal

Updated 6mo ago

marieai / Marie AI

Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pipelines (GenAI, LLM, VLLM) into your applications, supporting various tasks such as document cleanup, optical character recognition (OCR), classification, splitting, named entity recognition, and form processing

universal

dockerdocument-layout-analysisdocument-parser+14

Updated 21h ago

PStarH / LLM Boost Recognition

OCR and Voice Recognition Module: Effortlessly convert documents and audio into actionable text using advanced OCR engines and voice recognition technologies, featuring LLM correction and GPU acceleration—perfect for processing all kinds of hard data like math formula!

universal

Updated 1mo ago

LiuYuancheng / Threats 2 MITRE AI Mapper

The objective of this program is to leverage AI-LLM technology to process of human language-based CTI documents to succinctly summarize the attack flow path outlined within such materials via mapping the attack behaviors to the MITRE-ATT&CK and matching the vulnerabilities to MITRE-CWE.

universal

Updated 1d ago

manas95826 / Empire Chain

Empire Chain is a Python framework that orchestrates all your AI needs by seamlessly integrating LLMs (OpenAI, Anthropic, Groq), vector stores (Qdrant, ChromaDB), document processing, speech-to-text, web crawling, data visualization, and interactive chatbots into a unified interface, making it easy to build powerful AI applications like RAG systems

claude codeclaude desktop

Updated 3d ago

climatechange-ai-tutorials / Nlp Policy Analysis

Explore how Natural Language Processing (NLP) can be used to assist in identifying and mapping climate-relevant literature using a supervised learning approach and leverage a state of the art Large Language Model (LLM) to classify climate policy documents.

universal

Updated 1mo ago

xxxbrian / MCP Rquest

A MCP server providing realistic browser-like HTTP request capabilities with accurate TLS/JA3/JA4 fingerprints for bypassing anti-bot measures. It also supports converting PDF and HTML documents to Markdown for easier processing by LLMs.

claude codeclaude desktop+1

aiclaudehttp-requests+2

Updated 4d ago

sno-ai / Magi Markdown

MAGI: Markdown for Agent Guidance & Instruction - A next-generation markdown extension designed specifically for AI systems. MAGI enhances standard markdown with structured metadata, embedded AI instructions, and explicit document relationships, creating a seamless bridge between human-readable content and LLM/agent processing. Perfect for RAG,KAG

universal

aiai-agentsai-native+9

Updated 8d ago

robert-mcdermott / Doc2md

A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.

universal

Updated 3d ago

unit-mesh / Co Unit

CoUnit，一个基于 LLM 的虚拟团队接口人（API），通过向量化文档、知识库、SDK和 API 等，结合 LLM 智能化团队间对接与协作。Merge artificial intelligence seamlessly with team collaboration. Leverage intelligent vectorization to process documents, knowledge bases, SDKs, and APIs, empowering teams to unleash their creativity.

universal

aigcgenaigenai-poc

Updated 1y ago

330205812 / NexusRAG

A knowledge base backend system for LLMs with full-text search, semantic retrieval, and knowledge graph querying. Ready-to-use modules for document processing and RAG, enabling quick deployment of enterprise knowledge retrieval systems.

universal

Updated 9mo ago

zircote / Rlm Rs

Rust CLI implementing the Recursive Language Model (RLM) pattern for Claude Code. Process documents 100x larger than context windows through intelligent chunking, SQLite persistence, and recursive sub-LLM orchestration.

claude codeclaude desktop

ai-toolschunkingclaude+17

Updated 5h ago

hivellm / Transmutation

Transmutation is a Rust-based document conversion module designed to transform various file formats into optimized text and image outputs suitable for LLM processing and vector embeddings. Built as a core component of the HiveLLM Vectorizer ecosystem, it leverages [Docling](https://github.com/docling-project) for advanced document understanding.

zed

audiodocximage+4

Updated 14d ago

junfanz1 / Cognito LangGraph RAG Chatbot

This project implements an advanced Retrieval Augmented Generation (RAG) workflow to enhance question-answering accuracy and reduce LLM hallucinations. It leverages LangGraph to create a stateful, multi-step process that includes document retrieval, relevance grading, and web search fallback.

universal

chromadblangchainlanggraph+2

Updated 1mo ago

AdityaBhatt3010 / Universal Offline AI Chatbot

Universal Offline AI Chatbot is a lightweight, extensible local assistant that answers questions based on your own PDFs — legal, technical, scientific, educational, or enterprise documents. It uses a fast, locally hosted LLM via Ollama, and processes your files using semantic search to provide meaningful answers.

universal

Updated 26d ago

SAP-samples / Multimodal Generative AI For Bpm

Contains dataset and source code for the thesis "Generative AI for Business Process Management - Suitability of Modalities". Aims to evaluate feasibility of generating structured process models from unstructured documents containing images and texts using multimodal LLMs

universal

bpmdatasetgenai+2

Updated 16d ago

sergioq2 / Law Bot Assistance

This repository is about an APP to help lawyers to process law documents and suit cases using AI Agents trained with OpenAI and others LLMs frameworks

universal

Updated 1mo ago