51 skills found · Page 1 of 2
lotus-data / LotusAI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that's as simple as writing Pandas code
swiss-ai / MmoreMassive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!
Dicklesworthstone / Ultimate MCP ServerComprehensive MCP server exposing dozens of capabilities to AI agents: multi-provider LLM delegation, browser automation, document processing, vector ops, and cognitive memory systems
HaxyMoly / Vicuna LangChainA simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Supports both Chinese and English, and can process PDF, HTML, and DOCX formats of documents as knowledge base.
marieai / Marie AIComplex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pipelines (GenAI, LLM, VLLM) into your applications, supporting various tasks such as document cleanup, optical character recognition (OCR), classification, splitting, named entity recognition, and form processing
PStarH / LLM Boost RecognitionOCR and Voice Recognition Module: Effortlessly convert documents and audio into actionable text using advanced OCR engines and voice recognition technologies, featuring LLM correction and GPU acceleration—perfect for processing all kinds of hard data like math formula!
LiuYuancheng / Threats 2 MITRE AI MapperThe objective of this program is to leverage AI-LLM technology to process of human language-based CTI documents to succinctly summarize the attack flow path outlined within such materials via mapping the attack behaviors to the MITRE-ATT&CK and matching the vulnerabilities to MITRE-CWE.
manas95826 / Empire ChainEmpire Chain is a Python framework that orchestrates all your AI needs by seamlessly integrating LLMs (OpenAI, Anthropic, Groq), vector stores (Qdrant, ChromaDB), document processing, speech-to-text, web crawling, data visualization, and interactive chatbots into a unified interface, making it easy to build powerful AI applications like RAG systems
climatechange-ai-tutorials / Nlp Policy AnalysisExplore how Natural Language Processing (NLP) can be used to assist in identifying and mapping climate-relevant literature using a supervised learning approach and leverage a state of the art Large Language Model (LLM) to classify climate policy documents.
xxxbrian / MCP RquestA MCP server providing realistic browser-like HTTP request capabilities with accurate TLS/JA3/JA4 fingerprints for bypassing anti-bot measures. It also supports converting PDF and HTML documents to Markdown for easier processing by LLMs.
sno-ai / Magi MarkdownMAGI: Markdown for Agent Guidance & Instruction - A next-generation markdown extension designed specifically for AI systems. MAGI enhances standard markdown with structured metadata, embedded AI instructions, and explicit document relationships, creating a seamless bridge between human-readable content and LLM/agent processing. Perfect for RAG,KAG
robert-mcdermott / Doc2mdA utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.
unit-mesh / Co UnitCoUnit,一个基于 LLM 的虚拟团队接口人(API),通过向量化文档、知识库、SDK和 API 等,结合 LLM 智能化团队间对接与协作。Merge artificial intelligence seamlessly with team collaboration. Leverage intelligent vectorization to process documents, knowledge bases, SDKs, and APIs, empowering teams to unleash their creativity.
330205812 / NexusRAGA knowledge base backend system for LLMs with full-text search, semantic retrieval, and knowledge graph querying. Ready-to-use modules for document processing and RAG, enabling quick deployment of enterprise knowledge retrieval systems.
zircote / Rlm RsRust CLI implementing the Recursive Language Model (RLM) pattern for Claude Code. Process documents 100x larger than context windows through intelligent chunking, SQLite persistence, and recursive sub-LLM orchestration.
hivellm / TransmutationTransmutation is a Rust-based document conversion module designed to transform various file formats into optimized text and image outputs suitable for LLM processing and vector embeddings. Built as a core component of the HiveLLM Vectorizer ecosystem, it leverages [Docling](https://github.com/docling-project) for advanced document understanding.
junfanz1 / Cognito LangGraph RAG ChatbotThis project implements an advanced Retrieval Augmented Generation (RAG) workflow to enhance question-answering accuracy and reduce LLM hallucinations. It leverages LangGraph to create a stateful, multi-step process that includes document retrieval, relevance grading, and web search fallback.
AdityaBhatt3010 / Universal Offline AI ChatbotUniversal Offline AI Chatbot is a lightweight, extensible local assistant that answers questions based on your own PDFs — legal, technical, scientific, educational, or enterprise documents. It uses a fast, locally hosted LLM via Ollama, and processes your files using semantic search to provide meaningful answers.
SAP-samples / Multimodal Generative AI For BpmContains dataset and source code for the thesis "Generative AI for Business Process Management - Suitability of Modalities". Aims to evaluate feasibility of generating structured process models from unstructured documents containing images and texts using multimodal LLMs
sergioq2 / Law Bot AssistanceThis repository is about an APP to help lawyers to process law documents and suit cases using AI Agents trained with OpenAI and others LLMs frameworks