Results for "document-ocr"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

323 skills found · Page 1 of 11

PaddlePaddle / PaddleOCR

74.6k

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

universal

ai4sciencechineseocrdocument-parsing+10

Updated 6m ago

run-llama / Llama Index

48.2k

LlamaIndex is the leading document agent and OCR platform

universal

agentsapplicationdata+7

Updated 9m ago

getomni-ai / Zerox

12.2k

OCR & Document Extraction using vision models

universal

ocrpdf

Updated 9h ago

clovaai / Donut

6.8k

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

universal

computer-visiondocument-aieccv-2022+3

Updated 3h ago

mindee / Doctr

6.0k

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

universal

deep-learningdocument-recognitionocr+6

Updated 1h ago

run-llama / Liteparse

3.5k

A fast, helpful, and open-source document parser

universal

document-ocrdocument-processingocr+4

Updated 1h ago

ocropus-archive / DUP Ocropy

3.5k

Python-based tools for document analysis and OCR

universal

Updated 8h ago

CatchTheTornado / Text Extract Api

3.1k

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

universal

anonymizationapiextract+6

Updated 2h ago

Nutlope / Llama Ocr

2.4k

Document to Markdown OCR library with Llama 3.2 vision

universal

Updated 4d ago

WZBSocialScienceCenter / Pdftabextract

2.3k

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

universal

data-miningimage-processingocr+3

Updated 16d ago

icereed / Paperless Gpt

2.2k

Use LLMs and LLM Vision (OCR) to handle paperless-ngx - Document Digitalization powered by AI

universal

aichatgptllm+5

Updated 7h ago

tjmlabs / ColiVara

1.5k

Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has state of the art retrieval performance on both text and visual documents. using vision models instead of chunking and text-processing for documents. No OCR, no text extraction, no broken tables, or missing images.

universal

Updated 1h ago

NanoNets / Docstrange

1.4k

Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

universal

aidocument-parserdocument-parsing+10

Updated 7h ago

Topdu / OpenOCR

1.3k

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.

universal

chineseocrdocument-analysisdocument-parsing+5

Updated 14h ago

opensemanticsearch / Open Semantic Search

1.2k

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)

universal

annotationfaceted-searchfulltext-search+17

Updated 1d ago

scribeocr / Scribeocr

775

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

zed

abbyyhocrocr+2

Updated 1d ago

drmingler / Docling Api

759

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.

universal

apifastapimarkdown-parser+6

Updated 4d ago

kreuzberg-dev / Html To Markdown

614

High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.

universal

hocrhtmlhtml-converter+5

Updated 21m ago

fufankeji / DeepSeek OCR Web

546

Out-of-the-box DeepSeek OCR document parsing Web Studio

universal

Updated 10h ago

opendatalab / MinerU Diffusion

357

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding. Topics

universal

ai4sciencediffusiondlm+13

Updated 6h ago