PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Install / Use
/learn @PaddlePaddle/PaddleOCRREADME
English | 简体中文 | 繁體中文 | 日本語 | 한국어 | Français | Русский | Español | العربية
<!-- icon -->PaddleOCR is an industry-leading, production-ready OCR and document AI engine, offering end-to-end solutions from text extraction to intelligent document understanding
</div>PaddleOCR
[!TIP] PaddleOCR now provides an MCP server that supports integration with Agent applications like Claude Desktop. For details, please refer to PaddleOCR MCP Server.
The PaddleOCR 3.0 Technical Report is now available. See details at: PaddleOCR 3.0 Technical Report.
The PaddleOCR-VL Technical Report is now available. See details at PaddleOCR-VL Technical Report.
The Beta version of the PaddleOCR official website is now live, offering a more convenient online experience and large-scale PDF file parsing, as well as free API and MCP services. For more details, please visit the PaddleOCR official website.
PaddleOCR converts documents and images into structured, AI-friendly data (like JSON and Markdown) with industry-leading accuracy—powering AI applications for everyone from indie developers and startups to large enterprises worldwide. With over 60,000 stars and deep integration into leading projects like MinerU, RAGFlow, pathway and cherry-studio, PaddleOCR has become the premier solution for developers building intelligent document applications in the AI era.
PaddleOCR 3.0 Core Features
[