MinerU2PPT
Enhanced MinerU to PPTX converter with AI-powered text correction, multi-language support, and professional GUI. Inspired by JuniverseCoder's original MinerU2PPT.
Install / Use
/learn @arlinamid/MinerU2PPTREADME
MinerU to PPTX Converter
This tool converts PDF files and images into editable PowerPoint presentations (.pptx) by leveraging structured data from the MinerU PDF Extractor. It accurately reconstructs text, images, and layout, providing a high-fidelity, editable version of the original document.
🙏 Acknowledgment: This project is inspired by and extends the original MinerU2PPT by JuniverseCoder. We've added significant enhancements including AI-powered text correction, multi-language support, advanced rendering features, and comprehensive developer tools.
The application features a user-friendly graphical interface (GUI) and is designed for easy use.

🚀 Quick Start
Step 1: Get MinerU JSON Data
Choose your preferred method to extract document structure:
🌐 Web-based (Recommended)
- Visit: MinerU PDF Extractor
- Upload your PDF/image → Download JSON
💻 Desktop App
- Download: MinerU Desktop App
- Install → Process files → Export JSON
Step 2: Download MinerU2PPTX
- Get the latest MinerU2PPTX.exe (no installation needed)
Step 3: Convert
- Run MinerU2PPTX.exe → Select your original file + JSON → Convert!
For Users: How to Use
As a user, you only need the standalone executable file (e.g., MinerU2PPT.exe). You do not need to install Python or any libraries.
-
Download the Application: Get the latest executable from the project's Releases page.
-
Get the MinerU JSON File:
You have two options to extract the JSON data needed for conversion:
Option A: Web-based Extractor (Recommended)
- Visit the MinerU PDF/Image Extractor website
- Upload your PDF or image file and let it process
- Download the resulting JSON file
Option B: Desktop Application
- Download the MinerU Desktop App (Windows)
- Install and run the application locally
- Process your PDF/image files to generate JSON data
The JSON file contains the structural information that our tool needs for accurate conversion.

-
Run the Converter:
- Double-click the executable to start the application.
- Select Input File: Drag and drop your PDF or image file onto the first input field, or use the "Browse..." button.
- Select JSON File: Drag and drop the JSON file you downloaded from MinerU onto the second input field.
- Output Path: The output path for your new PowerPoint file will be automatically filled in. You can change it by typing directly or using the "Save As..." button.
- Options:
- Remove Watermark: Check this box to automatically erase elements like page numbers or footers.
- Generate Debug Images: Keep this unchecked unless you are troubleshooting.
- Click Start Conversion.
-
Open Your File: Once the conversion is complete, click the "Open Output Folder" button to find your new
.pptxfile.
Using Batch Mode
The application also supports converting multiple files at once in Batch Mode.
- Switch to Batch Mode: Click the "Batch Mode" button in the top-right corner of the application. The interface will switch to the batch processing view.
- Add Tasks:
- Click the "Add Task" button. A new window will pop up.
- In the popup, select the Input File, the corresponding MinerU JSON File, and specify the Output Path.
- Set the Remove Watermark option for this specific task.
- Click "OK" to add the task to the list.
- Manage Tasks: You can add multiple tasks to the list. If you need to remove a task, select it from the list and click "Delete Task".
- Start Batch Conversion: Once all your tasks are added, click "Start Batch Conversion". The application will process each task sequentially. A log will show the progress for each file.
For Developers
This section provides instructions for running the application from source and packaging it for distribution.
Environment Setup
- Clone the repository.
- It is recommended to use a virtual environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate - Install the required dependencies from
requirements.txt.pip install -r requirements.txt
Running from Source
- To run the GUI application:
python gui.py - To use the CLI (sub-commands):
# Convert PDF to PPTX python main.py convert --json <path_to_json> --input <path_to_pdf> --output <path_to_pptx> [OPTIONS] # Extract images from PDF using MinerU JSON bounding boxes python main.py extract-images --json <path_to_json> --input <path_to_pdf> --output <folder> [--dpi 200] [--overwrite] # Show version python main.py --version
Packaging as a Standalone Executable (.exe)
You can package the GUI application into a single .exe file for easy distribution.
-
Install PyInstaller:
pip install pyinstaller -
Build the Executable: Run the
pyinstallercommand from the project's root directory. Use the--nameflag to specify a professional name for your application.--windowed: Prevents a console window from appearing in the background.--onefile: Bundles everything into a single executable file.--name: Sets the name of the final executable.
# Build both executables using the provided spec files pyinstaller --clean cli.spec # → dist/cli.exe (console) pyinstaller --clean MinerU2PPTX.spec # → dist/MinerU2PPTX.exe (GUI) # Or use the build script python build_exe.py -
Find the Executables:
dist/cli.exe— Console CLI for scripting and automationdist/MinerU2PPTX.exe— Windowed GUI application
📚 Documentation
For comprehensive documentation, please visit the docs/ folder:
- 📖 Complete User Guide - Interactive manual with 3-language support
- 🏗️ Build Instructions - Development setup and deployment
- 🤖 AI Integration Guide - AI services configuration and usage
- 🌍 Translation System - Multi-language support details
- 📋 All Documentation - Complete documentation index
📝 Changelog
All notable changes to this project are documented in CHANGELOG.md:
- v2.0.1 (Latest) - CLI sub-commands, PDF image extractor, reduced exe size
- v2.0.0 - Major release with AI integration, multi-language support, and professional GUI
- All Versions - Complete version history and feature additions
- Planned Features - Upcoming enhancements and roadmap
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
Attribution
This enhanced version is inspired by and extends JuniverseCoder's original MinerU2PPT. The significant enhancements (AI integration, multi-language support, professional GUI, comprehensive documentation) are original contributions licensed under MIT.
Third-Party
This software uses various open-source libraries and AI services. Please see the LICENSE file for complete attribution and third-party acknowledgments.
💖 Support
If this tool helps you, consider:
- ⭐ Star the repository
- ☕ Buy me a coffee
- 🐛 Report issues for improvements
