SkillAgentSearch skills...

MinerU2PPT

Enhanced MinerU to PPTX converter with AI-powered text correction, multi-language support, and professional GUI. Inspired by JuniverseCoder's original MinerU2PPT.

Install / Use

/learn @arlinamid/MinerU2PPT
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

简体中文

License: MIT Release Downloads Stars Issues

MinerU to PPTX Converter

This tool converts PDF files and images into editable PowerPoint presentations (.pptx) by leveraging structured data from the MinerU PDF Extractor. It accurately reconstructs text, images, and layout, providing a high-fidelity, editable version of the original document.

🙏 Acknowledgment: This project is inspired by and extends the original MinerU2PPT by JuniverseCoder. We've added significant enhancements including AI-powered text correction, multi-language support, advanced rendering features, and comprehensive developer tools.

The application features a user-friendly graphical interface (GUI) and is designed for easy use.

GUI Screenshot

🚀 Quick Start

Step 1: Get MinerU JSON Data

Choose your preferred method to extract document structure:

🌐 Web-based (Recommended)

💻 Desktop App

Step 2: Download MinerU2PPTX

Step 3: Convert

  • Run MinerU2PPTX.exe → Select your original file + JSON → Convert!

For Users: How to Use

As a user, you only need the standalone executable file (e.g., MinerU2PPT.exe). You do not need to install Python or any libraries.

  1. Download the Application: Get the latest executable from the project's Releases page.

  2. Get the MinerU JSON File:

    You have two options to extract the JSON data needed for conversion:

    Option A: Web-based Extractor (Recommended)

    Option B: Desktop Application

    • Download the MinerU Desktop App (Windows)
    • Install and run the application locally
    • Process your PDF/image files to generate JSON data

    The JSON file contains the structural information that our tool needs for accurate conversion. Download JSON

  3. Run the Converter:

    • Double-click the executable to start the application.
    • Select Input File: Drag and drop your PDF or image file onto the first input field, or use the "Browse..." button.
    • Select JSON File: Drag and drop the JSON file you downloaded from MinerU onto the second input field.
    • Output Path: The output path for your new PowerPoint file will be automatically filled in. You can change it by typing directly or using the "Save As..." button.
    • Options:
      • Remove Watermark: Check this box to automatically erase elements like page numbers or footers.
      • Generate Debug Images: Keep this unchecked unless you are troubleshooting.
    • Click Start Conversion.
  4. Open Your File: Once the conversion is complete, click the "Open Output Folder" button to find your new .pptx file.

Using Batch Mode

The application also supports converting multiple files at once in Batch Mode.

  1. Switch to Batch Mode: Click the "Batch Mode" button in the top-right corner of the application. The interface will switch to the batch processing view.
  2. Add Tasks:
    • Click the "Add Task" button. A new window will pop up.
    • In the popup, select the Input File, the corresponding MinerU JSON File, and specify the Output Path.
    • Set the Remove Watermark option for this specific task.
    • Click "OK" to add the task to the list.
  3. Manage Tasks: You can add multiple tasks to the list. If you need to remove a task, select it from the list and click "Delete Task".
  4. Start Batch Conversion: Once all your tasks are added, click "Start Batch Conversion". The application will process each task sequentially. A log will show the progress for each file.

For Developers

This section provides instructions for running the application from source and packaging it for distribution.

Environment Setup

  1. Clone the repository.
  2. It is recommended to use a virtual environment:
    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
    
  3. Install the required dependencies from requirements.txt.
    pip install -r requirements.txt
    

Running from Source

  • To run the GUI application:
    python gui.py
    
  • To use the CLI (sub-commands):
    # Convert PDF to PPTX
    python main.py convert --json <path_to_json> --input <path_to_pdf> --output <path_to_pptx> [OPTIONS]
    
    # Extract images from PDF using MinerU JSON bounding boxes
    python main.py extract-images --json <path_to_json> --input <path_to_pdf> --output <folder> [--dpi 200] [--overwrite]
    
    # Show version
    python main.py --version
    

Packaging as a Standalone Executable (.exe)

You can package the GUI application into a single .exe file for easy distribution.

  1. Install PyInstaller:

    pip install pyinstaller
    
  2. Build the Executable: Run the pyinstaller command from the project's root directory. Use the --name flag to specify a professional name for your application.

    • --windowed: Prevents a console window from appearing in the background.
    • --onefile: Bundles everything into a single executable file.
    • --name: Sets the name of the final executable.
    # Build both executables using the provided spec files
    pyinstaller --clean cli.spec          # → dist/cli.exe (console)
    pyinstaller --clean MinerU2PPTX.spec  # → dist/MinerU2PPTX.exe (GUI)
    
    # Or use the build script
    python build_exe.py
    
  3. Find the Executables:

    • dist/cli.exe — Console CLI for scripting and automation
    • dist/MinerU2PPTX.exe — Windowed GUI application

📚 Documentation

For comprehensive documentation, please visit the docs/ folder:

📝 Changelog

All notable changes to this project are documented in CHANGELOG.md:

  • v2.0.1 (Latest) - CLI sub-commands, PDF image extractor, reduced exe size
  • v2.0.0 - Major release with AI integration, multi-language support, and professional GUI
  • All Versions - Complete version history and feature additions
  • Planned Features - Upcoming enhancements and roadmap

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Attribution

This enhanced version is inspired by and extends JuniverseCoder's original MinerU2PPT. The significant enhancements (AI integration, multi-language support, professional GUI, comprehensive documentation) are original contributions licensed under MIT.

Third-Party

This software uses various open-source libraries and AI services. Please see the LICENSE file for complete attribution and third-party acknowledgments.

💖 Support

If this tool helps you, consider:

View on GitHub
GitHub Stars5
CategoryCustomer
Updated19d ago
Forks2

Languages

Python

Security Score

70/100

Audited on Mar 5, 2026

No findings