TrOCR

Handwritten Text Recognition

University of Windsor 4990/4960 Sign-In Sheet Name Recognizer

Welcome to the University of Windsor's SmartCampus Text Recognition project! This open-source repository helps the University of Windsor community recognize text from forms using a customized version of Microsoft's TrOCR model. A Gradio web interface makes it easy to interact with the underlying machine learning model.

Features

  • Built on a customized TrOCR model for text recognition.
  • Uses YOLO object detection to localize text in the image.
  • User-friendly interface powered by Gradio.
  • Quickly recognize text from forms and documents.
  • Customizable for your specific use case.
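
The features above suggest a two-stage flow: a detector finds text regions, then a recognizer reads each region. The sketch below illustrates only that flow, using the standard library. The names `recognize_form`, `crop`, `detect`, and `recognize` are hypothetical and are not taken from this repository; `detect` stands in for the YOLO detector and `recognize` for the TrOCR model, and the nested-list image type is a placeholder for a real image object.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# (left, top, right, bottom) in pixel coordinates; right and bottom are exclusive.
Box = Tuple[int, int, int, int]
# A grayscale image as rows of pixel values; a stand-in for a real image type.
Image = List[List[int]]


@dataclass
class Recognition:
    box: Box
    text: str


def crop(image: Image, box: Box) -> Image:
    """Return the sub-image that lies inside the given box."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def recognize_form(
    image: Image,
    detect: Callable[[Image], Sequence[Box]],
    recognize: Callable[[Image], str],
) -> List[Recognition]:
    """Detect text regions, then run recognition on each cropped region.

    `detect` and `recognize` are hypothetical callables standing in for the
    YOLO detector and the TrOCR model; they are not this project's API.
    """
    # Sort top-to-bottom, then left-to-right, so results follow reading order.
    boxes = sorted(detect(image), key=lambda b: (b[1], b[0]))
    return [Recognition(box=b, text=recognize(crop(image, b))) for b in boxes]
```

Passing the detector and recognizer in as callables keeps the flow testable with dummy functions and makes it straightforward to swap either model when customizing the project.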

Getting Started

Before you get started, ensure you have the following prerequisites in place:

  • Python 3.10+
  • pip package manager
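
If you want to confirm these prerequisites before installing, a small check like the following may help. It is a minimal sketch, not part of this repository; the function name `check_prerequisites` is hypothetical, and the check only looks at the interpreter that runs it.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 10)


def check_prerequisites(version=None):
    """Return a list of problems found; an empty list means ready to install."""
    # Default to the running interpreter's (major, minor) version.
    version = tuple(version or sys.version_info[:2])
    problems = []
    if version < MIN_PYTHON:
        problems.append(
            f"Python {MIN_PYTHON[0]}.{MIN_PYTHON[1]}+ required, "
            f"found {version[0]}.{version[1]}"
        )
    # find_spec returns None when the pip module cannot be imported.
    if importlib.util.find_spec("pip") is None:
        problems.append("pip is not available for this interpreter")
    return problems
```

Calling `check_prerequisites()` with no arguments inspects the current interpreter; passing a tuple such as `(3, 9)` is only useful for testing the version comparison.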

Installation

  1. Clone this repository to your local machine:
git clone https://github.com/rajdhal/trOCR.git
  2. Navigate to the project directory:
cd trOCR
  3. (Optional but recommended) Create and activate a virtual environment to keep the project's dependencies isolated. The activation command below is for Linux and macOS; on Windows, run venv\Scripts\activate instead:
python3 -m venv venv
source venv/bin/activate
  4. Install the required dependencies using pip:
pip install -r requirements.txt

Usage

  1. Run the Gradio web application:
python3 main.py
  2. Open your web browser and navigate to http://localhost:7860 (Gradio's default address).

  3. Upload an image of the form you want to recognize text from.

  4. Click the "Submit" button. The output shows the annotated image alongside a downloadable CSV file.

  5. To adapt the application to your own forms, modify the Gradio interface and the machine learning model as needed.
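
As a starting point for customizing the interface, the sketch below shows one way an image-in, image-plus-CSV-out Gradio app could be wired up. It is an illustration under assumptions and does not reproduce this repository's `main.py`: the function names, the CSV column names, and the placeholder row are all hypothetical, and `process_form` returns the image unchanged instead of running any model.

```python
import csv
import tempfile
from typing import Iterable, Tuple


def write_results_csv(rows: Iterable[Tuple[int, str]]) -> str:
    """Write (row_number, text) pairs to a temporary CSV and return its path."""
    # delete=False keeps the file on disk so the web UI can offer it for download.
    with tempfile.NamedTemporaryFile(
        "w", suffix=".csv", newline="", encoding="utf-8", delete=False
    ) as handle:
        writer = csv.writer(handle)
        writer.writerow(["row", "text"])  # hypothetical column names
        writer.writerows(rows)
        return handle.name


def process_form(image):
    """Placeholder callback: returns the image unchanged plus a CSV path.

    A real callback would run detection and recognition on the image here.
    """
    rows = [(1, "example name")]  # placeholder, not real model output
    return image, write_results_csv(rows)


def build_demo():
    """Wire the callback into a Gradio Interface (requires the gradio package)."""
    import gradio as gr  # imported lazily so the helpers above work without it

    return gr.Interface(
        fn=process_form,
        inputs=gr.Image(type="pil", label="Form image"),
        outputs=[gr.Image(label="Annotated image"), gr.File(label="Results CSV")],
    )
```

With Gradio installed, `build_demo().launch()` starts the app on Gradio's default address. Keeping the CSV writer separate from the interface code means the output format can be changed without touching the UI.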

Contributing

We welcome contributions from the University of Windsor community and the open-source community at large. If you'd like to contribute to this project, please follow these steps:

  1. Fork this repository.
  2. Create a new branch for your feature or bug fix.
  3. Make your changes and commit them.
  4. Push your changes to your fork.
  5. Create a pull request back to this repository.

We hope that the University of Windsor's SmartCampus Text Recognition project simplifies text recognition tasks for the university community. If you have any questions, feedback, or suggestions, please don't hesitate to reach out. We look forward to collaborating with you to improve and expand this application.
