TrOCR

Handwritten Text Recognition

University of Windsor 4990/4960 Sign-In Sheet Name Recognizer

Welcome to the University of Windsor's SmartCampus Text Recognition project! This open-source repository helps the University of Windsor community recognize text from forms using a customized version of Microsoft's TrOCR model. A Gradio web interface lets you interact with the underlying machine learning model from your browser.

Features

Text recognition with our customized TrOCR model.
YOLO object detection to locate text regions in the image.
User-friendly web interface powered by Gradio.
Fast text recognition from forms and documents.
Customizable for your specific use case.
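
The detect-then-recognize flow implied by the features above can be sketched as follows. This is a minimal illustration under stated assumptions, not the repository's code: `detect`, `crop`, and `read_text` are hypothetical callables standing in for a YOLO detector, an image cropper, and a TrOCR recognizer, and the box layout is assumed to be (left, top, right, bottom).

```python
from typing import Callable, Iterable, List, Tuple

# Assumed box layout: (left, top, right, bottom) in pixels.
Box = Tuple[int, int, int, int]


def recognize_regions(
    image: object,
    detect: Callable[[object], Iterable[Box]],
    crop: Callable[[object, Box], object],
    read_text: Callable[[object], str],
) -> List[Tuple[Box, str]]:
    """Locate text regions, then recognize the text inside each one.

    All three callables are hypothetical placeholders supplied by the caller.
    """
    results = []
    # Sort top-to-bottom, then left-to-right, so rows come out in reading order.
    for box in sorted(detect(image), key=lambda b: (b[1], b[0])):
        region = crop(image, box)
        results.append((box, read_text(region).strip()))
    return results
```

Passing the detector and recognizer in as arguments keeps the sketch independent of any particular model library, so either stage could be swapped out without touching the loop.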

Getting Started
Installation
Usage
Contributing
License

Getting Started

Before you start, make sure you have the following installed:

Python 3.10+
pip package manager
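
If you are unsure whether your environment meets these prerequisites, a small check like the one below may help. It is only a sketch: the function names are hypothetical, and the minimum version is taken from the list above.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 10)  # minimum version listed in the prerequisites


def python_is_supported(version=None):
    """Return True if a (major, minor, ...) version tuple meets the minimum."""
    if version is None:
        version = sys.version_info
    return tuple(version[:2]) >= MIN_PYTHON


def pip_is_available():
    """Return True if this interpreter can find the pip package."""
    return importlib.util.find_spec("pip") is not None


if __name__ == "__main__":
    print("Python OK:", python_is_supported())
    print("pip found:", pip_is_available())
```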

Installation

Clone this repository to your local machine:

git clone https://github.com/rajdhal/trOCR.git

Navigate to the project directory:

cd trOCR

(Optional, but recommended) Create and activate a virtual environment to keep the project's dependencies isolated:

python3 -m venv venv

source venv/bin/activate

On Windows, activate the environment with venv\Scripts\activate instead.

Install the required dependencies using pip:

pip install -r requirements.txt

Usage

Run the Gradio web application:

python3 main.py

Open your web browser and navigate to http://localhost:7860 (the default address).
Upload an image of the form you want to recognize text from.
Click the "Submit" button. The output shows the annotated image along with a downloadable CSV file.
To adapt the application to your needs, modify the Gradio interface and the machine learning model.
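
As a starting point for customization, the sketch below shows one way an image-in, image-plus-CSV-out Gradio interface could be wired up. It is an assumption-laden illustration rather than the contents of main.py: `recognize` is a hypothetical callable that returns a list of strings, the CSV column names are invented, and the image is returned unannotated.

```python
import csv
import tempfile


def write_names_csv(names):
    """Write one recognized name per row and return the CSV file path."""
    with tempfile.NamedTemporaryFile(
        mode="w", suffix=".csv", delete=False, newline="", encoding="utf-8"
    ) as handle:
        writer = csv.writer(handle)
        writer.writerow(["row", "name"])  # hypothetical column names
        for index, name in enumerate(names, start=1):
            writer.writerow([index, name])
        return handle.name


def make_handler(recognize):
    """Wrap a recognizer so it returns an (image, CSV path) pair for the UI."""
    def handler(image):
        names = recognize(image)  # hypothetical: returns a list of strings
        # A real handler would draw boxes on the image; this sketch returns it unchanged.
        return image, write_names_csv(names)
    return handler


def build_demo(recognize):
    """Build the interface; gradio is imported lazily so the helpers work without it."""
    import gradio as gr

    return gr.Interface(
        fn=make_handler(recognize),
        inputs=gr.Image(type="pil", label="Sign-in sheet"),
        outputs=[gr.Image(label="Annotated image"), gr.File(label="Recognized names (CSV)")],
        title="Sign-In Sheet Name Recognizer",
    )
```

With Gradio installed, calling `build_demo(my_recognizer).launch()` would serve the interface locally, where `my_recognizer` is whatever function you supply. Changing the labels, title, or output components in `build_demo` is the kind of interface customization described above.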

Contributing

We welcome contributions from the University of Windsor community and the open-source community at large. If you'd like to contribute to this project, please follow these steps:

Fork this repository.
Create a new branch for your feature or bug fix.
Make your changes and commit them.
Push your changes to your fork.
Create a pull request back to this repository.

We hope that the University of Windsor's SmartCampus Text Recognition project simplifies text recognition tasks for the university community. If you have any questions, feedback, or suggestions, please don't hesitate to reach out. We look forward to collaborating with you to improve and expand this application.
