University of Windsor 4990/4960 Sign-In Sheet Name Recognizer
Welcome to the University of Windsor's SmartCampus Text Recognition project! This open-source repository helps the University of Windsor community recognize text from forms, such as names on sign-in sheets, using a customized version of Microsoft's TrOCR model. A Gradio web interface lets you interact with the underlying machine learning model from your browser.
Features
- Text recognition with our customized TrOCR model.
- YOLO object detection to locate text regions in the image.
- Browser-based interface powered by Gradio.
- Recognizes text from forms and documents, with results as an annotated image and a CSV file.
- Customizable for your specific use case.
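The detect-then-recognize flow implied by the features above might be organized roughly as follows. This is a minimal sketch, not the repository's code: `recognize_regions`, the `Box` layout, and the stub `fake_detect` / `fake_recognize` callables are hypothetical names used for illustration. Sorting boxes top to bottom is also an assumption about how a sign-in sheet would be read.

```python
from typing import Callable, Iterable, List, Tuple

# Hypothetical box layout: (left, top, right, bottom) in pixels.
Box = Tuple[int, int, int, int]


def recognize_regions(
    image,
    detect: Callable[[object], Iterable[Box]],
    recognize: Callable[[object, Box], str],
) -> List[Tuple[Box, str]]:
    """Detect text regions, then recognize each one, ordered top to bottom."""
    boxes = sorted(detect(image), key=lambda b: (b[1], b[0]))
    return [(box, recognize(image, box).strip()) for box in boxes]


# Stub callables standing in for the real detector and recognizer.
def fake_detect(image):
    return [(10, 60, 200, 90), (10, 20, 200, 50)]


def fake_recognize(image, box):
    # Keyed by the box's top coordinate; the trailing space shows the strip().
    return {20: "Ada Lovelace", 60: "Alan Turing "}[box[1]]


results = recognize_regions(None, fake_detect, fake_recognize)
for box, text in results:
    print(box, text)
```

Passing the detector and recognizer in as callables keeps the flow testable without loading any model; in the real application they would presumably wrap the YOLO and TrOCR models.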
Getting Started
Before you get started, ensure you have the following prerequisites in place:
- Python 3.10+
- Pip package manager
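If you are unsure whether your environment meets these prerequisites, a small check like the one below may help. It is a sketch using only the Python standard library; the function names `python_is_supported` and `pip_is_available` are made up for this example and are not part of the project.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 10)  # minimum version listed in the prerequisites


def python_is_supported(version=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    version = sys.version_info if version is None else version
    return tuple(version[:2]) >= minimum


def pip_is_available():
    """Return True if the pip module can be found by this interpreter."""
    return importlib.util.find_spec("pip") is not None


print("Python 3.10+:", python_is_supported())
print("pip available:", pip_is_available())
```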
Installation
- Clone this repository to your local machine:
git clone https://github.com/rajdhal/trOCR.git
- Navigate to the project directory:
cd trOCR
- (Optional) Create and activate a virtual environment (recommended for better dependency management):
python3 -m venv venv
source venv/bin/activate
- Install the required dependencies using pip:
pip install -r requirements.txt
Usage
- Run the Gradio web application:
python3 main.py
- Open your web browser and navigate to http://localhost:7860 (by default).
- Upload an image of the form you want to recognize text from.
- Click the "Submit" button, and the annotated image alongside a downloadable CSV file will be available in the output.
- Customize the application to suit your specific needs by modifying the Gradio interface and machine learning model as necessary.
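When customizing the output, the downloadable CSV could be produced along these lines. This is only a sketch with the standard `csv` module: the column names `line` and `name`, the sample rows, and the helper `rows_to_csv` are assumptions for illustration, and the project's actual CSV layout may differ.

```python
import csv
import io


def rows_to_csv(rows, fieldnames=("line", "name")):
    """Serialize recognized rows (a list of dicts) to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample data; real rows would come from the recognition step.
sample = [
    {"line": 1, "name": "Ada Lovelace"},
    {"line": 2, "name": "Turing, Alan"},
]
csv_text = rows_to_csv(sample)
print(csv_text)
```

Using `csv.DictWriter` rather than joining strings by hand means names containing commas or quotes are escaped correctly.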
Contributing
We welcome contributions from the University of Windsor community and the open-source community at large. If you'd like to contribute to this project, please follow these steps:
- Fork this repository.
- Create a new branch for your feature or bug fix.
- Make your changes and commit them.
- Push your changes to your fork.
- Create a pull request back to this repository.
We hope that the University of Windsor's SmartCampus Text Recognition project simplifies text recognition tasks for the university community. If you have any questions, feedback, or suggestions, please don't hesitate to reach out. We look forward to collaborating with you to improve and expand this application.
