SkillAgentSearch skills...

OcrTranslator

Convert captured images to text using BaiduOCR, GoogleOCR, WindowsOCR, tesseractOCR, RapidOCR or Capture2Text, and translate the resulting text using Google, Chatgpt, Edgegpt, DeepL or many more. Desktop application with a nice GUI provided by customtkinter.

Install / Use

/learn @Azornes/OcrTranslator

README

<p align="center"> <h1 align="center">OCR Translator</h1> <p align="center"><i>Convert captured images into text and then translate that text.</i></p> </p> <p align="center"> <a href="https://github.com/Azornes/ocrTranslator/releases"> <img alt="Downloads" src="https://img.shields.io/github/downloads/Azornes/ocrTranslator/latest/total?label=Downloads&style=flat-square"> </a> <a href="https://visitorbadge.io/status?path=https%3A%2F%2Fgithub.com%2FAzornes%2FocrTranslator"><img src="https://api.visitorbadge.io/api/combined?path=https%3A%2F%2Fgithub.com%2FAzornes%2FocrTranslator&countColor=%2337d67a&style=flat-square&labelStyle=none" /></a> <img alt="python 3.9" src="https://img.shields.io/badge/python-3.9-3776AB?logo=Python&logoColor=FFFFFF&style=flat-square"> </a> </p>

With this app, you can select your preferred OCR and translation services. After clicking on START or using the keyboard shortcut Alt+Win+T, the program will launch and you can choose the area of the screen to scan for text using OCR. If you have selected a translation service, the text will then be automatically translated.

preview:

https://user-images.githubusercontent.com/20650591/233107070-f9a14ed8-5c77-4947-8fa5-8d1c86d4a04f.mp4

🔥 Features

  • Desktop application with a user-friendly graphical user interface (GUI) provided by customtkinter.
  • Ability to select preferred OCR and translation services.
  • Option to run the program using either the START button or the keyboard shortcut (Alt+Win+T or bound from options).
  • Capability to choose the area of the screen to scan for text using OCR and save the position (for example, when watching a movie and the subtitles always appear in one spot, so you don't have to select the text area again).
  • Automatic translation of the captured text if a translation service has been selected.
  • Ability to capture subtitles from movies or games by selecting the corresponding area of the screen and displaying the translated text next to them.
  • Chat with chatGPT or edgeGPT.
  • Ability to translate from the clipboard or manually entered text (similar to a typical translation app).
  • Save all selected options and settings to a file and load them when the program is launched.

Desktop App

Download the desktop app here Tested only on Windows 10.


Dependency

  1. Python 3.9. (If you want run from source)
  2. (optional) Capture2Text.
  3. (optional) Tesseract.
  4. (optional) Google api generate a service_account_creds.json. Then, put file into the ocrTranslate/configs directory.
<details> <summary>

5. (optional) ChatGPT

Source

</summary>

Configuration

  1. Create account on OpenAI's ChatGPT
  2. Save your email and password

Authentication method: (Choose 1 and paste to app settings)

- Email/Password

Currently broken for free users. Do export PUID="..." if you have a plus account. The PUID is a cookie named _puid Not supported for Google/Microsoft accounts.

- Access token

https://chat.openai.com/api/auth/session

</details> <details> <summary>

6. (optional) EdgeGPT

Source

</summary> <details> <summary>

Checking access (Required)

</summary>
  • Install the latest version of Microsoft Edge
  • Alternatively, you can use any browser and set the user-agent to look like you're using Edge (e.g., Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/111.0.0.0 Safari/537.36 Edg/111.0.1661.51). You can do this easily with an extension like "User-Agent Switcher and Manager" for Chrome and Firefox.
  • Open bing.com/chat
  • If you see a chat feature, you are good to go
</details> <details> <summary>

Getting authentication (Required)

</summary>
  • Install the cookie editor extension for Chrome or Firefox
  • Go to bing.com
  • Open the extension
  • Click "Export" on the bottom right, then "Export as JSON" (This saves your cookies to clipboard)
  • Paste your cookies into a file cookies.json
  • Paste your file cookies.json to ocrTranslate/configs/
</details> </details>

📊 Tables with information

<details> <summary>

Supported OCR Services

</summary>

| ID | OCR | Internet/Local | Status | |-----|-------------------------------------------------------------------------------------------------------|----------------|--------| | 1 | Google Vision Api | Internet | stable | | 2 | Google Vision Free Demo | Internet | stable | | 3 | Baidu Api | Internet | stable | | 4 | Windows OCR | Local | stable | | 5 | Capture2Text | Local | stable | | 6 | Tesseract | Local | stable | | 7 | RapidOCR | Local | stable |

</details> <details> <summary>

Supported Translation Services

Source

</summary>

| ID | Translator | Number of Supported Languages | Advantage | Service | Status | |-----|-----------------------------------------------------------------------------------|-------------------------------|---------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------|---------------------------------| | 1 | Niutrans | 302 | support the most languages in the world | Northeastern University / Niutrans, China | / | | 2 | Alibaba | 221 | support most languages, support professional field | Alibaba, China | stable | | 3 | Baidu | 201 | support most languages, support professional field, support Classical Chinese | Baidu, China | stable | | 4 | Iciba | 187 | support the most languages in the world | Kingsoft / Xiaomi, China | stable | | 5 | MyMemory | 151 | support the most languages in the world, good at Creole English, Creole French | Translated, Italy | stable | | 6 | Iflytek | 140 | support the most languages in the world | Iflytek, China | / | | 7 | Google | 134 | support more languages in the world | Google, America | stable(offline in China inland) | | 8 | VolcEngine | 122 | support more languages in the world, support professional field | ByteDance, China | / | | 9 | [Lingvanex](https

Related Skills

View on GitHub
GitHub Stars91
CategoryDevelopment
Updated4d ago
Forks5

Languages

Python

Security Score

100/100

Audited on Mar 31, 2026

No findings