22 skills found
nhjydywd / SubtitleOCR: A very fast tool for extracting hardcoded subtitles from video. An Apple M1 chip or an NVIDIA 3060 graphics card is enough to reach 10x extraction speed.
maxrd2 / SubtitleComposer: Subtitle Composer - KF5/Qt Video Subtitle Editor
lars76 / Chinese Subtitle Ocr: Optical character recognition for Chinese subtitles using SSD and CNN
PatchyVideo / MMD Translator: Generate SRT subtitles from video using OCR, mainly designed for MMD videos
freyjaSubOCR / Freyja Sub Ocr Electron: Node.js + Electron user interface for the freyja subtitle OCR extractor
tomkam1702 / OCR Translator: 🎮 Real-time game subtitle translator with AI-powered OCR. Context-aware translation for 20+ languages. Free offline models + dirt cheap APIs. Perfect for gaming in foreign languages!
ecdye / MacSubtitleOCR: Convert bitmap subtitles into SubRip format using the macOS Vision framework
gwen-lg / Subtile Ocr: A Rust command-line tool that converts image subtitles to text with OCR. Started as a fork of vobsubocr.
glowinthedark / Subtitles Ocr: Hard-burned subtitles OCR to SRT extractor
op200 / Simple Subtitle OCR: A simple OCR program with a GUI for extracting hard (embedded) subtitles.
shenbo / Video Subtitles Ocr: Video subtitle extraction based on OpenCV and Tesseract
BruceHan98 / OCR Extract Subtitles: Extract video subtitles using OCR
muhammadsohaib60 / Urdu OCR: Our project is based on one of the most important applications of machine learning: pattern recognition. Optical character recognition (OCR), or optical character reader, is the electronic or mechanical conversion of images of typed, handwritten, or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo, or subtitle text superimposed on an image. We are developing an OCR system for Urdu. From the research papers we have studied so far, we found that Arabic and Urdu are both written in Perso-Arabic script and therefore share similarities at the written level. The styles of Arabic and Persian writing have a heavy influence on the Urdu script. There are 6 major styles for writing Arabic, Persian, and Pashto as well. Urdu is written in the Naskh writing style, which is the most famous of all. OCR is the process of converting an image of text, such as a scanned paper document or electronic fax file, into computer-editable text [1]. The text in an image is not editable: the letters are made of tiny dots (pixels) that together form a picture of text. During OCR, the software analyzes an image and converts the pictures of the characters to editable text based on the patterns of the pixels in the image. After OCR, the converted text can be exported and used with a variety of word-processing, page-layout, and spreadsheet applications [2]. One of the main aims of OCR is to emulate the human ability to read, at a much faster rate, by associating symbolic identities with images of characters. Its potential applications include screen readers, refreshable Braille displays [3], reading customer-filled forms, reading postal addresses off envelopes, and archiving and retrieving text. OCR's ultimate goal is to develop a communication interface between the computer and its potential users. Urdu is the national language of Pakistan.
It is understood by over 300 million people in Pakistan, India, and Bangladesh. Given its large historical body of literature, there is a clear need for automatic systems that convert this literature into electronic form accessible on the World Wide Web. Although much work has been done in the field of OCR, Urdu and other languages that use the Arabic script, such as Farsi and Arabic, have received the least attention. This is due in part to a lack of interest in the field and in part to the intricacies of the Arabic script. Owing to this indifference, a huge amount of Urdu and Arabic literature remains unattended, rotting away on old shelves. The proposed research aims to develop workable solutions to many of the problems faced in realizing an OCR system designed specifically for the Urdu Noori Nastaleeq script, which is widely used in Urdu newspapers, governmental documents, and books. The underlying processes first isolate and classify ligatures based on certain carefully chosen special, contour, and statistical features, and then recognize them with the aid of feed-forward back-propagation neural networks. The input to the system is a monochrome bitmap image file of Urdu text written in Noori Nastaleeq, and the output is the equivalent text in an editable text file.
mohamedbassel24 / OCR For Arabic Scripts: Optical character recognition, or optical character reader (OCR), is the process of converting typed, handwritten, or printed text obtained from media into machine-encoded text. The text may come from a scanned document, a photo of a document, a scene photo, or subtitle text superimposed on an image.
yuvalsol / SubtitlesCleaner: Subtitles Cleaner cleans SubRip .srt subtitle files of OCR errors, hearing-impaired lines, and other junk.
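CharlinChen / OCR Video Subtitle Translator: Runs OCR on a video's existing subtitles, translates the recognized text through the Youdao translation API, and generates an XML subtitle file that can be imported directly into Premiere.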
kirksaunders / Subtitle Ocr Console: Command-line tool for converting image-based subtitle formats to text-based formats using a custom OCR model
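yuukimasato / Video Subtitle Ocr: A desktop application built with PySide6 (Qt for Python) and PaddleOCR that helps users extract hard subtitles (subtitles embedded in the video frames) from video files and convert them into standard `.ass` subtitle files.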
hekmon / Subtitles AI Ocr: Read PGS (Blu-ray) and VobSub (DVD) image subtitles and extract their text using external Vision Language Models.
CreativeHub008 / Tts Sub Editor: TTS Subtitle Editor: The all-in-one smart video editor. Automate subtitles with OCR & Whisper, translate globally with AI, generate natural TTS voiceovers, and craft dynamic captions with GPU-accelerated motion tracking. Built for modern content creators.