Results for "optical-recognition"

552 skills found · Page 1 of 19

clovaai / Deep Text Recognition Benchmark

3.9k

Text recognition (optical character recognition) with deep learning methods, ICCV 2019

universal

crnn · deep-learning · grcnn · +11

Updated 6h ago

otiai10 / Gosseract

3.1k

Go package for OCR (Optical Character Recognition), using the Tesseract C++ library

universal

go · ocr · ocr-server · +2

Updated 8h ago

kha-white / Manga Ocr

2.6k

Optical character recognition for Japanese text, with the main focus being Japanese manga

universal

comics · computer-vision · deep-learning · +4

Updated 2h ago

hwalsuklee / Awesome Deep Text Detection Recognition

2.5k

A curated list of resources for text detection/recognition (optical character recognition) with deep learning methods.

universal

awesome-list · awesome-lists · deep-learning · +9

Updated 3d ago

rmtheis / Android Ocr

2.2k

Experimental optical character recognition app

universal

android · ocr · optical-character-recognition · +1

Updated 9d ago

GauravSingh9356 / J.A.R.V.I.S

1.0k

Personal assistant built with Python libraries. Its features include sending emails, optical text recognition, dynamic news reporting at any time with API integration, a to-do list generator, opening any website by voice command, playing music, Wikipedia search, a dictionary with intelligent sensing (automatic spell checking), weather reporting (temperature, wind speed, humidity), YouTube search, Google Maps search, YouTube downloading, etc.

universal

chatgpt · dictionary-application · difflib · +17

Updated 8h ago

kangoka / Tiktodv3

840

TIKTOD V3 is a bot application designed to automate interactions on the Zefoy website, such as increasing views, hearts, followers, and shares on a specified video. The bot uses Selenium for web automation and OCR (Optical Character Recognition) for solving captchas.

universal

python · selenium · tiktok · +5

Updated 1d ago

OCR4all / OCR4all

704

Provides OCR (Optical Character Recognition) services through web applications

universal

Updated 2d ago

evilgix / Evil

698

Optical Character Recognition in Swift for iOS & macOS. Optical recognition of bank cards, ID cards, and house numbers.

universal

cnn-model · keras · machine-learning · +3

Updated 14h ago

aashrafh / Mozart

697

An optical music recognition (OMR) system. Converts sheet music to a machine-readable version.

universal

mozart · music · music-sheet · +9

Updated 6h ago

kdzwinel / JS OCR Demo

480

JavaScript optical character recognition demo

universal

demo · javascript · ocr

Updated 1mo ago

ZumingHuang / Awesome Ocr Resources

430

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

universal

awesome · computer-vision · deep-learning · +7

Updated 2mo ago

blueaxis / Poricom

422

Optical character recognition in manga images. Manga OCR desktop application

universal

manga-ocr · manga-reader · ocr · +1

Updated 1d ago

apacha / OMR Datasets

378

Collection of datasets used for Optical Music Recognition

universal

dataset · music · music-information-retrieval · +2

Updated 3d ago

tensorflow / Moonlight

330

Optical music recognition in TensorFlow

universal

Updated 19d ago

puhuilab / Phocr

306

An open, high-performance Optical Character Recognition (OCR) toolkit

universal

Updated 18d ago

dhvanikotak / Emotion Detection In Videos

297

The aim of this work is to recognize six emotions (happiness, sadness, disgust, surprise, fear and anger) from human facial expressions extracted from videos. To achieve this, we consider people of different ethnicities, ages and genders, each of whom reacts very differently when expressing emotions. We collected a data set of 149 short videos of both females and males expressing each of the emotions listed above. The data set was built by students, each of whom recorded a video expressing all the emotions with no directions or instructions at all. Some videos include more body parts than others; others have objects in the background and even different lighting setups. We wanted the data to be as general as possible, with no restrictions, so that it would be a good indicator of how well we meet our main goal. The script detect_faces.py detects faces in each video and saves the result at a resolution of 240x320. This step produces shaky videos, so we then stabilized all of them, which can be done with code or with a free online stabilizer. We then ran the stabilized videos through emotion_classification_videos_faces.py. In that code we developed a method to extract features based on a histogram of dense optical flows (HOF), and we used a support vector machine (SVM) classifier to tackle the recognition problem. For each video, we extracted optical flows at each frame. Optical flow measures the motion, relative to an observer, between two frames at each point. Therefore, at each point in the image there are two values describing the vector that represents the motion between the two frames: the magnitude and the angle. In our case, since the videos have a resolution of 240x320, each frame has a feature descriptor of dimensions 240x320x2, so the final video descriptor has dimensions #frames x 240 x 320 x 2.
To make a video comparable to other inputs (inputs of different lengths are not directly comparable), we need to summarize it into a single descriptor. We do this by calculating a histogram of the optical flows: we separate the extracted flows into categories and count the number of flows in each category. In more detail, we split the scene into a grid of s by s cells (s = 10 in this case) to record the location of each feature, and categorize the direction of each flow as one of the 8 motion directions considered in this problem. We then count, for each grid cell, the number of flows falling in each direction bin, which gives an s by s by 8 descriptor per frame. The summarizing step for each video can then be either the average of the histograms in each grid cell over all frames (average pooling) or the maximum value of the histograms per grid cell over all frames (max pooling). For classification, we used a support vector machine (SVM) with a non-linear kernel, discussed in class, to recognize new facial expressions. We also considered a Naïve Bayes classifier, but SVMs are generally regarded as outperforming it on computer vision tasks. A confusion matrix can be used to present the results more clearly.
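The histogram-of-optical-flow descriptor described above can be sketched in pure Python as follows. This is an illustration only, not the repository's code: `hof_frame` and `pool` are hypothetical names, the flow field is assumed to be already computed and passed in as rows of `(dx, dy)` pairs, and skipping zero-magnitude vectors is an assumption (a static pixel has no direction to bin).

```python
import math


def hof_frame(flow, s=10, n_dirs=8):
    """Build an s x s x n_dirs histogram of flow directions for one frame.

    flow: H x W nested list of (dx, dy) motion vectors (assumed precomputed).
    """
    h, w = len(flow), len(flow[0])
    hist = [[[0] * n_dirs for _ in range(s)] for _ in range(s)]
    for y, row in enumerate(flow):
        gy = min(y * s // h, s - 1)  # grid row of this pixel
        for x, (dx, dy) in enumerate(row):
            if dx == 0 and dy == 0:
                continue  # assumption: static pixels are not counted
            gx = min(x * s // w, s - 1)  # grid column of this pixel
            angle = math.atan2(dy, dx) % (2 * math.pi)  # in [0, 2*pi]
            d = int(angle / (2 * math.pi) * n_dirs) % n_dirs  # direction bin
            hist[gy][gx][d] += 1
    return hist


def pool(frames, mode="avg"):
    """Summarize per-frame histograms into one flat video descriptor.

    mode="avg" averages each bin over all frames (average pooling);
    mode="max" keeps the largest value of each bin (max pooling).
    """
    s, n_dirs = len(frames[0]), len(frames[0][0][0])
    out = []
    for gy in range(s):
        for gx in range(s):
            for d in range(n_dirs):
                vals = [f[gy][gx][d] for f in frames]
                out.append(max(vals) if mode == "max" else sum(vals) / len(vals))
    return out
```

Whatever the number of frames, `pool([hof_frame(f) for f in flows])` yields a vector of length s * s * 8 (800 for s = 10), which is what makes videos of different lengths comparable as classifier inputs.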

cseas / Ocr Table

277

Extract tables from scanned image PDFs using Optical Character Recognition.

universal

extract-tables · ocr · ocr-table · +6

Updated 1mo ago

rsommerfeld / Trocr

256

Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models".

universal

computer-vision · handwritten-text-recognition · ocr · +3

Updated 14h ago

dwqs / Ollama Ocr

251

A powerful OCR (Optical Character Recognition) package that uses state-of-the-art vision language models

universal

Updated 2mo ago