SkillAgent Search skills...⌘K

Audio2text

creates text from audio of A/V input file, using docker, sphinx. extracts keywords and NLP entities. leverages OpenNews, Stanford, Oxford, CMU and more

Generate Convert Improve

Install / Use

/learn @traceypooh/Audio2text

About this skill

Quality Score

0/100

Category

Development & Engineering

Supported Platforms

Universal

Tags

audio docker entities extract-keywords natural-language-processing offline speech-recognition srt transcript video

README

audio2text

docker-based pocketsphinx that takes A/V input file to create text from the audio and extract keywords and entities from the results.

Allows one to, once SETUP is done, run locally on a Mac, network disabled / no cloud, and run speech to text and text analysis on an A/V file.

PREREQUISTES

Docker (eg: https://docs.docker.com/docker-for-mac/install/ )

git (brew or XCode setups have you covered ;-)

SETUP

git clone https://github.com/traceypooh/audio2text.git
cd audio2text
docker build -t audio2text .

RUN

( docker run --rm -i audio2text |tar xf - ) < test.mp3

will make (click each to see the results):

out.json - detailed word/phrase with timings
out.txt - transcript of entire audio/video file
out.srt - timed transcript of audio/video file
out.key - keywords extracted from .txt (above)
out.plo - Persons, Locations, Organizations (and more) extracted from .txt (above)

traceypooh

View profile

GitHub Stars10

CategoryDevelopment

Updated4y ago

Forks0

traceypooh/audio2text

Languages

Shell

Security Score

65/100

Audited on Feb 6, 2022

No findings