SkillAgentSearch skills...

Audio2text

creates text from audio of A/V input file, using docker, sphinx. extracts keywords and NLP entities. leverages OpenNews, Stanford, Oxford, CMU and more

Install / Use

/learn @traceypooh/Audio2text

README

audio2text

docker-based pocketsphinx that takes A/V input file to create text from the audio and extract keywords and entities from the results.

Allows one to, once SETUP is done, run locally on a Mac, network disabled / no cloud, and run speech to text and text analysis on an A/V file.

PREREQUISTES

Docker (eg: https://docs.docker.com/docker-for-mac/install/ )

git (brew or XCode setups have you covered ;-)

SETUP

git clone https://github.com/traceypooh/audio2text.git
cd audio2text
docker build -t audio2text .

RUN

( docker run --rm -i audio2text |tar xf - ) < test.mp3

will make (click each to see the results):

  • out.json - detailed word/phrase with timings
  • out.txt - transcript of entire audio/video file
  • out.srt - timed transcript of audio/video file
  • out.key - keywords extracted from .txt (above)
  • out.plo - Persons, Locations, Organizations (and more) extracted from .txt (above)
View on GitHub
GitHub Stars10
CategoryDevelopment
Updated4y ago
Forks0

Languages

Shell

Security Score

65/100

Audited on Feb 6, 2022

No findings