SkillAgentSearch skills...

LipNet

Lip Reading: Deep Learning Based Spoken Text Generation from Lip Movement

Install / Use

/learn @TVR28/LipNet
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

LipReading Using Deep Learning

  • A Lipreading app which generates spoken text based on lip movement using deep learning
  • Load and convert the video into a GIF using OpenCv and Imageio, and then to text using Tensorflow and creating data pipeline, to take the audio of the video out of the equation
  • The GIF is passed as an input to the deep learning model constructed using 3D CNNs and Bi-Directional LSTMs. The model generates number tokens which we convert to characters to get the generated text
  • Developed a web application using Streamlit where a user can select a video and get the text generated.

Website

Checkout the web application here

View on GitHub
GitHub Stars16
CategoryEducation
Updated2y ago
Forks0

Languages

Jupyter Notebook

Security Score

60/100

Audited on Feb 23, 2024

No findings