LipNet
Lip Reading: Deep Learning Based Spoken Text Generation from Lip Movement
Install / Use
/learn @TVR28/LipNetREADME
LipReading Using Deep Learning
- A Lipreading app which generates spoken text based on lip movement using deep learning
- Load and convert the video into a GIF using
OpenCvandImageio, and then to text usingTensorflowand creating data pipeline, to take the audio of the video out of the equation - The GIF is passed as an input to the deep learning model constructed using
3D CNNsandBi-Directional LSTMs. The model generates number tokens which we convert to characters to get the generated text - Developed a web application using
Streamlitwhere a user can select a video and get the text generated.
Website
Checkout the web application here
