Whisper.tflite
Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices
Install / Use
/learn @nyadla-sys/Whisper.tfliteREADME
<img src="https://github.blog/wp-content/uploads/2020/09/github-stars-logo_Color.png" alt="drawing" width="25"/> Please, star the project on github (see top-right corner) if you appreciate my contribution to the community!
Whisper Enhanced Quantized TFLite Model
This project contains an enhanced version of the Whisper quantized TFLite model optimized for both Android and iOS platforms. The model is designed to perform well on edge devices, making it suitable for a wide range of applications.
Integrate Whisper TFLite for Enhanced Mobile App Performance
If you're interested in incorporating the Whisper TFLite model into your iOS and Android applications, please don't hesitate to reach out to us at yadlaniranjan@gmail.com. Our project features an upgraded iteration of the Whisper quantized TFLite model, finely tuned for optimal performance on both Android and iOS platforms. This model is tailored to excel on edge devices, rendering it versatile for various application scenarios. Contact us for further details and collaboration opportunities
Getting Started
To get started with this enhanced Whisper model, follow these steps:
-
Clone this repository to your local machine:
git clone https://github.com/nyadla-sys/whisper.tflite.git -
You can now use the enhanced Whisper quantized TFLite model in your projects by refering sample code for Android and iOS.
Demo Apps
Android
You can find a sample Android app in the whisper_android folder that demonstrates how to use the Whisper TFLite model for transcription on Android devices.
DTLN quantized tflite model
Our overarching objective is to incorporate real-time noise suppression through the utilization of a quantized DTLN tflite model, delivering noise-reduced audio data to the whisper tflite model.
Courtesy from breizhn/DTLN
TODO
- [ ] Considering adding DTLN noise cancellation tflite model to improve whisper ASR accuracy in noisy environments.
Here are links for comparing TFLite with other Whisper models:
Whisper's Comparative Analysis Speech Recognition Experiments Repository OpenVoiceOS' Whisper TFLite Plugin
Stay Updated
Stay connected to this repository for further developments and updates related to the Whisper enhanced TFLite model. We are constantly working to improve its performance and compatibility with various edge devices.
If you have any questions or encounter any issues, please don't hesitate to open an issue in this repository. We'll be happy to assist you!
Citing
If you are using the Whisper tflite model, please cite:
@misc{nyadla-sys,
author={Niranjan Yadla},
title={{Whisper TFLite: OpenAI Whisper Model Port for Edge Devices}},
year=2022,
howpublished={GitHub Repository},
url={https://github.com/nyadla-sys/whisper.tflite},
url={https://github.com/moonshine-ai/openai-whisper},
note={Original TFLite implementation of OpenAI Whisper for on-device automatic speech recognition}
}
Related Skills
node-connect
348.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
348.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
348.5kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
