Kantv

A workbench for learning and practicing on-device AI technology in real scenarios with online TV on Android phones, powered by ggml (llama.cpp, whisper.cpp, ...), FFmpeg, and opencv-mobile.

KanTV

KanTV ("Kan" means "watch" in English) is an open-source project focused on studying and practicing on-device AI technology in <b>real scenarios</b> (such as performing <b>online-TV playback</b>, <b>realtime transcription</b>, and <b>online-TV recording</b> at the same time) on an Android phone:

Software architecture of KanTV Android

Image

Building the project

  • Clone this repository and build locally, see how to build
  • Download pre-built Android APK from https://github.com/kantv-ai/kantv/releases
  • Download pre-built Android APK from GitHub CI builds: https://github.com/kantv-ai/kantv/actions/

Run Android APK on Android phone

  • Android 8.0 (2017.08) through Android 15 (2024.10) or higher, with <b>ANY</b> mainstream arm64 mobile SoC.
  • An Android smartphone equipped with <b>ANY</b> mainstream <b>high-end</b> mobile SoC is highly <b>recommended</b> for the realtime AI-subtitle feature; otherwise unexpected behavior may occur.
  • An Android smartphone equipped with one of the Qualcomm mobile SoCs below <b>is required</b> to verify and run the ggml-hexagon backend on an Android phone (Qualcomm's state-of-the-art high-end <b>Snapdragon 8Gen3 series and Snapdragon 8Elite series</b> are highly recommended):
    Snapdragon 8 Gen 1
    Snapdragon 8+ Gen 1
    Snapdragon 8 Gen 2
    Snapdragon 8 Gen 3
    Snapdragon 8 Elite
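
The requirements above can be sketched as a small check. This is only an illustration, not part of KanTV: the function name `check_device` and its parameters are hypothetical, and it assumes the caller already knows the device's API level, ABI list, and SoC marketing name. Android 8.0 corresponds to API level 26, and `arm64-v8a` is the standard Android ABI name for arm64.

```python
# Illustrative sketch only; check_device and its parameters are hypothetical
# and do not come from the KanTV code base.

MIN_SDK = 26                # Android 8.0 corresponds to API level 26
REQUIRED_ABI = "arm64-v8a"  # standard Android ABI name for arm64

# SoCs the README lists for the ggml-hexagon backend
# (the second entry uses the official spelling of the 8+ Gen 1).
HEXAGON_SOCS = (
    "Snapdragon 8 Gen 1",
    "Snapdragon 8+ Gen 1",
    "Snapdragon 8 Gen 2",
    "Snapdragon 8 Gen 3",
    "Snapdragon 8 Elite",
)


def check_device(sdk_int, abi_list, soc_name=None):
    """Return (can_run_apk, can_run_hexagon) for a device.

    sdk_int  -- Android API level as an integer
    abi_list -- comma-separated ABI string, e.g. "arm64-v8a,armeabi-v7a"
    soc_name -- SoC marketing name supplied by the caller (assumption:
                the caller maps the device to one of the names above)
    """
    abis = [a.strip() for a in abi_list.split(",") if a.strip()]
    can_run_apk = sdk_int >= MIN_SDK and REQUIRED_ABI in abis
    can_run_hexagon = can_run_apk and soc_name in HEXAGON_SOCS
    return can_run_apk, can_run_hexagon
```

On a connected device, the first two inputs could be read with `adb shell getprop ro.build.version.sdk` and `adb shell getprop ro.product.cpu.abilist`; how to obtain the SoC name is left to the caller.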

Screenshots

Here is a short video demonstrating realtime AI subtitles, produced by running the excellent <a href="https://github.com/ggerganov/whisper.cpp"> whisper.cpp </a> on an Android phone equipped with a Qualcomm Snapdragon 8Gen3 mobile SoC - <b>fully offline, on-device</b>.

https://github.com/kantv-ai/kantv/assets/6889919/2fabcb24-c00b-4289-a06e-05b98ecd22b8


A screenshot demonstrating multimodal inference by running <a href="https://github.com/ggerganov/llama.cpp"> llama.cpp </a> on an Android phone equipped with a Qualcomm Snapdragon 8Elite mobile SoC - <b>fully offline, on-device</b>.

Image


A screenshot demonstrating realtime video recognition via MTMD from llama.cpp plus the lightweight multimodal model SmolVLM2-256M from Hugging Face, on an Android phone equipped with a Qualcomm Snapdragon 8Elite mobile SoC - <b>fully offline, on-device</b>.

Image

<details> <summary>some other screenshots</summary> <ol>

Image

Image


A screenshot demonstrating ASR inference by running the excellent <a href="https://github.com/ggerganov/whisper.cpp"> whisper.cpp </a> on an Android phone equipped with a Qualcomm Snapdragon 8Gen3 mobile SoC - <b>fully offline, on-device</b>.

Image


A screenshot demonstrating Text-2-Image inference by running the amazing <a href="https://github.com/leejet/stable-diffusion.cpp"> stable-diffusion.cpp </a> on an Android phone equipped with a Qualcomm Snapdragon 8Elite mobile SoC - <b>fully offline, on-device</b>.

Image


A screenshot demonstrating how to download an LLM model in the APK.

Image

Image

</ol> </details>

Docs

Contribution

Issue reports from Android phones equipped with a <b>mainstream</b> mobile SoC and PRs to this project are greatly welcomed.

We use GitHub issues to track feature requests and issue reports; please see how to submit an issue in this project.

<!-- comment out this section because some contributors in the upstream project might-be don't want to be appeared here ### Contributors [![Contributors](http://contrib.nn.ci/api?repo=kantv-ai/kantv)](https://github.com/kantv-ai/kantv/graphs/contributors) -->

Special Acknowledgement

<ul>AI inference framework <ul> <li> <a href="https://github.com/ggml-org/ggml">GGML</a> </li> </ul> </ul> <ul>AI application engine <ul> <li> ASR engine <a href="https://github.com/ggml-org/whisper.cpp">whisper.cpp</a> </li> <li> LLM engine <a href="https://github.com/ggml-org/llama.cpp">llama.cpp</a> </li> <li> Text2Image engine <a href="https://github.com/leejet/stable-diffusion.cpp">stable-diffusion.cpp</a> </li> <li> CV engine <a href="https://github.com/nihui/opencv-mobile">opencv-mobile</a> </li> <li> MTMD(multimodal) engine <a href="https://github.com/ggml-org/llama.cpp/blob/master/tools/mtmd/README.md">MTMD subsystem in llama.cpp</a> </li> </ul> </ul>