Omnitalker

[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

OmniTalker (Accepted by NeurIPS 2025)

This repo hosts the project page for OmniTalker.

OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

Zhongjian Wang, Peng Zhang, Jinwei Qi, Guangyuan Wang, Sheng Xu, Bang Zhang, Liefeng Bo

Tongyi Lab, Alibaba Group

Citation

@misc{wang2025omnitalkerrealtimetextdriventalking,
      title={OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication}, 
      author={Zhongjian Wang and Peng Zhang and Jinwei Qi and Guangyuan Wang and Sheng Xu and Bang Zhang and Liefeng Bo},
      year={2025},
      eprint={2504.02433},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2504.02433}, 
}