V2vt
Video-to-video translation with voice cloning and lip synchronization. Supports conversion between Chinese and English in both directions.
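- Transcribe the speech in a video and translate it
- Voice cloning
- Lip synchronization
- Burn subtitles into the video
- Convert videos between Chinese and English in either direction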
Demo
See res.mp4.
Tech stack
- Speech recognition: fast-whisper
- Text translation:
  - facebook/m2m
  - translators [default]
- Voice cloning:
- Lip synchronization: video-retalking
- Face super-resolution: gfpgan
- Video assembly: pyvideotrans
Environment setup
- Install ffmpeg and add it to your PATH, or place the executables directly in an ffmpeg folder in the current directory
ffmpeg
|- ffmpeg.exe
|- ffprobe.exe
...
- Install the dependencies
git clone git@github.com:halfzm/v2vt.git
conda create -n v2vt_clone python=3.11.0
conda activate v2vt_clone
cd v2vt
pip install -r requirements.txt
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
- Put the models required by openvoice in the openvoice_checkpoints folder. The directory structure should look like this [optional]
openvoice_checkpoints
|- base_speakers
|- ...
|- converter
|- ...
...
- Put the models required by tts in the tts_models folder. The directory structure should look like this
tts_models
|- tts_models--multilingual--multi-dataset--xtts_v2
|- config.json
|- model.pth
|- speakers_xtts.pth
|- vocab.json
|- ...
...
- Put the models required by video_retalking in the video-retalking/checkpoints folder. The directory structure should look like this
video-retalking
|- checkpoints
|- ...
...
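The setup steps above can be checked with a short script before launching the app. This is only a sketch, not part of the project: the helper names `find_tool` and `check_setup` are hypothetical, and it assumes the folder names listed above are relative to the repository root. The optional openvoice_checkpoints folder is deliberately not checked.

```python
import shutil
from pathlib import Path
from typing import List, Optional

# Folders named in the setup steps above (assumed relative to the repo root).
REQUIRED_DIRS = ["tts_models", "video-retalking/checkpoints"]


def find_tool(name: str, root: Path) -> Optional[str]:
    """Return a path to `name` from PATH or from the local ffmpeg folder, else None."""
    found = shutil.which(name)
    if found:
        return found
    # Fall back to the local ./ffmpeg folder described in the setup steps.
    for candidate in (root / "ffmpeg" / name, root / "ffmpeg" / f"{name}.exe"):
        if candidate.is_file():
            return str(candidate)
    return None


def check_setup(root: Path) -> List[str]:
    """Collect readable problems; an empty list means nothing obvious is missing."""
    problems = []
    for tool in ("ffmpeg", "ffprobe"):
        if find_tool(tool, root) is None:
            problems.append(f"{tool} not found on PATH or in {root / 'ffmpeg'}")
    for rel in REQUIRED_DIRS:
        path = root / rel
        if not path.is_dir() or not any(path.iterdir()):
            problems.append(f"missing or empty model folder: {rel}")
    return problems


if __name__ == "__main__":
    for line in check_setup(Path.cwd()) or ["setup looks complete"]:
        print(line)
```

Run it from the repository root. It only confirms that the tools and folders exist, not that the model files inside them are the right ones.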
Usage
- Quick start
python app.py
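- Input: the input video must not be too short, or voice cloning will fail with an error (at least 5 seconds is recommended).
- Output: by default the result is written to output.mp4 in the current directory. It can also be downloaded directly from the web UI.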
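The input-length note above can be enforced before a video is submitted. The following is a minimal sketch that assumes ffprobe is on your PATH. `probe_duration` and `is_long_enough` are hypothetical helpers, not functions from this project, and the 5-second threshold is taken from the recommendation above rather than from the code.

```python
import subprocess
import sys

MIN_SECONDS = 5.0  # recommended minimum from the usage notes, not a hard limit


def probe_duration(path: str, ffprobe: str = "ffprobe") -> float:
    """Ask ffprobe for the container duration of `path`, in seconds."""
    result = subprocess.run(
        [ffprobe, "-v", "error",
         "-show_entries", "format=duration",
         "-of", "default=noprint_wrappers=1:nokey=1",
         path],
        capture_output=True, text=True, check=True,
    )
    return float(result.stdout.strip())


def is_long_enough(duration: float, minimum: float = MIN_SECONDS) -> bool:
    """True when the clip meets the recommended minimum length."""
    return duration >= minimum


if __name__ == "__main__" and len(sys.argv) > 1:
    seconds = probe_duration(sys.argv[1])
    if is_long_enough(seconds):
        print(f"{sys.argv[1]}: {seconds:.1f} s, OK")
    else:
        print(f"{sys.argv[1]}: {seconds:.1f} s, likely too short for voice cloning")
```

Pass the video path as the first argument. The length check is kept separate from the ffprobe call so it can be reused with a duration obtained some other way.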
Other
The license and code of conduct are the same as those of the video-retalking project.
See LICENSE and CODE_OF_CONDUCT for details.
