V2vt

video to video translation with voice clone and lip synchronization|带有语音克隆和口型同步的视频翻译，支持中英互换

Generate Convert Improve

Install / Use

/learn @halfzm/V2vt

About this skill

Quality Score

0/100

README

<p align="left"> <a href="readme-en.md">English</a> ｜中文 </p> <h1 align="left"> 带有口型同步功能的视频翻译 </h1>

转录视频语音并翻译
语音克隆
口型同步
压制字幕
支持中英视频互相转换

效果演示

详见res.mp4

技术栈

语音识别：fast-whisper
文本翻译：
- facebook/m2m
- translators[默认值]
音色克隆：
- openvoice（对于中文，使用微软的TTS替换openvoice自带的TTS模型）
- TTS[默认值]
口型同步：videotalking
脸部超分：gfpgan
视频整合：pyvideotrans

环境搭建

安装ffmpeg并添加到环境变量，或者是直接把相应可执行程序放到当前目录ffmpeg文件夹

ffmpeg
   |- ffmpeg.exe
   |- ffprobe.exe
...

安装依赖

git clone git@github.com:halfzm/v2vt.git

conda create -n v2vt_clone python=3.11.0
conda activate v2vt_clone

cd v2vt_clone
pip install -r requirements.txt
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

把openvoice中需要的模型放到openvoice_checkpoints文件夹中，目录结构应该如下[optional]

openvoice_checkpoints
   |- base_speakers
    |- ...
   |- converter
    |- ...
...

把tts中需要的模型放到tts_models文件夹中，目录结构应该如下

tts_models
   |- tts_models--multilingual--multi-dataset--xtts_v2
    |- config.json
    |- model.pth
    |- speakers_xtts.pth
    |- vocal.json
   |- ...
...

把video_retalking中需要的模型放到video-retalking/checkpoints文件夹中，目录结构应该如下

video-retalking
   |- checkpoints
    |- ...
...

使用说明

快速启动

python app.py

关于输入
输入视频不能太短，否则语音克隆的时候报错（最好不要低于5S）
关于输出
默认是输出到当前目录下的output.mp4，也可以在webui中直接下载

其它

licence和code_of_conduct和video-retalking项目一致
详见LICENSE和CODE_OF_CONDUCT

Related Skills

qqbot-channel

349.0k

QQ 频道管理技能。查询频道列表、子频道、成员、发帖、公告、日程等操作。使用 qqbot_channel_api 工具代理 QQ 开放平台 HTTP 接口，自动处理 Token 鉴权。当用户需要查看频道、管理子频道、查询成员、发布帖子/公告/日程时使用。

docs-writer

100.3k

`docs-writer` skill instructions As an expert technical writer and editor for the Gemini CLI project, you produce accurate, clear, and consistent documentation. When asked to write, edit, or revie

model-usage

349.0k

Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.

Design

Campus Second-Hand Trading Platform \- General Design Document (v5.0 \- React Architecture \- Complete Final Version)1\. System Overall Design 1.1. Project Overview This project aims t

halfzm

View profile

View on GitHub

GitHub Stars145

CategoryContent

Updated1mo ago

Forks29

halfzm/v2vt

Languages

Python

Security Score

100/100

Audited on Feb 9, 2026

No findings