SkillAgentSearch skills...

Jets

JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech

Install / Use

/learn @imdanboy/Jets
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech

Pytorch implementation based on ESPNet (https://github.com/espnet/espnet) toolkit; tag: v.202204

paper: https://arxiv.org/abs/2203.16852

demo: https://imdanboy.github.io/interspeech2022/

<p align="center"><img src="figures/jets_fig.002.jpeg" alt="figure" width="500"/></p>

JETS consists of FastSpeech2, HiFi-GAN and an Alignment Module. The model files are located at espnet2/gan_tts/jets/

How to use

  1. clone the repo
git clone https://github.com/imdanboy/jets.git
  1. download espnet and patch jets code to espnet
cd jets; ./patch_to_espnet.sh
  1. install espnet as usual
cd jets/espnet/tools
./setup_venv $(which python3)
make
  1. run the training script // tested on 4 V100 GPUs
# LJSPEECH training
cd jets/espnet/egs2/ljspeech/tts1
./run.sh --stage 1 --stop_stage 6 --ngpu 4
# KSS training
cd jets/espnet/egs2/kss/tts1
./run.sh --stage 1 --stop_stage 6 --ngpu 4

Note

JETS is now available in ESPnet officially since v.202205 !!!

View on GitHub
GitHub Stars113
CategoryDevelopment
Updated3mo ago
Forks17

Languages

Python

Security Score

92/100

Audited on Dec 6, 2025

No findings