DiffSingerMiniEngine
A minimal inference engine for DiffSinger in MIDI-less mode.
Getting Started
- Install `onnxruntime` following the official guidance.
- Install other dependencies with `pip install PyYAML soundfile`.
- Download ONNX version of the NSF-HiFiGAN vocoder from here and unzip it into `assets/vocoder` directory.
- Download an ONNX rhythm predictor from here and put it into `assets/rhythmizer` directory.
- Put your ONNX acoustic models into `assets/acoustic` directory.
- Edit `configs/default.yaml` or create another config file according to your preference and local environment.
- Run server with `python server.py` or `python server.py --config <YOUR_CONFIG>`.
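Before starting the server, it can help to confirm that the steps above were completed. The following is a minimal sketch of such a check, not part of the repository: the `preflight` helper is hypothetical, and it assumes the three dependencies are importable as `onnxruntime`, `yaml` (PyYAML), and `soundfile`, and that the asset directories and config file sit at the paths named in the steps.

```python
import importlib.util
from pathlib import Path

# Module names and paths are taken from the setup steps above.
# The check itself is a hypothetical helper, not shipped with the project.
REQUIRED_MODULES = ["onnxruntime", "yaml", "soundfile"]
REQUIRED_DIRS = ["assets/vocoder", "assets/rhythmizer", "assets/acoustic"]


def preflight(root=".", config="configs/default.yaml"):
    """Return a list of human-readable problems; an empty list means ready."""
    root = Path(root)
    problems = []
    # Check that each dependency can be located without importing it.
    for name in REQUIRED_MODULES:
        if importlib.util.find_spec(name) is None:
            problems.append(f"missing Python module: {name}")
    # Each asset directory must exist and contain at least one entry.
    for rel in REQUIRED_DIRS:
        directory = root / rel
        if not directory.is_dir() or not any(directory.iterdir()):
            problems.append(f"missing or empty directory: {rel}")
    if not (root / config).is_file():
        problems.append(f"missing config file: {config}")
    return problems


if __name__ == "__main__":
    issues = preflight()
    for issue in issues:
        print(issue)
    if not issues:
        print("Setup looks complete; run: python server.py")
```

Run it from the repository root. It only checks that files and modules are present; it does not validate the contents of the models or the config.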
API Specification
TBD
How to Obtain Acoustic Models
- Train with your own dataset or download pretrained checkpoints from here.
- Export PyTorch checkpoints to ONNX format. See instructions here.
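Once a checkpoint has been exported, a quick way to sanity-check the result is to list the `.onnx` files in `assets/acoustic` and print each model's input and output tensors. The sketch below is an illustration under assumptions: `find_onnx_models` and `describe_model` are hypothetical helper names, and the tensor names and shapes it prints depend entirely on how the model was exported.

```python
from pathlib import Path


def find_onnx_models(directory="assets/acoustic"):
    """Return the .onnx files under `directory` (searched recursively), sorted."""
    return sorted(Path(directory).rglob("*.onnx"))


def describe_model(path):
    """Print the name, type, and shape of each input and output tensor."""
    # Imported lazily so that listing files works without onnxruntime installed.
    import onnxruntime as ort

    session = ort.InferenceSession(str(path), providers=["CPUExecutionProvider"])
    for arg in session.get_inputs():
        print("input ", arg.name, arg.type, arg.shape)
    for arg in session.get_outputs():
        print("output", arg.name, arg.type, arg.shape)


if __name__ == "__main__":
    for model_path in find_onnx_models():
        print(model_path)
        describe_model(model_path)
```

If a model fails to load here, it will most likely fail in the server too, so this is a cheap first check after export.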
