zhao-kun / VibeVoiceFusion
461VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA fine-tuning, batch generation, and VRAM optimization. Based on Microsoft's VibeVoice (AR + diffusion architecture)
universal
aigcautoregressive-modelsfine-tuning+9
39