4 skills found
FunAudioLLM / ThinkSound - [NeurIPS 2025] PyTorch implementation of ThinkSound, a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
Tencent-Hunyuan / HunyuanVideo Foley - HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
Yuan-ManX / AI Audio Datasets - AI Audio Datasets (AI-ADS) 🎵, covering speech, music, and sound effects, which can provide training data for generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
AI45Lab / UniMark - AIGC watermark & identification toolkit for text, image, audio, and video. Supports invisible watermarking and visible marking.