135 skills found · Page 1 of 5
suno-ai / Bark🔊 Text-Prompted Generative Audio Model
Lightricks / LTX 2Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
Stability-AI / Stable Audio ToolsGenerative models for conditional audio generation
SamurAIGPT / Generative Media SkillsMulti-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.
chrisdonahue / WaveganWaveGAN: Learn to synthesize raw audio with generative adversarial networks
gitmylo / Audio WebuiA webui for different audio related Neural Networks
Harmonai-org / Sample GeneratorTools to train a generative model on arbitrary audio samples
Yuan-ManX / AI Audio DatasetsAI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
Text-to-Audio / Make An AudioPyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
ksw0306 / FloWaveNetA Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
deepbrainai-research / Float[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.
Zeta36 / Tensorflow Tex WavenetThis is a TensorFlow implementation of the WaveNet generative neural network architecture https://deepmind.com/blog/wavenet-generative-model-raw-audio/ for text generation.
npisanti / OfxPDSPopenFrameworks addon for audio synthesis and generative music
Stability-AI / Stable Audio MetricsMetrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
yuvraj108c / ComfyUI FLOATGenerative Motion Latent Flow Matching for Audio-driven Talking Portrait
resemble-ai / MelNetWIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
yukara-ikemiya / Friendly Stable Audio ToolsRefactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.
Deepest-Project / MelNetImplementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
crlandsc / Tiny Audio DiffusionA repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)
KlingAIResearch / X DubTry X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"