Results for "ai-audio-generation"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

55 skills found · Page 1 of 2

SamurAIGPT / Generative Media Skills

3.0k

Multi-modal Generative Media Skills for AI Agents (Claude Code, Cursor, Gemini CLI). High-quality image, video, and audio generation powered by muapi.ai.

claude codeclaude desktop+2

agent-toolsai-agentsai-art+17

Updated 1h ago

archinetai / Audio AI Timeline

1.9k

A timeline of the latest AI models for audio generation, starting in 2023!

universal

artificial-intelligenceaudio-generationmachine-learning

Updated 5d ago

NeptuneHub / AudioMuse AI

1.5k

AudioMuse-AI is an Open Source Dockerized environment that brings automatic playlist generation to Jellyfin, Navidrome, LMS, Lyrion and Emby. Using powerful tools like Librosa and ONNX, it performs sonic analysis on your audio files locally, allowing you to curate the perfect playlist for any mood or occasion without relying on external APIs.

zed

clapdockeremby+17

Updated 26m ago

fspecii / HeartMuLa Studio

517

Suno-like music generation studio for HeartMuLa/heartlib - AI-powered music creation with reference audio style transfer

universal

Updated 1d ago

ammaarreshi / Openjourney

235

Open-source clone of the MidJourney web interface featuring real AI image and video generation powered by Google's Gemini SDK. Use Imagen 4 to generate images and Veo 2 and 3 for image and text to video with audio.

gemini cli

aigenerative-aiimage-generation+5

Updated 5d ago

jgravelle / GroqCasters

139

GroqCasters is a Python application that generates podcast scripts and corresponding audio using AI technologies. It leverages PocketGroq for script generation and Bark for text-to-speech conversion, allowing for custom voice cloning.

universal

notebooklmpodcastpodcasting

Updated 1d ago

okio-ai / Nendo Platform

129

Nendo is an open source platform for AI-driven audio management, intelligence, and generation.

universal

Updated 4mo ago

innovatorved / Realtime Interview Copilot

Realtime Interview Copilot is a web application that assists users in crafting responses during interviews. It leverages real-time audio transcription and AI-powered response generation to provide relevant and concise answers.

vscode copilot

aichatgptinnovatorved+5

Updated 21d ago

aastroza / AI Podcast Generator

AI-powered tool for automatic podcast script and audio generation.

universal

artificial-intelligencechatgptpodcast

Updated 15d ago

Fantety / FrameForge

FrameForge is a web application built with FastAPI and React. As an AI-powered asset generation tool designed specifically for game developers, it offers a variety of AI-driven features to help developers quickly create visual and audio assets required for games.

universal

Updated 27d ago

BernieTv / ElevenLabs Clone

A self-hosted ElevenLabs clone for text-to-speech, voice conversion, and AI audio generation with Docker, FastAPI, and Next.js. 🔊🎙️💡💻

universal

aiaws-s3fine-tuning+6

Updated 10d ago

kousen / OpenAIClient

Demonstrates how to use Spring to access OpenAI restful web services without using the Spring AI project. Tests call ChatGPT for text, DALL-E for image generation, and Whisper for audio transcriptions.

universal

Updated 5mo ago

deepsingh132 / Aionair

A cutting-edge AI SaaS platform that enables users to create, discover, and enjoy podcasts with advanced features like text-to-audio conversion with multi-voice AI, podcast thumbnail image generation, and seamless playback. The platform is built using Next.js, TypeScript, Convex, OpenAI, Stripe, Clerk, ShadCN, and Tailwind CSS.

universal

aiclerkconvex+17

Updated 1mo ago

CloudAI-X / Z AI Playground V2

Z.AI API Playground - Complete examples for GLM-4.7, Vision, Image/Video Generation, Audio, and more. Powered by Z.AI-GLM-4.7-Coding Plan

universal

Updated 1mo ago

RowanUnderwood / Synesthesia AI Video Director

Automate your AI music video workflow with Synesthesia Engine. This local Gradio app bridges audio analysis, LLM-driven storytelling, and LTX Desktop video generation. Simply drop in your song stems and lyrics, let the AI direct your storyboard, and batch-render your final cut.

universal

Updated 15h ago

RhythrosaLabs / Soundstorm

Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. From sample pack creation and algorithmic composition to AI text-to-audio and onscreen ChatGPT, Soundstorm is a sonic powerhouse.

universal

ai-audioai-audio-generationalgorithmic-composition+16

Updated 16d ago

georgbuechner / Dissonance

A command line and keyboard based strategy-game written in c++, where audio-input determines the AI-strategy and lays the seed for the map-generation.

universal

audiocommand-linecpp+1

Updated 8mo ago

ebowwa / HeyCyanSmartGlassesSDK

Cross-platform SDK for HeyCyan smart glasses - Control photo/video capture, audio recording, and AI image generation via Bluetooth LE on iOS and Android

universal

aarandroid-sdkaugmented-reality+13

Updated 3h ago

nikhil-robinson / Openrouter Client

A comprehensive OpenRouter API client library for ESP32 (ESP-IDF), enabling seamless integration with OpenRouter’s AI models. Supports text generation, streaming responses, function calling, and multimodal capabilities including image and audio processing.

universal

esp-idfesp32llm+1

Updated 1mo ago

wasenderapi / Audio Chat N8n Wasenderapi

An n8n workflow for creating an AI-powered audio chat assistant. This project uses Wasenderapi for messaging, OpenAI for transcription and response generation, and Google Drive for file handling.

universal

Updated 18h ago