26 skills found
Blaizzy / Mlx AudioA text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Blaizzy / Mlx Audio SwiftA modular Swift SDK for audio processing with MLX on Apple Silicon
Blaizzy / Mlx VideoMLX-Video is the best package for inference and finetuning of Image-Video-Audio generation models on your Mac using MLX.
tsmdt / Whisply💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and subtitle generation using OpenAI’s Whisper on CPU, Nvidia GPU and Apple MLX.
DePasqualeOrg / Mlx Swift AudioSwift tools for text to speech (TTS) and speech to text (STT) powered by MLX
lucasnewman / Vocos MlxImplementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
axot / OpenSuperMLXmacOS app for real-time audio transcription powered by MLX on Apple Silicon
mbotsu / Mlx Speech2textAudio transcription using mlx whisper and vad silence processing
shreyaskarnik / Voice MCPBidirectional voice MCP server for Claude Code — listen (STT) and speak (TTS) on Apple Silicon via mlx-audio
lucasnewman / Vocos SwiftImplementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in Swift using MLX
g2h0 / Qwen3 Tts Mlx StudioLocal text-to-speech WebUI on Apple Silicon, powered by Qwen3-TTS and mlx-audio
cosformula / Openclaw Mlx AudioOpenClaw local TTS plugin powered by mlx-audio, zero API key, zero cloud dependency
lucasnewman / Descript MlxImplementation of the Descript Audio Codec in MLX
EliFuzz / Parakeet MlxParakeet MLX is a next-generation automatic speech recognition (ASR) engine optimized for Apple Silicon (M1/M2/M3), leveraging Apple’s MLX framework for ultra-fast, low-latency transcription. It offers real-time streaming, advanced audio processing. Including noise reduction and silence detection
appautomaton / MLX GenAI🎬 Accelerated LTX-2.3 (22B) text-to-video+audio generation on Apple Silicon 8-bit/4-bit quantized inference via MLX
sandst1 / Stable Audio MlxMLX port of the stable-audio-open-small model + a sampler and cmd-line app built on top of it
kauazin394 / Vibevoice.swift🎤 Create low-latency text-to-speech on macOS with VibeVoice.swift, leveraging Swift and MLX for real-time audio generation and streaming.
Swap98-Coder / Mlx AudioA text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
dgrauet / Ltx 2 MlxPure MLX port of LTX-2 (Lightricks LTX-2.3) for Apple Silicon — video + audio generation
openclirun / OpencliOpenCLI bridges the gap between raw MLX models and AI Agents. Convert local Vision, Audio, and 3D models into Standardized Agent Skills via MCP.