pyannote / Pyannote Audio: Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
tmoroney / Auto Subs: Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
kaixxx / NoScribe: Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)
yinruiqing / Pyannote Whisper: No description available
pyannote / Pyannote Video: Face detection, tracking and clustering in videos
revdotcom / Reverb: Open-source inference code for Rev's model
pyannote / Pyannote Metrics: A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
narcotic-sh / Senko: Very fast, accurate speaker diarization
thomasmol / Cog Whisper Diarization: Cog implementation of a transcription + diarization pipeline with Whisper & Pyannote
pyannote / Pyannote Core: Advanced data structures for handling temporal segments with attached labels.
pyannote / Pyannote Database: Reproducible experimental protocols for multimedia (audio, video, text) databases
thewh1teagle / Pyannote Rs: pyannote audio diarization in Rust
pengzhendong / Pyannote Onnx: ONNX inference of pyannote segmentation
FrenchKrab / IS2023 Powerset Diarization: Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published at Interspeech 2023.
pyannote / DEPRECATED Pyannote Audio Hub: [deprecated] Pretrained models for pyannote-audio 1.x
clement-pages / Gryannote: Gradio custom components that make diarization-based audio labeling easier and faster.
jfgonsalves / Parakeet Diarized: Parakeet 0.6b V2 + Pyannote diarization behind a Whisper API
JSchmie / ScrAIbe: Tool for automatic transcription and speaker diarization based on Whisper and pyannote.
mahdeslami11 / Pyannote Audio: No description available
jeanjerome / EchoInStone: EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.