Results for "wavlm"

24 skills found

yl4579 / StyleTTS2

6.2k

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

universal

adversarial-training, deep-learning, diffusion-models, +9

Updated 1d ago

s3prl / S3prl

2.5k

Self-Supervised Speech Pre-training and Representation Learning Toolkit

universal

apc, cpc, data2vec, +17

Updated 7h ago

wenet-e2e / Wespeaker

1.2k

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

universal

asv, campplus, cnceleb, +17

Updated 2h ago

ASR-project / Multilingual PR

261

Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compare three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022), each pretrained on a corpus of English speech. We use them in various ways to perform phoneme recognition for different languages, with a network trained with the Connectionist Temporal Classification (CTC) algorithm.

universal

asr, common-voice, deep-learning, +7

Updated 5d ago

lucadellalib / Focalcodec

158

A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation

universal

codec, deep-learning, focal-modulation, +6

Updated 3d ago

sinhat98 / Adapter Wavlm

No description available

universal

Updated 1mo ago

lucadellalib / Audiocodecs

A collection of audio codecs with a standardized API

zed

codec, dac, encodec, +10

Updated 15d ago

mjhydri / Singing Vocal Beat Tracking

This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages the WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. It then uses an HMM decoder to infer singing beats and tempo.

universal

beat-tracking, hubert, linear-transformer, +5

Updated 9mo ago

lucadellalib / Discrete Wavlm Codec

A neural speech codec based on discrete WavLM representations

universal

clustering, codec, hifi-gan, +8

Updated 4d ago

hi-paris / Wavlm Vocoder French

WavLM-to-Audio neural vocoder for French speech reconstruction — layer ablation study and adversarial supervision as a foundation for continuous voice conversion (JEP 2026)

universal

french-speech, tts, voice-conversion, +1

Updated 3h ago

Amir-Ivry / MAPSS Measures

The code for the MAPSS measures for source separation evaluation (ICLR, 2026)

universal

ai, audio-quality, diffusion-maps, +11

Updated 10d ago

bunyaminergen / WavLMMSDD

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

universal

diarization, embedding, microsoft, +5

Updated 2mo ago

AnshKapadia / TS VAD Plus

TS-VAD+: Transformer-based speaker diarization system developed as part of my MS thesis at NTU Singapore, improving diarization in overlapping speech using ECAPA-TDNN, WavLM, VBx, and memory-aware attention.

universal

Updated 1mo ago

Sarasadeghii / Sharif WavLM

In this repository, the WavLM model is applied to high-quality and poor-quality data for a speaker verification task, and the PyCM library is used for evaluation.

universal

confusion-matrix, farsi-datasets, pycm, +2

Updated 7mo ago