50 skills found · Page 1 of 2
facebookresearch / AugLyA data augmentations library for audio, image, text, and video.
iver56 / AudiomentationsA Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
AgaMiko / Data Augmentation ReviewList of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
iver56 / Torch AudiomentationsFast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
descriptinc / AudiotoolsObject-oriented handling of audio data, with GPU-powered augmentations, and more.
KentoNishi / Torch Pitch ShiftPitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
shangeth / WavencoderWavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
SuperKogito / Pydiogment:mega: Python library for audio augmentation
a-n-rose / Python Sound ToolSoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enhancement, audio augmentation, feature extraction and visualization, dataset and audio file conversion, and beyond.
emilio-molina / Audio DegraderAudio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.
freds0 / Data Augmentation For AsrA set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
AppleHolic / Audioset AugmentorSound augmentation using Large-scale audio dataset (Audioset)
alicank / Translation Augmented LibriSpeech CorpusLarge scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and contains English utterances (from audiobooks) automatically aligned with French text. Our dataset offers ~236h of speech aligned to translated text.
felixchenfy / Speech Commands Classification By LSTM PyTorchClassification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
KentoNishi / Torch Time StretchTime-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
musikalkemist / AudioDataAugmentationTutorialRepository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.
slSeanWU / Beats Conformer Bart Audio CaptionerPyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Lallapallooza / Fast Audiomentations⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
alibugra / Audio Data AugmentationAudio data augmentation examples
phrasenmaeher / Audio Transformation VisualizationA streamlit application that lets you explore the effect of different audio augmentation techniques