7 skills found
facebookresearch / AudioMAE - This repo hosts the code and models of "Masked Autoencoders that Listen".
AlanBaade / MAE AST Public - Public code for the paper "MAE-AST: Masked Autoencoding Audio Spectrogram Transformer".
rishikksh20 / AudioMAE Pytorch - Unofficial PyTorch implementation of "Masked Autoencoders that Listen".
XuecWu / AVF MAE - [CVPR'25] AVF-MAE++: Scaling Affective Video Facial Masked Autoencoders via Efficient Audio-Visual Self-Supervised Learning
IvanBirkmaier / Audioset - This repository focuses on practical ways to obtain and work with the AudioSet audio data. You can use it to download and preprocess AudioSet WAV files for running the recipes of the Audio Spectrogram Transformer (AST) and "Masked Autoencoders that Listen" (Audio-MAE).
gary920209 / Audio Maestro - Official implementation of Audio-Maestro
samsad35 / VQ MAE AudioVisual Code - [CVIU] A vector quantized masked autoencoder for audiovisual speech emotion recognition