AudioContextEncoder

A context encoder for audio inpainting

Generate Convert Improve

Install / Use

/learn @andimarafioti/AudioContextEncoder

About this skill

Quality Score

0/100

README

Audio inpainting with a context encoder

This project accompanies the research work on audio inpainting of small gaps done at the Acoustics Research Institute in Vienna collaborating with the Swiss Data Science Center. The paper was published at IEEE TASLP available now: https://ieeexplore.ieee.org/document/8867915.

Installation

Install the requirements with pip install -r requirements.txt. For windows users, the numpy version should be 1.14.0+mkl (find it here). For the FMA dataset, librosa requires ffmpeg as an mp3 backend.

Instructions

The paper uses both google's Nsynth dataset and the FMA dataset. In order to recreate the used dataset, execute in the parent folder either python make_nsynthdataset.py or python make_fmadataset.py. The output of the scripts are three tfrecord files for training, validating and testing the model.

The default parameters for the network come pickled in the file magnitude_network_parameters.pkl and complex_network_parameters.pkl. In order to make other architectures use saveParameters.py.

To train the network, execute in the parent folder python trainMagnitudeNetwork.py or python trainComplexNetwork.py. This will train the network for 600k steps with a learning rate of 1e-3. You can select on which tfrecords to train the network, the script assumes you have created the nsynth dataset.

Sound examples

To hear examples please go to the accompanying website.

Related Skills

proje

Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

best-practices-researcher

The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app

groundhog

401

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

andimarafioti

View profile

View on GitHub

GitHub Stars26

CategoryEducation

Updated1mo ago

Forks2

andimarafioti/audioContextEncoder

Languages

Jupyter Notebook

Security Score

80/100

Audited on Mar 2, 2026

No findings