SkillAgentSearch skills...

BinauralLocalizationCNN

Code to create networks that localize sounds sources in 3D environments

Install / Use

/learn @afrancl/BinauralLocalizationCNN
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

BinauralLocalizationCNN

Code to create networks that localize sounds sources in 3D environments

  1. Main training/testing python script is call_model_training_valid_pad_francl.py.

    • This script is responsible for processing the experiment parameters, validating the model folder, saving a copy of the experiment parameters there and ensuring the same folder isn't used for two different training rounds.
    • An example set of parameeters can be found for testing in slurm_CNN_spherical_testing_francl.sh and in slurm_CNN_spherical_training_francl.sh.
  2. Networks weights can be downloaded at: https://www.dropbox.com/sh/af6vaotxt41i7pe/AACfTzMxMLfv-Edmn33S4gTpa?dl=0

  3. The model input should be nervegrams with their associated metadata saved into tensorflow records. The cochlear model we use is the PyCochleagram package . We have a wrapper to transform stereo .wav files into the proper input available here: https://github.com/afrancl/BinauralDataGen

    • The precomputed training data that were used to train the original model can be downloaded here: https://drive.google.com/drive/folders/1pUyMmkurxEWFeofbLKy5-QVWrcMbM8Wz?usp=sharing

Note: Before running, please change the model save folder to point to your directory with the model architecture config file and data folder to point to your data. Both of these are in the associated shell scripts. The code itself contains no absolute paths.

Setup

To aid reproducibility and decrease setup time we provide a Singularity Image that contains all packages necessary to run the code without any further setup. The image is available on dropbox here: https://www.dropbox.com/s/ey74fiw4uquww0n/tfv1.13_tcmalloc.simg?dl=0

View on GitHub
GitHub Stars52
CategoryDevelopment
Updated16d ago
Forks14

Languages

Python

Security Score

80/100

Audited on Mar 25, 2026

No findings