SincAlignNet
This implementation is based on the SincAlignNet model from the paper 'Frequency-Based Alignment of EEG and Audio Signals Using Contrastive Learning and SincNet for Auditory Attention Detection'.
Install / Use
/learn @LiaoEuan/SincAlignNetREADME
SincAlignNet
Implementation based on:
Frequency-Based Alignment of EEG and Audio Signals Using Contrastive Learning and SincNet for Auditory Attention Detection
SincAlignNet is an innovative framework for auditory attention detection that aligns EEG and audio features using an enhanced SincNet architecture with contrastive learning. It achieves state-of-the-art accuracy on KUL and DTU datasets, supporting efficient low-density EEG decoding for practical neuro-guided hearing aids.
Framework Overview
Fig. 1: SincAlignNet architecture for AAD, consisting of two phases:
- Contrastive Learning - Aligns EEG and attended audio encodings by maximizing mutual information of correct EEG-Audio pairs
- Inference - Identifies attended audio via cosine similarity between EEG/audio features or direct EEG-based inference
Encoder Architecture
Fig. 2: EEG and Audio encoder structure. Both encoders contain four components:
- Multi-SincNet Bandpass
- EEG: 60 filters | Audio: 320 filters
- Depth Conv1D - Combines filter outputs for deeper features
- Down Sample - Compresses data while preserving key information
- Projector - Maps features to 128D latent space
Module Specifications
<img width="525" height="414" alt="image" src="https://github.com/user-attachments/assets/b74521b9-c58e-41f2-8865-4205f79812d5" />Fig. 3: Component implementations:
(a) Depth-wise 1D convolution block
(b) Down sample module
(c) Projector architecture
Biological Motivation
Fig. 4: Proposed auditory attention mechanisms:
-
Noise Reduction (Fig 4a)
- Brain processes mixed audio → extracts attended speaker
- Simulated using SincNet filtering architecture
-
Information Minimization (Fig 4b)
- Attentional focus minimizes mutual information entropy
- Implemented via contrastive learning paradigm
Related Skills
proje
Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
flutter-tutor
Flutter Learning Tutor Guide You are a friendly computer science tutor specializing in Flutter development. Your role is to guide the student through learning Flutter step by step, not to provide d
