Nupic.audio

Audio (analog, digital) experiments using NuPIC HTM/CLA

Generate Convert Improve

Install / Use

/learn @htm-community/Nupic.audio

About this skill

Quality Score

0/100

README

nupic.audio

![Gitter](https://badges.gitter.im/Join Chat.svg) Gitter public chat channel

Auditory experiments using cortical learning algorithms (CLA) and hierarchical temporal memory (HTM).

Repositories of interest

Numenta's nupic.critic Audio streaming
NuMozart Digital (MIDI) streaming and composition
HTMforGenreClassification Genre classification
Hackathon scripts and data for Musenta

Note: These repositories currently are all work-in-progress.

Online videos of interest

Taken from the collection gathered via Gitter channel https://gitter.im/rcrowder/EncodingSpecificityPrinciple -

Online books and references

https://jp.mathworks.com/matlabcentral/answers/uploaded_files/23580/index.pdf Spectral Envelopes in Sound Analysis and Synthesis By Diemo Schwarz, Diplomarbeit Nr. 1622, IRCAM (Institut de la Recherche et Coordination Acoustique/Musique)
https://ccrma.stanford.edu/~jos/dft/
Mathematics of the Discrete Fourier Transform (DFT) with audio appliccations
By Julius O. Smith III, Center for Computer Research in Music and Acoustics (CCRMA)
http://www.dspguide.com/
The Scientist and Engineer's Guide to Digital Signal Processing
By Steven W. Smith, Ph.D.
http://www.eecs.qmul.ac.uk/~simond/pub/2012/PlumbleyDixon12-ima-tutorial-slides.pdf
Tutorial: Music Signal Processing
By Mark Plumbley and Simon Dixon, Centre for Digital Music (Queen Mary University of London)

Potential areas of investigation

Genre and style classification
Musical prediction and composition
Acoustic correlation using canonical correlation analysis (CCA)
Transient analysis (harmonic tracking)
Motion derivative encoding (similar to optical flow)
Echo location and spatial positioning (e.g. Anterior Ventral Cochlea Nucleus)
Stream segmentation and seperation (includes selective attention)
Cortical pathways and projections, 'What' and 'Where' pathways (belts?)
Auditory nerve spike firing (e.g. IHC to CN GBC integrators)
Dendritic micro-circuits and synaptic placement (temporal smoothing)
Spike-timing dependent plasticity
Acetylcholine inhibition enhancing discharge frequency but decreasing synaptic adaption
Acoustic related cell, and dendrite, membrane properties (cascading conductances, shunting)

An alternative for the encoding of audio signals is the modelling of spike firing of auditory-nerve fibers. A collection of models can be found in the EarLab @ Boston University (http://earlab.bu.edu/ See Modelling -> Downloadable Models). If you plan to use these models, beware of their history and limitations. For example, early models lack some necessary non-linearity in their responses.

Related Skills

node-connect

347.9k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

108.7k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

347.9k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

347.9k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。