ASR

A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).

Generate Convert Improve

Install / Use

/learn @amitchone/ASR

About this skill

Quality Score

0/100

README

MFCC Automatic Speech Recognition Algorithm Implementation

A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).

Method

Read audio data and sampling frequency from .wav file
Frame signal
Apply window function to frame (default=hamming)
Calculate DFT of frame
Calculate periodogram power spectral density estimate for each DFT bin
Apply Mel-Frequency filterbank to signal
Sum energies within each filter and take the base 10 logarithm
Take DCT of each filter
Keep coefficients [1:13]
Compute DTW best path and euclidean distance of reference vector and input vector

To-do

Noise gate
Pre-emphasis / Lifter
Feature vector database
Audio record / playback (audio.py)
Multithread MFCC extraction
Create MFCC extractor as class?

Related Skills

node-connect

340.5k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

claude-opus-4-5-migration

84.2k

Migrate prompts and code from Claude Sonnet 4.0, Sonnet 4.5, or Opus 4.1 to Opus 4.5

frontend-design

84.2k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

model-usage

340.5k

Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.

amitchone

View profile

View on GitHub

GitHub Stars17

CategoryDevelopment

Updated1y ago

Forks4

amitchone/ASR

Languages

Python

Security Score

65/100

Audited on Jul 3, 2024

No findings