Hark Visualizer
A web app, written with the Tornado framework, for visualizing audio. For the time being it only processes 8-channel FLAC and WAV files. For each file it detects the direction and duration of sounds and produces a transcription using speech recognition. Try it out using the test.wav file. Only Japanese is supported at the moment.
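Because only 8-channel input is accepted, it can help to check a file before uploading it. The following is a minimal sketch using Python's standard `wave` module; it is not part of the app, the function names `describe_wav` and `is_supported` are made up for illustration, and it covers WAV only (the `wave` module does not read FLAC).

```python
import wave

# The README states that only 8-channel audio is processed.
REQUIRED_CHANNELS = 8


def describe_wav(path):
    """Return (channel_count, duration_in_seconds) for a WAV file."""
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        duration = wav.getnframes() / wav.getframerate()
    return channels, duration


def is_supported(path):
    """Return True when the file has the channel count the app expects."""
    channels, _ = describe_wav(path)
    return channels == REQUIRED_CHANNELS
```

For example, `is_supported("test.wav")` should return `True` if the bundled test file is 8-channel as described.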
Third-Party Services
Amazon AWS
Used for hosting the web server in Japan.
Tornado
Used as the web server for serving the web app and delivering data to the browser via WebSockets.
Microsoft Cognitive Services/Azure
Used for hosting the Bing Speech API server instance.
Hark SaaS
Used for analyzing the audio files.
Speech Recognition
Used for transcribing the audio file via a wrapper around the Bing Speech API, as the Google Speech API is no longer available.
d3.js
Used for creating real-time data visualizations in the browser.
crossfilter.js
Used for n-dimensional filtering of multivariate datasets across D3 charts.
c3.js
A wrapper around D3.js for building charts quickly.
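The Tornado entry above mentions delivering data to the browser via websockets. The sketch below shows roughly what that pattern looks like with Tornado's `WebSocketHandler`. It is not the app's actual code: the `/ws` route, the `ResultsSocket` class, the `encode_event` helper, and the JSON field names are all assumptions chosen for illustration.

```python
import json

import tornado.web
import tornado.websocket


def encode_event(azimuth_deg, start_s, end_s, text=""):
    """Serialize one detected sound as JSON (hypothetical message schema)."""
    return json.dumps(
        {"azimuth": azimuth_deg, "start": start_s, "end": end_s, "text": text}
    )


class ResultsSocket(tornado.websocket.WebSocketHandler):
    """Pushes analysis results to a connected browser."""

    def open(self):
        # Illustrative payload only; a real handler would forward results
        # from the analysis service as they become available.
        self.write_message(encode_event(30.0, 0.5, 1.8))

    def on_message(self, message):
        # This sketch ignores anything the browser sends.
        pass


def make_app():
    # "/ws" is an assumed route name.
    return tornado.web.Application([(r"/ws", ResultsSocket)])
```

On the browser side, a page would open a `WebSocket` to this route and hand each decoded message to the D3-based charts.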
