LetItBee

A Python implementation of Driedger's "Let It Bee" technique for audio mosaicing (1), which mashes up sounds from a source file (e.g. buzzing bees) to approximate a target audio file (e.g. The Beatles' "Let It Be"). The result sounds like the target, but with the timbre of the source (e.g. bees buzzing "Let It Be"). The technique uses a modified version of NMF (non-negative matrix factorization) in which the source audio supplies the W matrix and the target supplies the V matrix. It searches for an activation matrix H such that V ~= WH, with modifications so that the timbral characteristics of the source are preserved. See (1) for more details.

(1) Driedger, Jonathan, Thomas Prätzlich, and Meinard Müller. "Let It Bee - Towards NMF-Inspired Audio Mosaicing." ISMIR, 2015.
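
To make the V ~= WH idea concrete, here is a minimal sketch of NMF with a fixed W, where only the activations H are learned. It uses the standard multiplicative update for the KL divergence and deliberately leaves out the timbre-preserving modifications from (1), so it is not the code in Musaicing.py. The function names (kl_divergence, fit_activations) and the random stand-in matrices are assumptions made for illustration; in practice V and W would be non-negative spectrograms of the target and the source.

```python
import numpy as np

def kl_divergence(V, WH, eps=1e-12):
    """Generalized KL divergence D(V || WH) for non-negative matrices."""
    V = V + eps
    WH = WH + eps
    return float(np.sum(V * np.log(V / WH) - V + WH))

def fit_activations(V, W, n_iters=50, eps=1e-12, seed=0):
    """Learn H >= 0 such that V ~= W @ H while W stays fixed.

    V: (n_freqs, n_target_frames) non-negative matrix (stand-in for the target)
    W: (n_freqs, n_source_frames) non-negative matrix (stand-in for the source)
    Returns H and the KL error after each iteration.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    # Column sums of W, i.e. W^T applied to a matrix of ones
    col_sums = W.sum(axis=0)[:, None] + eps
    errors = []
    for _ in range(n_iters):
        WH = W @ H + eps
        # Plain multiplicative KL update for H only; W is never updated
        H *= (W.T @ (V / WH)) / col_sums
        errors.append(kl_divergence(V, W @ H))
    return H, errors

if __name__ == "__main__":
    # Hypothetical toy data standing in for real spectrograms
    rng = np.random.default_rng(1)
    W = rng.random((20, 8))
    V = W @ rng.random((8, 15))
    H, errors = fit_activations(V, W, n_iters=30)
    print("first KL error:", errors[0])
    print("last KL error:", errors[-1])
```

With this plain update the KL error is expected to keep going down. The modified updates described in (1) trade some of that approximation quality for activations that keep longer, more recognizable stretches of the source.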

Dependencies

Numpy/Scipy/Matplotlib
librosa
The Rubberband Library

Running

To see all options, run the script as follows:

python Musaicing.py --help

To obtain the buzzing bees (Bees_Buzzing.mp3) and The Beatles (Beatles_LetItBe.mp3) audio files, please visit the supporting web site for the original paper. The following command runs the code on this example and plots progress along the way:

python Musaicing.py --source Bees_Buzzing.mp3 --target Beatles_LetItBe.mp3 --result result.wav --saveplots 1

The audio result is as follows:

Result

The plot below shows the iterations and the final spectrogram decomposition:

[Figure: NMF Decomposition of Let It Be into Bees Buzzing]

Notice that the KL error does not monotonically decrease and starts to go up towards the end, but this is consistent with the choices made in (1) to preserve timbral characteristics of the source.
