LetItBee
A Python Implementation of Driedger's "Let It Bee" Technique for Audio Mosaicing
Install / Use
/learn @ctralie/LetItBeeREADME
LetItBee
A Python Implementation of Driedger's "Let It Bee" Technique for Audio Mosaicing (1), which mashes up sounds from a source file (e.g. buzzing bees) to approximate an audio file (e.g. The Beatles' "Let It Be") to create a result which sounds like the target, but with the timbre of the source (e.g. bees buzzing "Let It Be"). This uses a modified version of NMF which uses the source audio as the W matrix and the target as the V matrix, searching for V ~= WH, but with modifications so that timbral characteristics are preserved. See for more details.
- (1) Driedger, Jonathan, Thomas Prätzlich, and Meinard Müller. "Let it Bee-Towards NMF-Inspired Audio Mosaicing." ISMIR. 2015.
Dependencies
- Numpy/Scipy/Matplotlib
- librosa
- The Rubberband Library
Running
To see all options, run the script as follows
python Musaicing.py --help
To obtain the buzzing bees (Bees_Buzzing.mp3) and The Beatles (Beatles_LetItBe.mp3) audio file, please visit the supporting web site for the original paper. Below shows an example of how to execute the code with this example, plotting progress
python Musaicing.py --source Bees_Buzzing.mp3 --target Beatles_LetItBe.mp3 --result result.wav --saveplots 1
The audio result is as follows:
Below shows a plot of the iterations and final spectrogram decomposition

Notice that the KL error does not monotonically decrease and starts to go up towards the end, but this is consistent with the choices made in (1) to preserve timbral characteristics of the source.
Related Skills
node-connect
344.1kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
96.8kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
344.1kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
344.1kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
