Neuralnoise
The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio ๐๏ธ๐
Install / Use
/learn @leopiney/NeuralnoiseREADME
NeuralNoise: The AI Podcast Studio
<p align="center"> <a href="https://colab.research.google.com/drive/1-1aaRFoxJL03oUn7IB0DcfxFeWq7Vw5n?usp=sharing" alt="Open in Google Colab"> <img src="https://colab.research.google.com/assets/colab-badge.svg" /></a> <a href="https://github.com/badges/shields/pulse" alt="Activity"> <img src="https://img.shields.io/github/commit-activity/m/leopiney/neuralnoise" /></a> <a href="https://pypi.python.org/pypi/neuralnoise" alt="PyPI - Latest version"> <img src="https://img.shields.io/pypi/v/neuralnoise.svg" /></a> <a href="https://pypistats.org/packages/neuralnoise" alt="Latest version"> <img alt="PyPI - Downloads" src="https://img.shields.io/pypi/dm/neuralnoise"></a> </p> <div align="center"> <img src="./assets/banner.png" alt="NeuralNoise banner" /> </div>NeuralNoise is an AI-powered podcast studio that uses multiple AI agents working together. These agents collaborate to analyze content, write scripts, and generate audio, creating high-quality podcast content with minimal human input. The team generates a script that the cast team (using a TTS tool of your choice) will then record.
Features
- ๐ Content analysis, script generation, and content edition using AG2 group chat of agents
- ๐ข High-quality voice synthesis with ElevenLabs or OpenAI
- ๐ Audio processing and manipulation with pydub
- ๐ Easy way to manually edit parts of the script and regenerate the podcast
Examples
| Source | Type | NeuralNoise | Download | | ---- | ---- | ---- | ---- | | TikTok owner sacks intern for sabotaging AI project | ๐ Web article | <video src="https://github.com/user-attachments/assets/e79982c8-bb58-4395-8bce-aa25eee0b5c5" /> | Link | | Before you buy a domain name, first check to see if it's haunted| ๐ Web article | <video src="https://github.com/user-attachments/assets/53fabfd9-5422-431a-9ed5-6d9dd58de92e" /> | Link | | Linus Torvalds Comments On The Russian Linux Maintainers Being Delisted | ๐ Web article | <video src="https://github.com/user-attachments/assets/85671e26-ae06-4152-b6a2-e5aa6916e5b0" /> | Link | | Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | ๐ PDF | <video src="https://github.com/user-attachments/assets/9bf999f7-59d9-4f04-a2aa-892c4d727a21" /> | Link | | Ep17. Welcome Jensen Huang | BG2 w/ Bill Gurley & Brad Gerstner | ๐บ YouTube | <video src="https://github.com/user-attachments/assets/e5ac1c08-46d3-4e8b-bea7-4b30b083dc4b" /> | Link | | Notepad++ turns 21, Apple releases M4, OpenAI Search release | ๐ Multiple web articles | <video src="https://github.com/user-attachments/assets/6fea6b51-d75c-4990-9441-3a45118b9ce0" /> | Link |
Objective
The main objective of NeuralNoise is to create a Python package that simplifies the process of generating AI podcasts. It utilizes OpenAI for content analysis and script generation, ElevenLabs for high-quality text-to-speech conversion, and Streamlit for an intuitive user interface.
Installation
To install NeuralNoise, follow these steps:
-
Install the package:
pip install neuralnoiseor from source:
git clone https://github.com/leopiney/neuralnoise.git cd neuralnoise pip install . -
Set up your API keys:
-
Create a
.envfile in the project root -
Add your OpenAI and ElevenLabs API keys:
OPENAI_API_KEY=your_openai_api_key # Optional ELEVENLABS_API_KEY=your_elevenlabs_api_key
-
Usage
To run the NeuralNoise application first make sure that you create a configuration file you want to use. There are examples in the config folder.
Then you can run the application with:
nn generate --name <name> <url|file> [<url|file>...]
Want to edit the generated script?
The generated script and audio segments are saved in the output/<name> folder. To edit the script:
- Locate the JSON file in this folder containing all script segments and their text content.
- Make your desired changes to specific segments in the JSON file. Locate the "sections" and "segments" content in this file that you want to change, then feel free to edit the content of the segments you want to change.
- Run the same command as before with the same name (
nn generate --name <name>) to regenerate the podcast.
The application will regenerate the podcast, preserving unmodified segments and only processing the changed ones. This approach allows for efficient editing without regenerating the entire podcast from scratch.
Roadmap
- [x] Better PDF and articles content extraction.
- [ ] Add interactive ways of using NeuralNoise (Gradio/Colab/etc)
- [ ] Add local LLM provider. More generic LLM configuration. Leverage AutoGen for this.
- [ ] Add local TTS provider
- [ ] Add podcast generation format options: interview, narrative, etc.
- [x] Add podcast generation from multiple source files
- [ ] Add more agent roles to the studio. For example, a "Content Curator" or "Content Researcher" that uses tools to find and curate content before being analyzed. Or a "Sponsor" agent that writes sague into ads sections in the podcast script (ร la LTT).
- [ ] Add music and sound effects options
- [ ] Real-time podcast generation with human and AI collaboration (๐ค)
Contributing
Contributions to NeuralNoise are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Related projects
Related Skills
node-connect
354.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
112.3kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
354.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
354.3kQQBot ๅฏๅชไฝๆถๅ่ฝๅใไฝฟ็จ <qqmedia> ๆ ็ญพ๏ผ็ณป็ปๆ นๆฎๆไปถๆฉๅฑๅ่ชๅจ่ฏๅซ็ฑปๅ๏ผๅพ็/่ฏญ้ณ/่ง้ข/ๆไปถ๏ผใ
