Results for "python-audio"

1,092 skills found · Page 18 of 37

monodera / Pyannote Whisper Chatgpt

A Python package that transcribes speech with Whisper, adds diarization (speaker identification) using pyannote.audio, and sends the results to the OpenAI Chat API to generate, for example, a summary of the conversation.

universal

Updated 11mo ago

fangyuan99 / Audiototxt

A Python script based on Gemini and yt-dlp that extracts transcripts from local audio files or online videos.

gemini cli

Updated 5d ago

landell / Landell

Landell captures audio and video and streams it to Icecast servers. It offers a number of user configurations, including picture-in-picture, and supports different video sources. Landell is developed with GStreamer, Python, and Gtk+.

universal

Updated 3y ago

gbrlpzz / Scribe

Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.

universal

audio-transcription · gpt-35-turbo · openai · +6

Updated 4mo ago

poseidon-code / Ytmp3 Dl

Python script for multi-threaded download of audio (in .mp3) from any YouTube video/audio link.

universal

ffmpeg · mp3-downloader · python · +3

Updated 5d ago

Pezz89 / PySoundConcat

A Python project for generating concatenative synthesis driven representations of audio files based on audio database analysis.

universal

Updated 4mo ago

elidepb / Audio Transcription With Python

Jupyter notebook for audio processing using OpenAI's Whisper model. Includes library installation, dependency setup, and examples for transcription and audio analysis in Python. Perfect for exploring advanced speech recognition tools.

universal

Updated 1mo ago

cxyfer / GeminiASR

A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.

gemini cli

asr · gemini · gemini-api · +1

Updated 11d ago

takaswie / Hinawa Utils

Python 3 module and scripts to handle Audio and Music units on the IEEE1394 bus via the libhinawa API with the help of PyGObject.

universal

alsa · distutils · ieee1394 · +3

Updated 1y ago

PranavPutsa1006 / Speaker Diarization

Identifies individual speakers in an audio stream, based on the unique characteristics of each voice, using Python.

universal

deep-learning · embeddings-extraction · mfcc · +8

Updated 1y ago

whittlbc / Python Speech Recog

Boilerplate for real-time streaming of mic audio to Google Cloud Speech API in Python

universal

Updated 2y ago

jack-tol / Youtube To Audio

A lightweight Python package and command-line interface (CLI) tool that extracts audio from YouTube videos and playlists in multiple formats, such as MP3, WAV, OGG, AAC, and FLAC.

universal

audio · cli · converter · +1

Updated 2mo ago

henrikschnor / Pasimple

A Python wrapper for the "PulseAudio simple API". Supports playing and recording audio via PulseAudio and PipeWire.

universal

Updated 6mo ago

Bravco / Warcraft Fishing Bot

A first-of-its-kind, fully automatic, audio-based World of Warcraft fishing bot programmed in Python.

universal

audio · automatic · automation · +6

Updated 22d ago

iSpeech / ISpeech Speech Recognition ASR Voice Recognition.js

iSpeech's open-source JavaScript SDK for its speech recognition (ASR) API lets you create web applications using iSpeech freeform, command, or custom statistical language models. The speech recognition API powering this SDK supports nearly 30 languages and accents. The acoustic models are based on large amounts of low- and high-quality hand-labeled audio data (millions of utterances). iSpeech is a viable alternative to Google ASR (Web Speech API), which only includes Voice Search and Freeform models, or to others that, to our knowledge, do not offer a JavaScript SDK. The SDK has been tested and works on Chrome and Firefox, but not on Internet Explorer (IE) or Safari; it has not been tested on Opera. You may need to contact iSpeech to create custom models. If you do not have an iSpeech account and need to test, you may request credits. iOS, Android, Java, .NET, Python, and other SDKs are also available.

universal

Updated 1y ago