1,092 skills found · Page 18 of 37
monodera / Pyannote Whisper ChatgptA Python package to transcribe speech by Whisper with diarization (speaker identification) using pyannote.audio and send the results to OpenAI Chat API to generate, for example, the summary of the conversation.
fangyuan99 / AudiototxtA python script based on gemini and yt-dlp can extract text scripts of local audio or online video. 一个基于 gemini 和 yt-dlp 的 python 脚本,可以提取本地音频或者在线视频的文字稿。
landell / LandellLandell captures audio and video and streams it to Icecast servers. It provides a number of different configurations for the user, picture-in-picture and supports different sources of video. Landell is developed with GStreamer, Python and Gtk+.
gbrlpzz / ScribeScribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.
poseidon-code / Ytmp3 DlPython script for multi-threaded download of audio (in .mp3) from any YouTube video/audio link.
Pezz89 / PySoundConcatA Python project for generating concatenative synthesis driven representations of audio files based on audio database analysis.
elidepb / Audio Transcription With PythonJupyter notebook for audio processing using OpenAI's Whisper model. Includes library installation, dependency setup, and examples for transcription and audio analysis in Python. Perfect for exploring advanced speech recognition tools.
cxyfer / GeminiASRA Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.
takaswie / Hinawa UtilsPython 3 module and scripts to handle Audio and Music units on IEEE1394 bus via libhinawa API with a help of PyGObject.
PranavPutsa1006 / Speaker DiarizationIdentifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
whittlbc / Python Speech RecogBoilerplate for real-time streaming of mic audio to Google Cloud Speech API in Python
jack-tol / Youtube To AudioA lightweight Python package and command-line interface (CLI) tool that extracts audio from YouTube videos and playlists in multiple formats, such as MP3, WAV, OGG, AAC, and FLAC.
henrikschnor / PasimpleA python wrapper for the "PulseAudio simple API". Supports playing and recording audio via PulseAudio and PipeWire.
Bravco / Warcraft Fishing BotA first one of a kind fully-automatic World of Warcraft audio-based fishing bot programmed in Python.
iSpeech / ISpeech Speech Recognition ASR Voice Recognition.jsiSpeech's open source javascript SDK for speech recognition (ASR) API, enables you to easily create Web applications using iSpeech freeform, command or custom statistical language models. The speech recognition API powering this speech recognition SDK supports nearly 30 languages and accents. The acoustic models are based on huge amounts of low and high quality hand labeled audio data (millions of utterances). iSpeech is a viable alternative to Google ASR (Web Speech API), which only includes Voice Search and Freeform Models, or others that do not offer a Javascript SDK to our knowledge. -Currently this SDK has been tested and works on Chrome and Firefox, but not Internet Explorer (IE) or Safari - (has not been tested on Opera browser) *you may need to contact iSpeech to create custom models **if you do not have an iSpeech account and need to test, you may request credits *** iOS, Android, Java, .NET, Python and other SDKs are also available
rhelmot / Sound MachinePython library for musical audio synthesis
shalabycr7 / AudioHazeAn Audio Wave Processing App to Manipulate/Edit Audio Files Using Python
Dynam1co / Python Youtube Video DownloaderPrograma en Python para descargar vídeo o audio de Youtube
rknLA / PyjackJack Audio Server client written in Python
ngbala6 / Audio ProcessingThis repo is for Audio Processing Techniques and the Silence Remove using Python