1,090 skills found · Page 14 of 37
sockheadrps / RpaudioRust audio api binding for python
dohyeondk / Sub ToolsA robust Python toolkit for converting video/audio content into accurate, multilingual subtitles using WhisperX for transcription and Google's Gemini API for proofreading and translation.
DRX-Lab / Atmos EncoderAtmos-Encoder is a Python-based tool that converts audio files into Dolby Digital Plus (E-AC3) format, preserving Dolby Atmos metadata when present. It automatically detects Atmos content and outputs both 5.1 and 7.1 channel tracks. Requires Dolby Encoding Engine (DEE) binaries (not included).
DarrenKe / Music Mood Detection Using Text Mining On Lyrics During my undergrad, I implemented a music recommendation system based on music digital track analysis. However, it's time for me to use text mining technology on lyrics to upgrade that project. Goals: (1)build a music mood(happy or sad) classifier based on lyrics analysis (2)what words and their distributions are in different mood categories? (3)How are the key words change in songs for the recent years? Project evaluation: (1)data collection: the training data and validation data will be collected from the largest lyric database on Lyricwiki.org (2)feature selection: the most common feature type to consider are BOW(bag of word) and POS(part of speech) combined with stemming using word-net (3)Training model : SVM, Naive Bayes using grid search method. (4)data visualization for goal two and three This project will be done using python on jupyter notebook. reference: Hu, X. (2010). Improving music mood classification using lyrics, audio and social tags (Doctoral dissertation, University of Arizona).
riad-azz / Audio Extractaudio-extract is a Python library that allows you to extract audio from video files and trim the audio according to your needs.
flavioeverardo / Erb BandsERB representation of an audio file implemented in Python
lukerbs / ForcealignForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level text alignments of audio, with each word or phoneme's start and end time within the audio. ForceAlign was designed to be easy to install and use, without requiring any third-party, non-Python dependencies.
sevengivings / Subtitle XtranslatorA Python script for AI speech recognition of video or audio file using whisper, stable-ts, or faster-whisper and it can translate subtitles using Google Cloud, Naver Papago and DeepL APIs
jaqx0r / PymadA Python wrapper for the MPEG Audio Decoder (MAD) library
arfoll / Rs232serverPython application permitting the control of a AV receivers and TVs with serial (rs232) connections (cambridge audio xxxR series, LG TVs, pionner TVs, Arcam receivers)
Quentendo64 / TonieToolboxA Python tool for converting audio files to Tonie box compatible format (TAF - Tonie Audio Format).
said-ohamouddou / AudioChatGPTDesktop application written in python and PyQt6 which allows to interact with openai ChatGPT API, either by text messages or by audios.
Automate-Animation / SynctoonPython-based automated 2D animation tool that generates videos from text scripts and audio files. Uses AI for text analysis, lip sync, and animation cue extraction to create frame-by-frame character animations, then compiles them into a final video.
nerufuyo / NeruconvertoA comprehensive Python-based audio converter that transforms M4A files to MP3 format with multiple usage options: command-line interface, GUI application, and batch processing.
ronnyandersson / ZignalPython audio signal processing library
aflr-archive / Apiaudio Pythonapi.audio Python SDK
pzelasko / CylimiterA C++/Cython audio limiter for Python.
ctralie / LetItBeeA Python Implementation of Driedger's "Let It Bee" Technique for Audio Mosaicing
nnyj / Python Audio Separator LivePython Audio Separator in Real Time using MDX-NET model
ga642381 / AudioCodec HubAudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models