Seda2Text 🎙️

Seda2Text is a desktop application that converts Persian (Farsi) speech from audio or video files into text subtitles (.srt). It runs the OpenAI Whisper speech-recognition model through the Hugging Face Transformers library and provides a graphical interface built with PyQt6.

🧩 Features

- User-friendly GUI built with PyQt6

- Supports audio and video input formats (.mp3, .wav, .mp4, .mkv, etc.)

- Uses the Persian Whisper model (vhdm/whisper-large-fa-v1) for accurate transcription

- Option to use a local Whisper model for offline transcription

- CUDA GPU acceleration for faster processing (if available)

- Generates synchronized SRT subtitle files
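
The transcription-related features above can be illustrated with a minimal sketch. It is not taken from the Seda2Text source. It assumes the Hugging Face Transformers `pipeline` API and PyTorch are installed, and the names `select_device`, `transcribe`, and `DEFAULT_MODEL` are hypothetical. The 30-second chunk length is likewise an assumption, and whether a video container can be passed directly or needs its audio extracted with FFmpeg first depends on the application.

```python
from typing import Optional

DEFAULT_MODEL = "vhdm/whisper-large-fa-v1"  # model id named in the feature list


def select_device() -> str:
    """Return "cuda:0" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch  # imported lazily so the helper also works without PyTorch
    except ImportError:
        return "cpu"
    return "cuda:0" if torch.cuda.is_available() else "cpu"


def transcribe(audio_path: str, local_model_dir: Optional[str] = None) -> dict:
    """Transcribe an audio file and return the pipeline result with timestamps.

    Assumption: a local model directory is passed in place of the Hub model id
    when offline use is wanted.
    """
    from transformers import pipeline  # heavy import, deferred until needed

    asr = pipeline(
        "automatic-speech-recognition",
        model=local_model_dir or DEFAULT_MODEL,
        device=select_device(),
        chunk_length_s=30,  # assumed chunk size for long recordings
    )
    # return_timestamps=True adds a "chunks" list with (start, end) times
    return asr(audio_path, return_timestamps=True)
```

Calling `transcribe("talk.wav")` would download the model on first use. Passing `local_model_dir` avoids the download, which is the idea behind the offline option.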

🧠 Requirements

- Python 3.9 or later

- FFmpeg installed and added to your system PATH

- (Optional) NVIDIA GPU with CUDA support

📦 Installation

1- Clone the repository:

git clone https://github.com/yourusername/seda2text.git
cd seda2text

2- Install dependencies:

pip install -r requirements.txt

3- Check FFmpeg installation:

ffmpeg -version
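
The same checks can be scripted. The following is a small sketch, not part of the project, that uses only the Python standard library to confirm the Python version and look for FFmpeg on PATH. The function name `check_environment` is hypothetical.

```python
import shutil
import sys


def check_environment() -> dict:
    """Report whether the documented requirements appear to be met."""
    return {
        "python_ok": sys.version_info >= (3, 9),
        "ffmpeg_path": shutil.which("ffmpeg"),  # None when FFmpeg is not on PATH
    }


if __name__ == "__main__":
    status = check_environment()
    print("Python 3.9+:", status["python_ok"])
    print("FFmpeg:", status["ffmpeg_path"] or "not found on PATH")
```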

🚀 Usage

1- Run the application:

python main.py

2- In the GUI:

- Select a video or audio file.

- (Optional) Check “Use a local Whisper model” and choose your model directory.

- Click Generate Subtitles.

The generated .srt subtitle file will be saved in the same directory as your input file.
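
For readers curious how timestamped text becomes an .srt file next to the input, here is a minimal sketch. It is an illustration rather than the application's actual code. It assumes chunks shaped like `{"text": ..., "timestamp": (start, end)}` with times in seconds, and the helper names `srt_time`, `chunks_to_srt`, and `srt_path_for` are hypothetical.

```python
from pathlib import Path


def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks: list) -> str:
    """Build SRT text from [{"text": ..., "timestamp": (start, end)}, ...]."""
    blocks = []
    for chunk in chunks:
        start, end = chunk["timestamp"]
        if end is None:  # assumption: a final chunk may lack an end time
            end = start
        index = len(blocks) + 1
        text = chunk["text"].strip()
        blocks.append(f"{index}\n{srt_time(start)} --> {srt_time(end)}\n{text}\n")
    return "\n".join(blocks)  # blank line between consecutive cues


def srt_path_for(media_path: str) -> Path:
    """Return the .srt path in the same directory as the input file."""
    return Path(media_path).with_suffix(".srt")
```

Writing the result could then look like `srt_path_for(path).write_text(chunks_to_srt(chunks), encoding="utf-8")`. Specifying UTF-8 explicitly matters here because the subtitle text is Persian.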
