Festival
Upstream releases and custom modifications to the Festival TTS project.
Install / Use
/learn @rhdunn/FestivalREADME
The Festival Speech Synthesis System
This directory contains the Festival Speech Synthesis System, developed at CSTR, University of Edinburgh. The project was originally started by Alan W Black and Paul Taylor but many others have been involved.
Festival offers a general framework for building speech synthesis systems as well as including examples of various modules. As a whole it offers full text to speech through a number APIs: from shell level, though a Scheme command interpreter, as a C++ library, and an Emacs interface. Festival is multi-lingual (currently English (US and UK) and Spanish are distributed but a host of other voices have been developed by others) though English is the most advanced.
The system is written in C++ and uses the Edinburgh Speech Tools for low level architecture and has a Scheme (SIOD) based command interpreter for control. Documentation is given in the FSF texinfo format which can generate, a printed manual, info files and HTML.
Compilation
In order to build festival, the Edinburgh Speech Tools project needs to
be built. The location of build tree of these tools needs to be specified
in the ESTDIR environment variable. For example:
export ESTDIR=`pwd`/../speech_tools
The festival project uses a standard autogen-based build system. It
can be built using the following commands:
./autogen.sh
./configure --prefix=/usr
make
sudo make install
Installation
Festival should run on any standard Unix platform. It has already run on Solaris, SunOS, Linux and FreeBSD. It requires a C++ compiler (GCC 2.7.2, 2.8.1, 2.95.[123], 3.2.3 3.3.2 RedHat "gcc-2.96", gcc 3.3, gcc 4.4.x and gcc-4.5.x are our standard compilers) to install. A port to Windows XP/NT/95/98 and 2000 using either Cygnus GNUWIN32, this is still new but many people are successfully using it.
A detailed description of installation and requirements for the whole system is given in the INSTALL file.
License
Festival is free. Earlier versions were restricted to non-commercial use but we have now relaxed those conditions. The 4-clause BSD licence is an X11 style licence thus it can be incorporated in commercial products and free source products without restriction. It has the following copyright:
Centre for Speech Technology Research
University of Edinburgh, UK
Copyright (c) 1996-2014
All Rights Reserved.
The changes to the project are described in the CHANGELOG.md file in order to comply with clause 2 of the BSD license. This also includes the changes in the official festival releases.
Related Skills
node-connect
347.9kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
108.7kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
347.9kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
347.9kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
