EmoVoice

Build your own Real-time Speech Emotion Recognizer

EmoVoice is a set of tools that let you build your own real-time emotion recognizer based on the acoustic properties of speech (without using word information).

Platform

Windows

Installation

Make sure the Visual Studio 2015 Redistributable is installed on your machine. Then run install.cmd to download the core binaries and install an embedded version of Python.

If you plan to extract SoundNet features, you will also have to execute install_tensorflow.cmd and download the file sound8.npy into the chains folder.
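
Before running a SoundNet feature extraction, it can help to confirm that the weights file is where the step above puts it. The following is a minimal sketch, not part of EmoVoice itself: the function name soundnet_weights_ready is hypothetical, and it assumes the chains folder sits directly under the directory you pass in (the README does not state the exact layout).

```python
from pathlib import Path


def soundnet_weights_ready(repo_root: str) -> bool:
    """Return True if sound8.npy is present in the chains folder.

    Assumption: the chains folder is a direct child of repo_root.
    """
    weights = Path(repo_root) / "chains" / "sound8.npy"
    return weights.is_file()


if __name__ == "__main__":
    # Check the current working directory, assumed here to be the checkout root.
    if soundnet_weights_ready("."):
        print("sound8.npy found in the chains folder.")
    else:
        print("sound8.npy is missing; download it into the chains folder.")
```

The check only verifies that the file exists; it does not validate the file's contents or confirm that install_tensorflow.cmd has been run.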

Documentation

https://rawgit.com/hcmlab/emovoice/master/docs/index.html

Credits

  • SSI -- Social Signal Interpretation Framework
  • LIBSVM -- A Library for Support Vector Machines
  • LIBLINEAR -- A Library for Large Linear Classification
  • openSMILE -- The Munich Versatile and Fast Open-Source Audio Feature Extractor
  • Emo-DB -- Berlin Database of Emotional Speech
  • SoundNet -- TensorFlow implementation of "SoundNet"

Reference

@inproceedings{Wagner13,
 author = {Wagner, Johannes and Lingenfelser, Florian and Baur, Tobias and Damian, Ionut and Kistler, Felix and Andr{\'e}, Elisabeth},
 title = {The social signal interpretation (SSI) framework: multimodal signal processing and recognition in real-time},
 booktitle = {Proceedings of the 21st ACM international conference on Multimedia},
 series = {MM '13},
 year = {2013},
 isbn = {978-1-4503-2404-5},
 location = {Barcelona, Spain},
 pages = {831--834},
 numpages = {4},
 url = {http://doi.acm.org/10.1145/2502081.2502223},
 doi = {10.1145/2502081.2502223},
 acmid = {2502223},
 publisher = {ACM},
 address = {New York, NY, USA},
 keywords = {multimodal fusion, open source framework, real-time pattern recognition, social signal processing},
}

License

The framework is released under the LGPL (see LICENSE). Please note that the plug-ins have their own license files (see LICENSE.*).

Author

Johannes Wagner, Lab for Human Centered Multimedia, 2018
