MultimodalAnalysis
Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos
Related News
Special issue on Pattern Recognition in Multimedia Signal Analysis (submission deadline: 28 February 2021)
General
This repository contains sample code for the following courses:
- "Machine Learning for Multimodal Data" of the MSc in Artificial Intelligence, of the University of Piraeus and the National Centre for Scientific Research "Demokritos".
- "Multimodal Information Processing and Analysis" of the MSc in Data Science, of the National Centre for Scientific Research "Demokritos" and the University of the Peloponnese.
The material covers introductory topics in audio segmentation and classification, image processing, image feature extraction, image segmentation and classification, video analysis, and multimodal fusion.
Dependencies
All code has been tested with Python 3. Dependencies can be installed using pip and the requirements.txt file in each folder (e.g. audio/requirements.txt).
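For a single folder, `pip install -r audio/requirements.txt` is enough. To cover several folders in one go, the following is a minimal sketch, assuming each topic folder sits directly under the repository root and holds its own requirements.txt. The helper names (`find_requirements`, `pip_install_command`, `install_all`) are hypothetical and not part of this repository; only `audio/requirements.txt` is confirmed above.

```python
import subprocess
import sys
from pathlib import Path


def find_requirements(repo_root="."):
    """Return every requirements.txt found one level below repo_root, sorted by path."""
    return sorted(Path(repo_root).glob("*/requirements.txt"))


def pip_install_command(requirements_file):
    """Build the pip command that installs one requirements file.

    Using sys.executable keeps the install inside the interpreter
    (or virtual environment) that is running this script.
    """
    return [sys.executable, "-m", "pip", "install", "-r", str(requirements_file)]


def install_all(repo_root="."):
    """Install the dependencies of each folder, one requirements file at a time."""
    for requirements_file in find_requirements(repo_root):
        print(f"Installing {requirements_file}")
        subprocess.check_call(pip_install_command(requirements_file))
```

Calling `install_all()` from the repository root would then install each folder's dependencies in turn. Running it inside a virtual environment avoids version clashes with other projects.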
Course Presentations
| Link | Title |
| --- | --- |
| <a href="https://drive.google.com/open?id=15P2gumoXUbfvm4L2ghWfoyYZHWrD7WBB370ca4T-Cko" target="_blank">Course 1</a> | Intro to Multimodal Signal Analysis |
| <a href="https://drive.google.com/open?id=1heH7rKGEEySVh3sK583MuwqlNwACiAerHQw4JQTntI4" target="_blank">Course 2</a> | Audio Representations and Feature Extraction |
| <a href="https://drive.google.com/open?id=18fkOP3GjAggdg86BGz_TvOxNxxh5YDeL3YMGOI2cMhQ" target="_blank">Course 3</a> | Audio Classification / Regression |
| <a href="https://drive.google.com/open?id=1prbiNhaU7xrj0qfOnk4bgWMXHWax_hReQYf6hGkZk0A" target="_blank">Course 4</a> | Audio Segmentation |
| <a href="https://drive.google.com/open?id=1mCMSCQadfkkRkblHHo9CkPDpgiXajIad1UHnZjZnYhc" target="_blank">Course 5</a> | Image Feature Extraction - 1 |
| <a href="https://drive.google.com/open?id=1h9WBQZnLHikqIAqgmR_uCJUEH0Pfvqs6-D5JouAaLkw" target="_blank">Course 6</a> | Image Feature Extraction - 2 |
| <a href="https://drive.google.com/open?id=1k3qJzSh-ytyZktvTJ7cVZpH0jFBWD6aZhu33u_4-hOM" target="_blank">Course 7</a> | Video Feature Extraction - 1 |
| <a href="https://drive.google.com/open?id=1qNEdx25RdtzfPY8jWB5FzWbFWoZfnUhvtrfi3xi4nMw" target="_blank">Course 8</a> | Audio Fingerprinting |
| <a href="https://drive.google.com/open?id=1ojM7AtQVdwOYXMkyrjpswhAyd4PnpDKh5kzWsSxwWSQ" target="_blank">Course 9</a> | DL 1 |
| <a href="https://drive.google.com/open?id=1d_qBD7ootzFPWdI3qJ_RhUAopDRhDGssFg6NCkiVwq0" target="_blank">Course 10</a> | DL 2 |
Author
<img src="https://tyiannak.github.io/files/3.JPG" align="left" height="100"/>Theodoros Giannakopoulos, Director of Machine Learning at Behavioral Signals
