MultimodalAnalysis

Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science at NCSR Demokritos

multimodalAnalysis

Related News

Special issue in Pattern Recognition in Multimedia Signal Analysis. Deadline: 28 February 2021

General

This repository contains sample code for the courses:

"Machine Learning for Multimodal Data" of the MSc in Artificial Intelligence, of the University of Piraeus and the National Centre for Scientific Research "Demokritos".
"Multimodal Information Processing and Analysis" of the MSc in Data Science, of the National Centre for Scientific Research "Demokritos" and the University of the Peloponnese.

This material covers introductory topics in audio segmentation and classification, image processing, image feature extraction, segmentation and classification, video analysis, and multimodal fusion.
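To make the last topic concrete, here is a minimal sketch of the two most common fusion strategies: feature-level (early) fusion and decision-level (late) fusion. It is not taken from the course code. The function names `early_fusion` and `late_fusion`, the `audio_weight` parameter, and all numeric values are hypothetical and chosen only for illustration.

```python
def early_fusion(audio_features, visual_features):
    """Feature-level fusion: concatenate the per-modality feature vectors."""
    return list(audio_features) + list(visual_features)


def late_fusion(audio_probs, visual_probs, audio_weight=0.5):
    """Decision-level fusion: weighted average of per-class probabilities.

    Both inputs map class label -> probability and must share the same labels.
    """
    if set(audio_probs) != set(visual_probs):
        raise ValueError("both modalities must score the same classes")
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    visual_weight = 1.0 - audio_weight
    return {
        label: audio_weight * audio_probs[label] + visual_weight * visual_probs[label]
        for label in audio_probs
    }


# Hypothetical values, for illustration only.
fused_vector = early_fusion([0.12, 0.80], [0.33, 0.05, 0.41])
fused_scores = late_fusion({"speech": 0.8, "music": 0.2},
                           {"speech": 0.4, "music": 0.6})
predicted = max(fused_scores, key=fused_scores.get)
print(len(fused_vector), predicted)  # prints: 5 speech
```

In early fusion a single classifier is trained on the concatenated vector, whereas in late fusion each modality keeps its own classifier and only the outputs are combined.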

Dependencies

All code has been tested with Python 3. Dependencies can be installed with pip, using the requirements.txt file in each folder (e.g. audio/requirements.txt).
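Because each folder carries its own requirements.txt, installation is done per folder. The sketch below shows one way to automate that from Python. It assumes the requirements files sit exactly one level below the repository root, as in audio/requirements.txt. The helper names `find_requirements`, `pip_install_command` and `install_all` are hypothetical and not part of the repository.

```python
import subprocess
import sys
from pathlib import Path


def find_requirements(repo_root):
    """Return every <folder>/requirements.txt directly under repo_root, sorted."""
    return sorted(Path(repo_root).glob("*/requirements.txt"))


def pip_install_command(requirements_file):
    """Build the pip command for one requirements file (does not run it)."""
    # sys.executable ensures pip installs into the interpreter that is running.
    return [sys.executable, "-m", "pip", "install", "-r", str(requirements_file)]


def install_all(repo_root="."):
    """Install the dependencies of each folder, one requirements file at a time."""
    for requirements_file in find_requirements(repo_root):
        subprocess.run(pip_install_command(requirements_file), check=True)
```

Calling `install_all(".")` from the repository root would install every folder's dependencies in turn. To install a single folder only, pass its file to `pip_install_command` and run the resulting command.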

Course Presentations

| Link | Title |
| --- | --- |
| <a href="https://drive.google.com/open?id=15P2gumoXUbfvm4L2ghWfoyYZHWrD7WBB370ca4T-Cko" target="_blank">Course 1</a> | Intro to Multimodal Signal Analysis |
| <a href="https://drive.google.com/open?id=1heH7rKGEEySVh3sK583MuwqlNwACiAerHQw4JQTntI4" target="_blank">Course 2</a> | Audio Representations and Feature Extraction |
| <a href="https://drive.google.com/open?id=18fkOP3GjAggdg86BGz_TvOxNxxh5YDeL3YMGOI2cMhQ" target="_blank">Course 3</a> | Audio Classification / Regression |
| <a href="https://drive.google.com/open?id=1prbiNhaU7xrj0qfOnk4bgWMXHWax_hReQYf6hGkZk0A" target="_blank">Course 4</a> | Audio Segmentation |
| <a href="https://drive.google.com/open?id=1mCMSCQadfkkRkblHHo9CkPDpgiXajIad1UHnZjZnYhc" target="_blank">Course 5</a> | Image Feature Extraction - 1 |
| <a href="https://drive.google.com/open?id=1h9WBQZnLHikqIAqgmR_uCJUEH0Pfvqs6-D5JouAaLkw" target="_blank">Course 6</a> | Image Feature Extraction - 2 |
| <a href="https://drive.google.com/open?id=1k3qJzSh-ytyZktvTJ7cVZpH0jFBWD6aZhu33u_4-hOM" target="_blank">Course 7</a> | Video Feature Extraction - 1 |
| <a href="https://drive.google.com/open?id=1qNEdx25RdtzfPY8jWB5FzWbFWoZfnUhvtrfi3xi4nMw" target="_blank">Course 8</a> | Audio Fingerprinting |
| <a href="https://drive.google.com/open?id=1ojM7AtQVdwOYXMkyrjpswhAyd4PnpDKh5kzWsSxwWSQ" target="_blank">Course 9</a> | DL 1 |
| <a href="https://drive.google.com/open?id=1d_qBD7ootzFPWdI3qJ_RhUAopDRhDGssFg6NCkiVwq0" target="_blank">Course 10</a> | DL 2 |

Author

Theodoros Giannakopoulos, Director of Machine Learning at Behavioral Signals
