10 skills found
krantiparida / Awesome Audio VisualA curated list of different papers and datasets in various areas of audio-visual processing
LittlePey / SFDSparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion (CVPR 2022, Oral)
mhawksey / GeminiAppGeminiApp is a library that allows integration to Google's Gemini API in your Google Apps Script projects. It allows for mutli-modal prompts, structured conversations and function calling
chakka-guna-sekhar-venkata-chennaiah / Mutli Modal RAG ChaBotBuilding Essence Towards Personalized Knowledge Model - PKM
youngbin-ro / Audiotext TransformerMultimodal Transformer for Korean Sentiment Analysis with Audio and Text Features
VachanVY / Transfusion.torchPyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
atlas-2192 / Multi AI Chat APPNo description available
AlmasM / EmotionDetectionMutli-modal research project that combines text analysis and image processing to determine emotions
Kcrypto126 / Multi AI Chat Appchatting app
zjukg / MANS[Paper][IJCNN2023] Modality-Aware Negative Sampling for Multi-modal Knowledge Graph Embedding