Q-Future / Q Align - [ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can be efficiently fine-tuned on downstream datasets.
Alibaba-NLP / OmniSearch - Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
clvrai / Relation Network Tensorflow - Tensorflow implementations of Relational Networks and a VQA dataset named Sort-of-CLEVR proposed by DeepMind.
xiaoman-zhang / PMC VQA - PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modalities or diseases.
lab-rasool / Awesome Medical VLMs And Datasets - A list of VLMs tailored for medical RG and VQA; and a list of medical vision-language datasets
vztu / BVQA Benchmark - A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
baeseongsu / Mimic Cxr Vqa - A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images'. (NeurIPS 2023 D&B)
abachaa / VQA Med 2019 - Visual Question Answering in the Medical Domain VQA-Med 2019
chakravarthi589 / Video Question Answering Resources - Video Question Answering | Video QA | VQA
multimodal / Multimodal - A collection of multimodal datasets and visual features for VQA and captioning in PyTorch. Just run "pip install multimodal"
Cloud-CV / VQA - CloudCV Visual Question Answering Demo
sutdcv / SUTD TrafficQA - [CVPR 2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
facebookresearch / CausalVQA - We introduce CausalVQA, a benchmark dataset for video question answering (VQA) composed of question-answer pairs that probe models’ understanding of causality in the physical world.
CVI-SZU / FaceBench - [CVPR 2025] FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
CAMMA-public / SSG VQA - [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge
findalexli / SciGraphQA - SciGraphQA: Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs
cdancette / Vqa Cp Leaderboard - A collection of papers about VQA-CP datasets and their results
CCYChongyanChen / VQA AlgorithmDatasets - No description available
vzhou842 / Easy VQA - The Easy Visual Question Answering dataset.
fraction-ai / GAP - Gamified Adversarial Prompting (GAP): Crowdsourcing AI-weakness-targeting data through gamification. Boost model performance with community-driven, strategic data collection.