MiCEval

An automatic evaluation framework for Multimodal Chain-of-Thought.

MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning

The MiCEval dataset is stored in dataset/MiCEval_hard.jsonl and dataset/MiCEval_Normal.jsonl.

The images of the MiCEval dataset are stored in the dataset/image folder.
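The two dataset files can be inspected with a few lines of Python. This is a minimal sketch that assumes only that each file follows the JSON Lines convention (one JSON value per line); the README does not document the record schema, so the helper names `load_jsonl` and `describe` are hypothetical and the code reports whatever field names it finds rather than assuming any.

```python
import json
from pathlib import Path


def load_jsonl(path):
    """Read a JSON Lines file into a list, one parsed value per non-empty line."""
    records = []
    with Path(path).open(encoding="utf-8") as f:
        for line_number, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError as err:
                raise ValueError(f"{path}:{line_number}: invalid JSON") from err
    return records


def describe(path):
    """Return (record count, sorted field names of the first record).

    The field list is empty if the file has no records or the first
    record is not a JSON object; no particular schema is assumed.
    """
    records = load_jsonl(path)
    first = records[0] if records else None
    fields = sorted(first.keys()) if isinstance(first, dict) else []
    return len(records), fields
```

Run from the repository root, `describe("dataset/MiCEval_hard.jsonl")` and `describe("dataset/MiCEval_Normal.jsonl")` would show how many records each split holds and which fields they carry, including whichever field points into dataset/image.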

To reproduce the results of the verifier experiment, please run scripts/verifier_zeroshot.sh and scripts/verifier_fewshot.sh.
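If it is more convenient to drive both verifier runs from Python, the sketch below runs the two scripts in order. It assumes the scripts are plain shell scripts that can be started with `bash` from the repository root and take no arguments; the README does not say whether they need extra arguments, model weights, or API keys, so check the scripts themselves first. The names `build_command` and `run_verifier_scripts` are hypothetical.

```python
import subprocess
from pathlib import Path

# Script paths as listed in the README, relative to the repository root.
VERIFIER_SCRIPTS = [
    "scripts/verifier_zeroshot.sh",
    "scripts/verifier_fewshot.sh",
]


def build_command(script):
    """Return the argv used to run one shell script (assumption: via bash)."""
    return ["bash", script]


def run_verifier_scripts(repo_root="."):
    """Run each verifier script in order, stopping at the first failure."""
    root = Path(repo_root)
    for script in VERIFIER_SCRIPTS:
        if not (root / script).is_file():
            raise FileNotFoundError(f"missing script: {root / script}")
        # check=True raises CalledProcessError on a non-zero exit status.
        subprocess.run(build_command(script), cwd=root, check=True)
```

Calling `run_verifier_scripts()` from the repository root runs the zero-shot script first and then the few-shot script.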

The code for the evaluator experiment will be updated soon.
