PEIFG
[ACM MM 2024] The released code of paper "Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor"
Install / Use
/learn @Gary-code/PEIFGREADME
<b>Learning to Correction</b>: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor
Jiali Chen<sup>1</sup>, Xusen Hei<sup>1</sup>, Yuqi Xue<sup>1</sup>, Yuancheng Wei<sup>1</sup>, Jiayuan Xie<sup>2</sup>, Yi Cai<sup>*,1</sup> Qing Li<sup>2</sup>
<p><sup>1</sup>South China University of Technology <sup>2</sup>The Hong Kong Polytechnic University <br><sup>*</sup>Corresponding author <h5 align="center"> </h5>
Overview of Pedagogical Expert Instructed Feedback Generation (PEIFG) model
:building_construction: Run PEIFG
Installation
- Clone this repository and navigate to the PEIFG folder
gt clone https://github.com/Gary-code/PEIFG.git
cd PEIFG
- Install Package
- Install Anaconda or Miniconda distribution based on Python3.10 from their downloads' site.
- Main packages: PyTorch = 1.13, transformers = 4.43
- Download model weights:
:rocket: Instructblip (Q-Former), Qwen1.8B, OPT-350M
Data Preparation
Our VCR-DF dataset reuse the images from the official VCR dataset. You can download in VCR.
The annotation of our VCR-DF dataset is in ./vcr_df_dataset folder.
Train
cd train
bash lora_train_feedback.sh
📑 Citation
@inproceedings{peifg,
author = {Jiali Chen and
Xusen Hei and
Yuqi Xue and
Yuancheng Wei and
Jiayuan Xie and
Yi Cai and
Qing Li},
editor = {Jianfei Cai and
Mohan S. Kankanhalli and
Balakrishnan Prabhakaran and
Susanne Boll and
Ramanathan Subramanian and
Liang Zheng and
Vivek K. Singh and
Pablo C{\'{e}}sar and
Lexing Xie and
Dong Xu},
title = {Learning to Correction: Explainable Feedback Generation for Visual
Commonsense Reasoning Distractor},
booktitle = {Proceedings of the 32nd {ACM} International Conference on Multimedia,
{MM} 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November
2024},
pages = {8209--8218},
publisher = {{ACM}},
year = {2024},
}
Related Skills
proje
Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
API
A learning and reflection platform designed to cultivate clarity, resilience, and antifragile thinking in an uncertain world.
openclaw-plugin-loom
Loom Learning Graph Skill This skill guides agents on how to use the Loom plugin to build and expand a learning graph over time. Purpose - Help users navigate learning paths (e.g., Nix, German)
