SkillAgentSearch skills...

PEIFG

[ACM MM 2024] The released code of paper "Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor"

Install / Use

/learn @Gary-code/PEIFG
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

<b>Learning to Correction</b>: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor

Jiali Chen<sup>1</sup>, Xusen Hei<sup>1</sup>, Yuqi Xue<sup>1</sup>, Yuancheng Wei<sup>1</sup>, Jiayuan Xie<sup>2</sup>, Yi Cai<sup>*,1</sup> Qing Li<sup>2</sup>

<p><sup>1</sup>South China University of Technology &nbsp;&nbsp;<sup>2</sup>The Hong Kong Polytechnic University &nbsp;&nbsp; <br><sup>*</sup>Corresponding author &nbsp;&nbsp; <h5 align="center">

arXiv

</h5>

figure1

Overview of Pedagogical Expert Instructed Feedback Generation (PEIFG) model

:building_construction: Run PEIFG

Installation

  1. Clone this repository and navigate to the PEIFG folder
gt clone https://github.com/Gary-code/PEIFG.git
cd PEIFG
  1. Install Package
  • Install Anaconda or Miniconda distribution based on Python3.10 from their downloads' site.
  • Main packages: PyTorch = 1.13, transformers = 4.43
  1. Download model weights:

:rocket: Instructblip (Q-Former), Qwen1.8B, OPT-350M

Data Preparation

Our VCR-DF dataset reuse the images from the official VCR dataset. You can download in VCR.

The annotation of our VCR-DF dataset is in ./vcr_df_dataset folder.

Train

cd train
bash lora_train_feedback.sh

📑 Citation

@inproceedings{peifg,
  author       = {Jiali Chen and
                  Xusen Hei and
                  Yuqi Xue and
                  Yuancheng Wei and
                  Jiayuan Xie and
                  Yi Cai and
                  Qing Li},
  editor       = {Jianfei Cai and
                  Mohan S. Kankanhalli and
                  Balakrishnan Prabhakaran and
                  Susanne Boll and
                  Ramanathan Subramanian and
                  Liang Zheng and
                  Vivek K. Singh and
                  Pablo C{\'{e}}sar and
                  Lexing Xie and
                  Dong Xu},
  title        = {Learning to Correction: Explainable Feedback Generation for Visual
                  Commonsense Reasoning Distractor},
  booktitle    = {Proceedings of the 32nd {ACM} International Conference on Multimedia,
                  {MM} 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November
                  2024},
  pages        = {8209--8218},
  publisher    = {{ACM}},
  year         = {2024},
}

Related Skills

View on GitHub
GitHub Stars8
CategoryEducation
Updated9mo ago
Forks0

Languages

Python

Security Score

62/100

Audited on Jun 28, 2025

No findings