SyntheticDebiasing

[COLM2024] ChatGPT Based Data Augmentation For Improved Parameter-Efficient Debiasing of LLMs

Generate Convert Improve

Install / Use

/learn @Pengrui-Han/SyntheticDebiasing

About this skill

Quality Score

0/100

README

ChatGPT Based Data Augmentation For Improved Parameter-Efficient Debiasing of LLMs

This official repository holds code for the paper "ChatGPT Based Data Augmentation For Improved Parameter-Efficient Debiasing of LLMs". Our Paper is accepted at COLM 2024. We open source all code and results here under a permissive MIT license, to encourage reproduction and further research exploration.

🗞️ Paper

We propose a light and efficient pipeline that enables both domain and non-domain experts to quickly generate synthetic debiasing data to mitigate specific or general bias in their models with parameter-efficient fine-tuning. <img width="887" alt="WordCloud" src="https://github.com/user-attachments/assets/6fb2c57d-23e3-4444-8d5a-d6a159deee5b">

Prompting Examples:

API Based Generation: 'Example_data_generation.ipynb'

UI Based Generation:

General Prompting:
- Sentence Generation
Targeted Prompting:
- Term Generation
- Sentence Generation

Data Format:

Example data of Targeted and General Prompting, including sentence, subject word, and attribute word for each example(The '[S,T,A]' format).

<table> <thead> <tr> <th>De-biasing Strategy</th> <th>Bias Type</th> <th>Sentence</th> <th>Subject Word</th> <th>Attribute Word</th> </tr> </thead> <tbody> <tr> <td rowspan="3">Targeted Prompting</td> <td>Gender</td> <td>"Love the Godfather not for his power, but for his genuine kindness."</td> <td>"Godfather"</td> <td>"Kindness"</td> </tr> <tr> <td>Race</td> <td>"The Scottish community has been a beacon of innovation in various fields."</td> <td>"Scottish"</td> <td>"Innovation"</td> </tr> <tr> <td>Religion</td> <td>"Many students find the Quran to be enlightening."</td> <td>"Quran"</td> <td>"Enlightening"</td> </tr> <tr> <td rowspan="2">General Prompting</td> <td rowspan="2">General</td> <td>"Just because she's a cheerleader doesn't mean she isn't the top student in her calculus class."</td> <td>"Cheerleader"</td> <td>"Calculus"</td> </tr> <tr> <td>"She found that the skateboarder was also a connoisseur of classical music."</td> <td>"Skateboarder"</td> <td>"Classical"</td> </tr> </tbody> </table> <hr>

📁 Main Files

Synthetic Data

General Debiasing Data: Data/General_Debiasing
Targeted Debiasing Data:
- Gender: Data/Targeted_Debiasing/Gender
- Racial: Data/Targeted_Debiasing/Racial
- Religion: Data/Targeted_Debiasing/Religon

Colab Notebooks

Example_data_generation.ipynb:
- This is an example of the code and prompt used for synthetic data generation.
Adapter-BERT.ipynb:
- This is the interactive Colab notebook that trains and evaluates the BERT model.
- An example of getting high and low loss data as examples for guiding in-distribution generation (loss-guided prompting) is also included.
- Notebooks/Adapter_BERT.ipynb
Adapter-GPT2.ipynb:
- This is the interactive Colab notebook that trains and evaluates the BERT model.
- Notebooks/Adapter_GPT2.ipynb
- Note: you can easily replace GPT2 with other auto-regressive models.

Evaluation

Code: Contains default score evaluators that are imported by the notebooks for evaluation.
Eval_data: Contains data from StereoSet, CrowSPairs, and BiasTestGPT for evaluation.

<hr>

📧 Get In Touch

To report a potential problem, please open an issue. In the issue, please include the exact steps to reproduce the error, and complete logs. Our team is willing to help.

<hr>

📝 Citation

If you find our work useful, please kindly cite our paper.

@article{han2024chatgpt,
  title={ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs},
  author={Han, Pengrui and Kocielnik, Rafal and Saravanan, Adhithya and Jiang, Roy and Sharir, Or and Anandkumar, Anima},
  journal={arXiv preprint arXiv:2402.11764},
  year={2024}
}

The current repository is not fully updated at the moment. Our team is actively engaged in the process of updating it to include all the latest code. We aim to provide a comprehensive and up-to-date resource as soon as possible.

Related Skills

node-connect

343.3k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

92.1k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

343.3k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

343.3k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。