SyntheticDebiasing
[COLM2024] ChatGPT Based Data Augmentation For Improved Parameter-Efficient Debiasing of LLMs
Install / Use
/learn @Pengrui-Han/SyntheticDebiasingREADME
ChatGPT Based Data Augmentation For Improved Parameter-Efficient Debiasing of LLMs
This official repository holds code for the paper "ChatGPT Based Data Augmentation For Improved Parameter-Efficient Debiasing of LLMs". Our Paper is accepted at COLM 2024. We open source all code and results here under a permissive MIT license, to encourage reproduction and further research exploration.
<img width="1366" alt="Framework" src="https://github.com/user-attachments/assets/dd16c454-5ace-40c0-8075-ab3abf33e684"> <hr>🗞️ Paper
We propose a light and efficient pipeline that enables both domain and non-domain experts to quickly generate synthetic debiasing data to mitigate specific or general bias in their models with parameter-efficient fine-tuning. <img width="887" alt="WordCloud" src="https://github.com/user-attachments/assets/6fb2c57d-23e3-4444-8d5a-d6a159deee5b">
Prompting Examples:
API Based Generation: 'Example_data_generation.ipynb'
UI Based Generation:
- General Prompting:
- Targeted Prompting:
Data Format:
Example data of Targeted and General Prompting, including sentence, subject word, and attribute word for each example(The '[S,T,A]' format).
<table> <thead> <tr> <th>De-biasing Strategy</th> <th>Bias Type</th> <th>Sentence</th> <th>Subject Word</th> <th>Attribute Word</th> </tr> </thead> <tbody> <tr> <td rowspan="3">Targeted Prompting</td> <td>Gender</td> <td>"Love the Godfather not for his power, but for his genuine kindness."</td> <td>"Godfather"</td> <td>"Kindness"</td> </tr> <tr> <td>Race</td> <td>"The Scottish community has been a beacon of innovation in various fields."</td> <td>"Scottish"</td> <td>"Innovation"</td> </tr> <tr> <td>Religion</td> <td>"Many students find the Quran to be enlightening."</td> <td>"Quran"</td> <td>"Enlightening"</td> </tr> <tr> <td rowspan="2">General Prompting</td> <td rowspan="2">General</td> <td>"Just because she's a cheerleader doesn't mean she isn't the top student in her calculus class."</td> <td>"Cheerleader"</td> <td>"Calculus"</td> </tr> <tr> <td>"She found that the skateboarder was also a connoisseur of classical music."</td> <td>"Skateboarder"</td> <td>"Classical"</td> </tr> </tbody> </table> <hr>📁 Main Files
Synthetic Data
- General Debiasing Data:
Data/General_Debiasing - Targeted Debiasing Data:
- Gender:
Data/Targeted_Debiasing/Gender - Racial:
Data/Targeted_Debiasing/Racial - Religion:
Data/Targeted_Debiasing/Religon
- Gender:
Colab Notebooks
- Example_data_generation.ipynb:
- This is an example of the code and prompt used for synthetic data generation.
- Adapter-BERT.ipynb:
- This is the interactive Colab notebook that trains and evaluates the BERT model.
- An example of getting high and low loss data as examples for guiding in-distribution generation (loss-guided prompting) is also included.
Notebooks/Adapter_BERT.ipynb
- Adapter-GPT2.ipynb:
- This is the interactive Colab notebook that trains and evaluates the BERT model.
Notebooks/Adapter_GPT2.ipynb- Note: you can easily replace GPT2 with other auto-regressive models.
Evaluation
- Code: Contains default score evaluators that are imported by the notebooks for evaluation.
- Eval_data: Contains data from StereoSet, CrowSPairs, and BiasTestGPT for evaluation.
📧 Get In Touch
- To report a potential problem, please open an issue. In the issue, please include the exact steps to reproduce the error, and complete logs. Our team is willing to help.
📝 Citation
If you find our work useful, please kindly cite our paper.
@article{han2024chatgpt,
title={ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs},
author={Han, Pengrui and Kocielnik, Rafal and Saravanan, Adhithya and Jiang, Roy and Sharir, Or and Anandkumar, Anima},
journal={arXiv preprint arXiv:2402.11764},
year={2024}
}
The current repository is not fully updated at the moment. Our team is actively engaged in the process of updating it to include all the latest code. We aim to provide a comprehensive and up-to-date resource as soon as possible.
Related Skills
node-connect
343.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
92.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
343.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
343.3kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
