ConfPO

[ICML'25] Official code for "ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization"

Generate Convert Improve

Install / Use

/learn @hee-suk-yoon/ConfPO

About this skill

Quality Score

0/100

README

ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization (ICML 2025)

This repository provides the official implementation of our ICML 2025 paper:

ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization
Authors: Hee Suk Yoon, Eunseop Yoon, Mark Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo

[Paper Link]

Installation

# Clone this repo
git clone https://github.com/hee-suk-yoon/ConfPO
cd ConfPo

# Create a conda enviroment
1. conda env create --name confpo python=3.11
2. conda activate confpo
3. pip install torch==2.4.0 torchvision==0.19.0 torchaudio==2.4.0 --index-url https://download.pytorch.org/whl/cu121
4. conda env update --file environment.yml --prune

Running Experiments

SimPO (Baseline)

cd trl-main
bash commands/run_simpo.sh

ConfPO (Ours)

cd trl-main
bash commands/run_confpo.sh

Acknowledgement

This work was supported by Artificial intelligence industrial convergence cluster development project funded by the Ministry of Science and ICT(MSIT, Korea)&Gwangju Metropolitan City, Institute for Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No.RS-2021-II211381, Development of Causal AI through Video Understanding and Reinforcement Learning, and Its Applications to Real Environments), and Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No.RS-2022-II220184, Development and Study of AI Technologies to Inexpensively Conform to Evolving Policy on Ethics).

Also, we thank the authors of the SimPO for their open-source contributions.

Citation

If you find our work useful in your research, please cite:

@inproceedings{
yoon2025confpo,
title={Conf{PO}: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization},
author={Hee Suk Yoon and Eunseop Yoon and Mark A. Hasegawa-Johnson and Sungwoong Kim and Chang D. Yoo},
booktitle={Forty-second International Conference on Machine Learning},
year={2025},
url={https://openreview.net/forum?id=ZG7bkp6ScT}
}

Contact

If you have any questions, please feel free to email hskyoon@kaist.ac.kr

Related Skills

node-connect

349.2k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

109.5k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

349.2k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

349.2k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。