GlitchMiner
[AAAI 2026] Code of the paper "GlitchMiner: Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization"
GlitchMiner: Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization
Update
- 2025/11/8 🎉🎉🎉 Our paper has been accepted by AAAI 2026!
- 2024/11/7 We added reproduction code for the two baseline methods used in the paper.
Read our paper for detailed insights.
🔍 Introduction
GlitchMiner is a framework for detecting glitch tokens: tokens that cause unexpected behavior in large language models (LLMs). These anomalies can severely degrade model outputs, particularly in sensitive applications such as healthcare or finance. GlitchMiner uses gradient-based discrete optimization to identify glitch tokens effectively.
<img width="1018" height="525" alt="image" src="https://github.com/user-attachments/assets/35dde1fa-c932-45b6-8b65-ce0fbdc092cc" />
- Left: The pipeline.
- Right: Visualization of GlitchMiner's local search strategy.
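To give a feel for what "gradient-based discrete optimization" with a local search can look like, here is a minimal, generic sketch in plain Python. It is an illustration under assumptions, not GlitchMiner's implementation: the toy objective, the two-dimensional embedding table, and the function names `taylor_scores` and `best_candidate` are all hypothetical. The idea is to use the gradient at the current token's embedding to estimate, through a first-order Taylor expansion, how the objective would change for each candidate token, so that only the most promising candidates need a real forward pass.

```python
# Generic sketch of gradient-guided discrete search.
# All names and data here are hypothetical; this is not GlitchMiner's code.

def objective(vec):
    """Toy differentiable objective (squared norm), standing in for a model loss."""
    return sum(x * x for x in vec)


def gradient(vec):
    """Analytic gradient of the toy objective above."""
    return [2.0 * x for x in vec]


def taylor_scores(current, candidates):
    """Estimate objective(candidate) as f(current) + grad . (candidate - current)."""
    base = objective(current)
    grad = gradient(current)
    scores = {}
    for token_id, emb in candidates.items():
        delta = [c - e for c, e in zip(emb, current)]
        scores[token_id] = base + sum(g * d for g, d in zip(grad, delta))
    return scores


def best_candidate(current, candidates):
    """Return the candidate token ID with the highest estimated objective."""
    scores = taylor_scores(current, candidates)
    return max(scores, key=scores.get)


# Toy embedding table: token ID -> embedding vector.
embeddings = {
    0: [1.0, 0.0],
    1: [1.5, 0.5],
    2: [-1.0, 0.0],
    3: [2.0, 1.0],
}
current_id = 0
candidates = {t: e for t, e in embeddings.items() if t != current_id}

estimated = taylor_scores(embeddings[current_id], candidates)
chosen = best_candidate(embeddings[current_id], candidates)
print(estimated, chosen)
```

The estimate is only a linear approximation, so a real search would re-evaluate the selected candidates with the true objective before accepting them.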
🛠️ Getting Started
Install GlitchMiner with pip:
pip install git+https://github.com/wooozihui/GlitchMiner.git
Usage Example
from transformers import AutoTokenizer, AutoModelForCausalLM
from glitchminer import GlitchMiner
import torch
if __name__ == "__main__":
    model_path = "Qwen/Qwen2.5-7B-Instruct"
    tokenizer = AutoTokenizer.from_pretrained(model_path)
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        device_map="cuda",
        torch_dtype=torch.bfloat16,
    )
    # Run GlitchMiner for glitch token detection
    glitch_tokens, glitch_token_ids = GlitchMiner(
        model,
        tokenizer,
        num_iterations=125,
        batch_size=8,
        k=32,
        if_print=True,
        print_language="CN",
    )
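Since a mining run can take a while, it may be convenient to persist the results. The sketch below is one possible way to do that with the standard library only. It assumes that `glitch_tokens` is a list of token strings and `glitch_token_ids` is a parallel list of integer IDs, which the variable names suggest but this README does not spell out. The helper `save_glitch_tokens` and the placeholder values are hypothetical.

```python
import json
import os
import tempfile


def save_glitch_tokens(tokens, token_ids, path):
    """Write (id, token) pairs to a JSON file and return the saved records."""
    if len(tokens) != len(token_ids):
        raise ValueError("tokens and token_ids must have the same length")
    records = [{"id": int(i), "token": str(t)} for t, i in zip(tokens, token_ids)]
    with open(path, "w", encoding="utf-8") as f:
        # ensure_ascii=False keeps non-ASCII token text readable in the file.
        json.dump(records, f, ensure_ascii=False, indent=2)
    return records


# Placeholder values standing in for the output of a real GlitchMiner run.
example_tokens = ["<tok_a>", "<tok_b>"]
example_ids = [101, 202]

out_path = os.path.join(tempfile.gettempdir(), "glitch_tokens.json")
saved = save_glitch_tokens(example_tokens, example_ids, out_path)
print(f"Saved {len(saved)} records to {out_path}")
```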
Strict Glitch Token Verification
To filter out false positives, we recommend re-checking the detected tokens with the strictly_glitch_verification function.
from glitchminer import strictly_glitch_verification
glitch_count, verified_glitch_ids = strictly_glitch_verification(model, tokenizer, glitch_token_ids)
print(glitch_count)
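To see how many candidates survive the strict check, a small summary helper can be useful. The sketch below assumes that `verified_glitch_ids` is a subset of the candidate `glitch_token_ids`; that is an assumption based on the names, not something this README states. The helper `verification_summary` and the placeholder IDs are hypothetical.

```python
def verification_summary(candidate_ids, verified_ids):
    """Split candidates into confirmed and rejected IDs and report the confirmation rate."""
    candidates = list(dict.fromkeys(candidate_ids))  # de-duplicate, keep order
    verified = set(verified_ids)
    confirmed = [i for i in candidates if i in verified]
    rejected = [i for i in candidates if i not in verified]
    rate = len(confirmed) / len(candidates) if candidates else 0.0
    return {"confirmed": confirmed, "rejected": rejected, "rate": rate}


# Placeholder IDs standing in for real mining and verification results.
summary = verification_summary([5, 9, 12, 40], [5, 12, 40])
print(summary)
```

With real results, the call would be `verification_summary(glitch_token_ids, verified_glitch_ids)`.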
⚙️ GlitchMiner Parameters
Here are the configurable parameters for GlitchMiner, with explanations of their purpose and usage:
| Parameter | Type | Default Value | Description |
|-------------------|----------|---------------|---------------------------------------------------------------------------------------------------|
| model | Model | Required | A Hugging Face AutoModelForCausalLM model used for glitch token detection. |
| tokenizer | Tokenizer | Required | A Hugging Face AutoTokenizer for encoding and decoding tokens. |
| num_iterations | int | 125 | The number of iterations to run the glitch token search. |
| batch_size | int | 8 | Number of tokens processed per batch during the search process. |
| k | int | 32 | Number of top similar tokens to evaluate during each iteration using cosine similarity. |
| if_print | bool | True | If True, prints detailed progress and results during execution. |
| print_language | str | "CN" | Output language for printed messages. Supports "CN" for Chinese and "ENG" for English. |
| skip_tokens | list | None | Optional list of token IDs to exclude from the glitch detection process. |
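The roles of `k` and `skip_tokens` can be illustrated with a small, self-contained sketch of top-k neighbor selection by cosine similarity. This is a plain-Python toy under assumptions, not the library's code: the helpers `cosine` and `top_k_similar` and the two-dimensional embedding table are hypothetical, and the real search operates on the model's embedding matrix.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_similar(query_id, embeddings, k, skip_tokens=None):
    """Return the k token IDs most similar to query_id, excluding skipped IDs."""
    skip = set(skip_tokens or [])
    skip.add(query_id)  # a token is never its own neighbor
    scored = [
        (cosine(embeddings[query_id], emb), token_id)
        for token_id, emb in embeddings.items()
        if token_id not in skip
    ]
    # Highest similarity first; break ties by the smaller token ID.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [token_id for _, token_id in scored[:k]]


# Toy embedding table: token ID -> embedding vector.
toy_embeddings = {
    0: [1.0, 0.0],
    1: [0.9, 0.1],
    2: [0.0, 1.0],
    3: [-1.0, 0.0],
    4: [0.7, 0.7],
}

neighbors = top_k_similar(0, toy_embeddings, k=2)
neighbors_skip = top_k_similar(0, toy_embeddings, k=2, skip_tokens=[1])
print(neighbors, neighbors_skip)
```

A larger `k` widens the neighborhood explored per iteration at the cost of more evaluations, while `skip_tokens` removes IDs from consideration entirely.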
🌟 Citing
If you find GlitchMiner helpful in your research, please consider giving us a star or citing our paper:
@misc{wu2024glitchminerminingglitchtokens,
title={GlitchMiner: Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization},
author={Zihui Wu and Haichang Gao and Ping Wang and Shudong Zhang and Zhaoxiang Liu and Shiguo Lian},
year={2024},
eprint={2410.15052},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2410.15052},
}
