LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation
Juzheng Zhang, Jiacheng You, Ashwinee Panda, Tom Goldstein
📄 Paper | 💻 Code | 🤗 HuggingFace | COLM 2025
LoRI (LoRA with Reduced Interference) is a simple yet effective variant of LoRA for fine-tuning LLMs. It freezes the projection matrices A as random projections and sparsifies B using task-specific masks. LoRI significantly reduces the number of trainable parameters, preserves single-task performance, and minimizes cross-task interference during adapter merging and continual learning.
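The core idea can be sketched in a few lines of NumPy (an illustrative toy, not the repository's implementation; the dimensions and the ~10% mask density are placeholder choices):

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r = 16, 16, 4

# A is frozen as a random projection and never trained
A = rng.standard_normal((r, d_in)) / np.sqrt(d_in)

# B holds the trained values; the task-specific mask keeps only ~10% of
# its entries active (90% sparsity), so gradients flow to few parameters
B = rng.standard_normal((d_out, r))
mask = rng.random((d_out, r)) < 0.1

# Forward pass contribution: h = W0 @ x + (B * mask) @ A @ x
x = rng.standard_normal(d_in)
delta = (B * mask) @ (A @ x)
```

Because each task trains only the masked entries of B against a fixed random A, different tasks' updates mostly occupy disjoint coordinates, which is what limits interference when adapters are later merged.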
Installation
Create and activate a Conda environment:
conda create -n lori python=3.10 -y
conda activate lori
Clone the repository and install dependencies:
git clone https://github.com/juzhengz/LoRI.git
cd LoRI
pip install -r requirements.txt
Training from Scratch
LoRI is implemented with Fully Sharded Data Parallel (FSDP) and can be run in a multi-GPU environment. We provide training scripts covering natural language understanding (NLU), code generation, mathematical reasoning, and safety alignment. These scripts support the LLaMA-3-8B and Mistral-7B base models with adapter ranks of 32 and 64. Each script performs LoRI-D training, extracts sparse masks, continues with LoRI-S training at 90% sparsity, and evaluates on downstream tasks.
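The mask-extraction step between LoRI-D and LoRI-S can be illustrated as magnitude-based top-k selection over the trained B matrix (a hedged sketch; the repository's exact selection criterion may differ):

```python
import numpy as np

def extract_mask(B, sparsity=0.9):
    """Keep the largest-magnitude entries of B; drop `sparsity` of them."""
    flat = np.abs(B).ravel()
    k = int(round(flat.size * sparsity))  # number of entries to zero out
    if k == 0:
        return np.ones_like(B, dtype=bool)
    threshold = np.partition(flat, k - 1)[k - 1]  # k-th smallest magnitude
    return np.abs(B) > threshold

# Toy example: a 10x10 matrix with distinct magnitudes
B = np.arange(100, dtype=float).reshape(10, 10)
mask = extract_mask(B, sparsity=0.9)  # keeps the 10 largest entries
```

LoRI-S then retrains only the entries of B selected by this mask, with A still frozen.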
Example script
Training code generation adapters LoRI-D and LoRI-S on the CodeAlpaca dataset using LLaMA-3-8B with rank 32:
bash scripts/codealpaca_llama3_r_32.sh
Available scripts
scripts/codealpaca_*.sh — Code generation tasks
scripts/gsm8k_*.sh — Mathematical reasoning tasks
scripts/nlu_*.sh — Natural language understanding tasks
scripts/saferpaca_*.sh — Safety alignment tasks
Inference with Pretrained Adapters
Pretrained LoRI adapters are available via our HuggingFace collection and can be loaded as follows:
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load the base model, then attach the pretrained LoRI adapter on top of it
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
adapter = PeftModel.from_pretrained(base_model, "tomg-group-umd/LoRI-S_code_llama3_rank_32")
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B")
LoRI-D and LoRI-S adapters are provided for code, math, NLU, and safety tasks, using LLaMA-3-8B and Mistral-7B models at ranks 32 and 64.
Adapter Merging
Merging scripts are provided in the repository. Update the adapter paths in the scripts to point to either your own trained adapters or those available on HuggingFace.
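As a toy illustration of why sparse B matrices help at merge time, the merged update is a weighted sum of per-task LoRI updates; with highly sparse masks, the tasks' trained entries of B rarely collide (hypothetical dimensions and weights; this is not the repository's merging script):

```python
import numpy as np

d_out, d_in, r = 12, 12, 3

def lori_update(seed, density=0.1):
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((r, d_in)) / np.sqrt(d_in)  # frozen random projection
    B = rng.standard_normal((d_out, r))                 # trained values
    M = rng.random((d_out, r)) < density                # task-specific sparse mask
    return (B * M) @ A

# Merge two task adapters by weighted summation of their weight updates
delta_merged = 0.5 * lori_update(seed=1) + 0.5 * lori_update(seed=2)
```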
Continual Learning
Use these scripts to perform continual learning with LoRI:
For LoRI-D:
Before running the scripts, set model_archive to the path of your trained LoRI-D safety adapter, or use the safety adapter from our HuggingFace collection.
model_archive=/path/to/your/lori-d/safety/adapter
For LoRI-S:
Before running the scripts, set model_archive to the path of your LoRI-S safety adapter and set mask_path to the path of the sparse mask for the downstream task. Alternatively, you can use the safety adapter and the corresponding sparse mask from our HuggingFace collection.
model_archive=/path/to/your/lori-s/safety/adapter
mask_path=/path/to/your/lori-d/code/adapter/masks/0.9_mask.pt
Customizing Base Models and Losses
LoRI supports a variety of base models and loss functions, which can be found in the config/model and config/loss directories of the repository. To add a new model or loss function, you can simply create a new .yaml file in the respective directory.
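A new model config might look like the following (field names here are illustrative placeholders, not the repository's actual schema; mirror an existing file in config/model when adding your own):

```yaml
# config/model/my_model.yaml -- hypothetical example
base_model: mistralai/Mistral-7B-v0.1
lora_rank: 32
```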
Acknowledgements
This project builds on the codebase of dpo-rlaif and incorporates code from lottery-ticket-adaptation. We evaluate code generation performance on HumanEval using the bigcode-evaluation-harness.
Citation
If you use LoRI in your work, please cite:
@article{zhang2025lori,
title={LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation},
author={Zhang, Juzheng and You, Jiacheng and Panda, Ashwinee and Goldstein, Tom},
journal={arXiv preprint arXiv:2504.07448},
year={2025}
}
Feel free to reach out if you have any questions!