# TreeLoRA
This repository contains the official PyTorch implementation of TreeLoRA, an efficient continual learning method for Large Language Models (LLMs) that uses layer-wise LoRA adapters guided by a hierarchical gradient-similarity tree.
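To make the core idea concrete, here is a toy, hypothetical sketch (this is *not* the repository's `utils/kd_lora_tree.py`): past tasks' gradient vectors are organized in a KD-tree, and a new task queries the tree for its most gradient-similar predecessor, whose LoRA adapter can then guide learning. The names `build`, `most_similar`, and `cosine` are invented for illustration; for clarity this sketch scans the whole tree exhaustively, whereas the paper's method exploits the hierarchy with bandit-based selection to avoid exact pairwise comparison.

```python
# Hypothetical sketch of a gradient-similarity tree (illustration only,
# not the repository's actual kd_lora_tree.py implementation).
import math

def cosine(u, v):
    """Cosine similarity between two gradient vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

class KDNode:
    def __init__(self, task_id, grad, left=None, right=None, axis=0):
        self.task_id, self.grad = task_id, grad
        self.left, self.right, self.axis = left, right, axis

def build(tasks, depth=0):
    """tasks: list of (task_id, gradient_vector). Classic KD-tree build:
    split on one coordinate per depth level, median task at the node."""
    if not tasks:
        return None
    axis = depth % len(tasks[0][1])
    tasks = sorted(tasks, key=lambda t: t[1][axis])
    mid = len(tasks) // 2
    tid, grad = tasks[mid]
    return KDNode(tid, grad,
                  build(tasks[:mid], depth + 1),
                  build(tasks[mid + 1:], depth + 1),
                  axis)

def most_similar(node, query, best=None):
    """Return (similarity, task_id) of the stored gradient closest to
    `query` by cosine similarity (exhaustive traversal, exact)."""
    if node is None:
        return best
    cand = (cosine(query, node.grad), node.task_id)
    if best is None or cand[0] > best[0]:
        best = cand
    best = most_similar(node.left, query, best)
    return most_similar(node.right, query, best)
```

For example, querying a tree built from three past tasks with a new task's gradient returns the task whose training gradients pointed in the most similar direction.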
## Code Structure
```
.
├── data/                   # Data directory for LLM-CL-Benchmark
├── model/                  # Model implementations
│   ├── Regular/            # Regular model implementations
│   │   └── Tree_LoRA.py    # TreeLoRA implementation
│   ├── Dynamic_network/    # Dynamic network implementations
│   └── Replay/             # Replay-based methods
├── training/               # Training-related code
│   ├── main.py             # Main training script
│   └── params.py           # Training parameters
├── utils/                  # Utility functions
│   ├── data/               # Data processing utilities
│   ├── flash_attention/    # Flash attention implementation
│   ├── my_peft/            # Custom PEFT implementations
│   └── kd_lora_tree.py     # KD-tree implementation for TreeLoRA
├── inference/              # Inference-related code
└── scripts/                # Training and evaluation scripts
```
## Requirements

The main dependencies are listed below. For the complete list, see `requirements.txt`:

```
accelerate==1.0.1
bitsandbytes==0.46.1
deepspeed==0.15.3+cu124torch2.4
torch==2.4.1
torchvision==0.19.1
```
## Quick Start

### 1. Installation

```bash
# Install dependencies
pip install -r requirements.txt
```
### 2. Data and Model Preparation

- Extract the dataset into the `data/LLM-CL-Benchmark` directory. Our benchmark includes 24 different tasks, a mix of TRACE-LLM and the datasets used in O-LoRA. Specifically, the tasks are:

  | C-STANCE | NumGLUE-cm | QQP |
  | :--- | :---: | :---: |
  | NumGLUE-ds | MultiRC | RTE |
  | yelp | ScienceQA | amazon |
  | MeetingBank | FOMC | Lima |
  | BoolQA | CB | Py150 |
  | dbpedia | WiC | yahoo |
  | IMDB | MNLI | 20Minuten |
  | agnews | COPA | SST-2 |

- Download the pre-trained model from HuggingFace and place it in the `./PTM/` directory, e.g., for Llama-3.2-1B-Instruct:

  ```bash
  cd ./PTM
  git clone https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct
  ```
### 3. Training and Evaluation

To train and evaluate a method on the TRACE dataset, run:

```bash
export model_name="Llama-3.2-1B-Instruct"

# Run training script with default parameters (e.g., TreeLoRA)
bash scripts/lora_based_methods/Tree_LoRA.sh
```
Key parameters in the training script:

- `--model_name_or_path`: Path to the pretrained model
- `--data_path`: Path to the training dataset
- `--dataset_name`: Names of the datasets to train on
- `--reg`: Regularization parameter (default: 0.5)
- `--num_train_epochs`: Number of training epochs per task
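As a rough illustration of how these flags fit together, the sketch below builds an `argparse` parser covering only the parameters listed above; the repository's `training/params.py` is the authoritative definition, and the defaults here (other than `--reg`) are assumptions, not the actual values.

```python
# Hypothetical parser covering only the flags listed above; the real
# training/params.py defines many more options and may differ.
import argparse

def make_parser():
    p = argparse.ArgumentParser(description="TreeLoRA training (sketch)")
    p.add_argument("--model_name_or_path", type=str, required=True,
                   help="Path to the pretrained model")
    p.add_argument("--data_path", type=str, default="data/LLM-CL-Benchmark",
                   help="Path to the training dataset")
    p.add_argument("--dataset_name", type=str, nargs="+",
                   help="Names of the datasets to train on, in order")
    p.add_argument("--reg", type=float, default=0.5,
                   help="Regularization parameter (default per the README)")
    p.add_argument("--num_train_epochs", type=int, default=1,  # assumed default
                   help="Number of training epochs per task")
    return p

# Example: train sequentially on three tasks from the benchmark.
args = make_parser().parse_args(
    ["--model_name_or_path", "./PTM/Llama-3.2-1B-Instruct",
     "--dataset_name", "C-STANCE", "FOMC", "MeetingBank"])
```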
Alternatively, run `./scripts/run_all_exps.sh` to run all the experiments.
## Features
- Efficient continual learning through layer-wise LoRA adapters
- Hierarchical gradient-similarity tree for adapter organization
- Support for multiple LLM architectures (Gemma, LLaMA, Mistral, etc.)
- DeepSpeed integration for efficient training
- Flash attention implementation for improved performance
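For readers unfamiliar with LoRA itself, the sketch below shows the standard low-rank update that each adapter applies to a frozen linear layer, in plain Python for self-containment (the repository uses PyTorch/PEFT modules instead; `matmul` and `lora_forward` are illustrative names):

```python
# Standard LoRA update for one linear layer: the frozen weight W is
# augmented by a trainable low-rank product, W_eff = W + (alpha/r) * B @ A.

def matmul(X, Y):
    """Naive matrix multiply on nested lists (illustration only)."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)]
            for row in X]

def lora_forward(x, W, A, B, alpha, r):
    """x: batch x d_in, W: d_out x d_in (frozen),
    A: r x d_in, B: d_out x r (the only trained parameters)."""
    scale = alpha / r
    delta = matmul(B, A)  # d_out x d_in, rank at most r
    W_eff = [[w + scale * d for w, d in zip(w_row, d_row)]
             for w_row, d_row in zip(W, delta)]
    # y = x @ W_eff.T
    return matmul(x, [list(col) for col in zip(*W_eff)])
```

With `B` initialized to zeros (the usual LoRA initialization), the adapted layer reproduces the frozen layer's output exactly, so training starts from the pretrained behavior.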
## Citation
If you find this code useful, please cite our paper:
```bibtex
@inproceedings{ICML'25:TreeLoRA,
  author    = {Yu-Yang Qian and Yuan-Ze Xu and Zhen-Yu Zhang and Peng Zhao and Zhi-Hua Zhou},
  title     = {{T}ree{L}o{RA}: Efficient Continual Learning via Layer-Wise {L}o{RA}s Guided by a Hierarchical Gradient-Similarity Tree},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning (ICML)},
  year      = {2025},
  pages     = {50066--50085}
}
```
