LSG

[NeurIPS 2023] Official Implementation of Language Semantic Graph Guided Data-Efficient Learning

Generate Convert Improve

Install / Use

/learn @BIT-DA/LSG

About this skill

Quality Score

0/100

README

Language Semantic Graph Guided Data-Efficient Learning

Wenxuan Ma, Shuang Li*, Lincan Cai, Jingxuan Kang

</div>

This repository provides the official code for the NeurIPS 2023 paper Language Semantic Graph Guided Data-Efficient Learning (LSG).

Abstract

Developing generalizable models that can effectively learn from limited data and with minimal reliance on human supervision is a significant objective within the machine learning community, particularly in the era of deep neural networks. Therefore, to achieve data-efficient learning, researchers typically explore approaches that can leverage more related or unlabeled data without necessitating additional manual labeling efforts, such as Semi-Supervised Learning (SSL), Transfer Learning (TL), and Data Augmentation.

However, the significance of additional knowledge contained within labels has been largely overlooked in research. In this paper, we propose a novel perspective on data efficiency that involves exploiting the semantic information contained in the labels of the available data. Specifically, we introduce a Language Semantic Graph (LSG) which is constructed from labels manifest as natural language descriptions. Upon this graph, an auxiliary graph neural network is trained to extract high-level semantic relations and then used to guide the training of the primary model, enabling more adequate utilization of label knowledge. Across image, video, and audio modalities, we utilize the LSG method in both TL and SSL scenarios and illustrate its versatility in significantly enhancing performance compared to other data-efficient learning approaches.

Paradigm Introduction

LSG is a simple data-efficient learning paradigm that includes three steps:

Obtain text embedding of labels via pre-trained language model (PLM).
(The first training stage, lightweighted.) Consturct Language Semantic Graph (LSG) and correspondingly train a GCN to model the semantic relationship between embedded labels.
(The primary training stage.) Train the model with main task loss (i.e., CrossEntropy), and the two proposed alignment loss that leverage LSG and the pretrained GCN to learn label semantic information.

Dataset

This repository is build upon Self-Tuning, and the three dataset involved would be automatically downloaded once the code in run. Thanks for their amazing works.

If the datasets are not automatically downloaded, they can be manually downloaded as follows.

<details> <summary>Aircraft</summary> Create directory `./lsg_training/Aircraft`. Download image_list.zip from [here](https://cloud.tsinghua.edu.cn/f/449157d27987463cbdb1/?dl=1), train.tgz from [here](https://cloud.tsinghua.edu.cn/f/06804f17fdb947aa9401/?dl=1) and test.tgz from [here](https://cloud.tsinghua.edu.cn/f/164996d09cc749abbdeb/?dl=1). Extract them, and the directory will look like:

./lsg_training/Aircraft
├── train
├── test
├── image_list
│   ├── train100.txt
│   ├── train50.txt
│   ├── train30.txt
│   ├── train15.txt
│   ├── test.txt

</details>

Training

Step 1: Obtain label embeddings via Pretrained Language Models.

Codes involving obtaining label embeddings are in "./label_embedding/". Run the following command to obtain text embeddings.

You need to modify the following things in the code:

classnames: List
save_name: str

cd label_embedding

# Bert-Large
python text_embedding.py

# GPT-2
python text_embedding_gpt.py

# CLIP ViT-Large
python text_embedding_clip.py

Step 2: Construct the LSG and train a GCN model for each task.

Codes involving training the GCN and the main model are in "./lsg_training/". Run the following command to train GCNs:

cd lsg_training
sh train_stage1.sh

Step 3: Use the LSG and the trained GCN to train the main model.

Run the following command to train the main model:

sh train_stage2.sh

Citation

If you find this project useful, please consider citing:

@article{ma2023language,
  title={Language Semantic Graph Guided Data-Efficient Learning},
  author={Ma, Wenxuan and Li, Shuang and Cai, Lincan and Kang Jingxuan},
  journal={Advances in Neural Information Processing Systems},
  year={2023}
}

Contact

If you have any questions about our code, feel free to contact us or describe your problem in Issues.

Email address: wenxuanma@bit.edu.cn, shuangli@bit.edu.cn

Related Skills

proje

Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

best-practices-researcher

The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app

groundhog

400

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

BIT-DA

View profile

View on GitHub

GitHub Stars7

CategoryEducation

Updated7mo ago

Forks0

BIT-DA/LSG

Languages

Python

Security Score

77/100

Audited on Sep 3, 2025

No findings