CLLM4Rec: Collaborative Large Language Model for Recommender Systems
This code is associated with the following paper [pdf]:
Collaborative Large Language Model for Recommender Systems
Yaochen Zhu, Liang Wu, Qi Guo, Liangjie Hong, Jundong Li,
The ACM Web Conference (WWW) 2024.
which is a joint research from the University of Virginia VAST LAB and LinkedIn.
1. Introduction
The proposed CLLM4Rec is the first recommender system that tightly combines the ID-based paradigm and LLM-based paradigm and leverages the advantages of both worlds.
With the mutually-regularized pretraining and the soft+hard prompting strategy illustrated below, language modeling can be effectively conducted on recommendation-oriented corpora with heterogeneous user/item tokens.
<p align="center"> <img src="LLM4Rec.jpeg" alt="CLLM4Rec" width="66.6%" /> </p>

We also propose a recommendation-oriented finetuning strategy, such that recommendations of multiple items, with the whole item space as the candidate set, can be effectively generated without hallucination.
2. Structure of Codes
2.1. Horizontal Structure
We implement the following main classes based on the Hugging Face 🤗 Transformers library.
2.1.1. GPT4Rec Tokenizer Class:
TokenizerWithUserItemIDTokens breaks the word sequence into tokens, with user/item ID tokens introduced into the vocabulary. Specifically, if the vocabulary size of the original tokenizer is $N$, then for a system with $I$ users and $J$ items, the user/item ID words "user_i" and "item_j" are treated as atomic tokens, where the token ID of "user_i" is $N+i$ and the token ID of "item_j" is $N+I+j$.
Demo:
```
-----Show the encoding process:-----
Hello, user_1! Have you seen item_2?
['Hello', ',', 'user_1', '!', 'ĠHave', 'Ġyou', 'Ġseen', 'item_2', '?']
[15496, 11, 50258, 0, 8192, 345, 1775, 50269, 30]
```
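The ID mapping above (with GPT-2's $N=50257$, "user_1" becomes $50257+1=50258$) can be sketched in a few lines. The class below is a hypothetical, self-contained stand-in; the real TokenizerWithUserItemIDTokens wraps a full GPT-2 tokenizer:

```python
import re

class ToyUserItemTokenizer:
    """Hypothetical sketch of the user/item ID-token mapping:
    "user_i" -> N + i, "item_j" -> N + I + j, other words -> base vocab."""

    def __init__(self, base_vocab, num_users):
        self.base_vocab = base_vocab      # word -> id, ids in [0, N)
        self.N = len(base_vocab)          # original vocabulary size
        self.I = num_users                # number of users in the system

    def encode_token(self, tok):
        m = re.fullmatch(r"user_(\d+)", tok)
        if m:                             # user ID token: offset by N
            return self.N + int(m.group(1))
        m = re.fullmatch(r"item_(\d+)", tok)
        if m:                             # item ID token: offset by N + I
            return self.N + self.I + int(m.group(1))
        return self.base_vocab[tok]       # ordinary vocabulary word

tok = ToyUserItemTokenizer({"hello": 0, "!": 1}, num_users=10)
print(tok.encode_token("user_3"))   # 2 + 3 = 5
print(tok.encode_token("item_0"))   # 2 + 10 + 0 = 12
```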
2.1.2. GPT4Rec Base Model Class:
GPT4RecommendationBaseModel is the base class for collaborative GPT for recommender systems.
This class extends the vocabulary of the original GPT-2 with the user/item ID tokens. In our implementation, the user/item ID embeddings are randomly initialized. During training, we freeze the token embeddings of the original vocabulary as well as the transformer weights, so that only the user/item ID embeddings can be updated.
Demo:
```
input_ids:
tensor([[0, 1, 2],
        [3, 4, 5],
        [6, 7, 8]])
-----Calculated Masks-----
vocab_mask:
tensor([[1, 1, 1],
        [0, 0, 0],
        [0, 0, 0]])
user_mask:
tensor([[0, 0, 0],
        [1, 1, 1],
        [0, 0, 0]])
item_mask:
tensor([[0, 0, 0],
        [0, 0, 0],
        [1, 1, 1]])
-----Embed Vocabulary Tokens-----
vocab_ids:
tensor([[0, 1, 2],
        [0, 0, 0],
        [0, 0, 0]])
vocab_embeddings:
tensor([[[ 1.4444,  0.0186],
         [-0.3905,  1.5463],
         [-0.2093, -1.3653]],

        [[ 0.0000,  0.0000],
         [ 0.0000,  0.0000],
         [ 0.0000,  0.0000]],

        [[ 0.0000,  0.0000],
         [ 0.0000,  0.0000],
         [ 0.0000,  0.0000]]], grad_fn=<MulBackward0>)
-----Embed User Tokens-----
user_ids:
tensor([[0, 0, 0],
        [0, 1, 2],
        [0, 0, 0]])
user_embeds:
tensor([[[-0.0000,  0.0000],
         [-0.0000,  0.0000],
         [-0.0000,  0.0000]],

        [[-0.1392,  1.1265],
         [-0.7857,  1.4319],
         [ 0.4087, -0.0928]],

        [[-0.0000,  0.0000],
         [-0.0000,  0.0000],
         [-0.0000,  0.0000]]], grad_fn=<MulBackward0>)
-----Embed Item Tokens-----
item_ids:
tensor([[0, 0, 0],
        [0, 0, 0],
        [0, 1, 2]])
item_embeds:
tensor([[[-0.0000,  0.0000],
         [-0.0000,  0.0000],
         [-0.0000,  0.0000]],

        [[-0.0000,  0.0000],
         [-0.0000,  0.0000],
         [-0.0000,  0.0000]],

        [[-0.3141,  0.6641],
         [-1.4622, -0.5424],
         [ 0.6969, -0.6390]]], grad_fn=<MulBackward0>)
-----The Whole Embeddings-----
input_embeddings:
tensor([[[ 1.4444,  0.0186],
         [-0.3905,  1.5463],
         [-0.2093, -1.3653]],

        [[-0.1392,  1.1265],
         [-0.7857,  1.4319],
         [ 0.4087, -0.0928]],

        [[-0.3141,  0.6641],
         [-1.4622, -0.5424],
         [ 0.6969, -0.6390]]], grad_fn=<AddBackward0>)
```
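The demo above computes three disjoint masks and sums the masked embedding lookups. A minimal numpy sketch of that mask-and-sum logic (toy sizes and names; the real class uses PyTorch embedding layers, with the vocabulary table frozen):

```python
import numpy as np

# Toy sizes: vocab N=3, users I=3, items J=3, embedding dim D=2.
N, I, J, D = 3, 3, 3, 2
rng = np.random.default_rng(0)
vocab_emb = rng.normal(size=(N, D))   # frozen in the real model
user_emb  = rng.normal(size=(I, D))   # trainable user-ID embeddings
item_emb  = rng.normal(size=(J, D))   # trainable item-ID embeddings

input_ids = np.array([[0, 1, 2],
                      [3, 4, 5],
                      [6, 7, 8]])

# Disjoint masks: each token ID falls in exactly one of the three ranges.
vocab_mask = input_ids < N
user_mask  = (input_ids >= N) & (input_ids < N + I)
item_mask  = input_ids >= N + I

# Shift global IDs into each table's local range, zeroing masked-out slots.
vocab_ids = np.where(vocab_mask, input_ids, 0)
user_ids  = np.where(user_mask, input_ids - N, 0)
item_ids  = np.where(item_mask, input_ids - N - I, 0)

# Masked lookups sum to the final input embeddings.
embeds = (vocab_emb[vocab_ids] * vocab_mask[..., None]
          + user_emb[user_ids] * user_mask[..., None]
          + item_emb[item_ids] * item_mask[..., None])
print(embeds.shape)  # (3, 3, 2)
```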
2.1.3. Collaborative GPT Class:
CollaborativeGPTwithItemLMHeadBatch defines the collaborative GPT, which takes prompts of the form "user_i has interacted with" and conducts language modeling (i.e., next-token prediction) on the interacted item sequence, e.g., "item_j item_k item_z".
In this case, when doing next-token prediction, we only need to calculate the softmax over the item space.
Demo:
```
Prompt ids: tensor([[50257, 468, 49236, 351],
                    [50258, 468, 49236, 351],
                    [50259, 468, 49236, 351],
                    [50260, 468, 49236, 351],
                    [50261, 468, 49236, 351],
                    [50262, 468, 49236, 351],
                    [50263, 468, 49236, 351],
                    [50264, 468, 49236, 351],
                    [50265, 468, 49236, 351],
                    [50266, 468, 49236, 351],
                    [50267, 468, 49236, 351],
                    [50268, 468, 49236, 351],
                    [50269, 468, 49236, 351],
                    [50270, 468, 49236, 351],
                    [50271, 468, 49236, 351],
                    [50272, 468, 49236, 351]])
Main ids: tensor([[51602, 51603, 51604, 51605, 51607, 51608, 51609, 51610, 51613, 51614,
                   51615, 51616, 51617, 51618, 51619, 51621, 51622, 51624, 51625, 51626,
                   51628, 51630, 51632, 51633, 51634, 51635, 51636, 51637, 0, 0,
                   0, 0],
                  [51638, 51640, 51641, 51642, 51643, 51645, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51647, 51648, 51649, 51650, 51652, 51653, 51654, 51655, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51605, 51623, 51656, 51657, 51659, 51660, 51662, 51663, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51664, 51665, 51666, 51667, 51668, 51670, 51672, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51673, 51674, 51676, 51677, 51678, 51679, 51680, 51681, 51682, 51683,
                   51684, 51685, 51686, 51687, 51691, 51695, 51696, 51698, 51699, 51700,
                   51701, 51702, 51703, 51704, 51705, 51706, 51707, 51708, 51709, 51710,
                   51711, 51712],
                  [51713, 51714, 51716, 51717, 51718, 51719, 51720, 51721, 51722, 51723,
                   51724, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51604, 51611, 51612, 51616, 51666, 51727, 51728, 51729, 51731, 51732,
                   51733, 51734, 51735, 51737, 51738, 51740, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51741, 51742, 51743, 51744, 51747, 51748, 51749, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51619, 51625, 51732, 51750, 51751, 51752, 51753, 51754, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51621, 51640, 51645, 51672, 51741, 51756, 51758, 51759, 51760, 51761,
                   51763, 51765, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51618, 51763, 51767, 51768, 51769, 51770, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51625, 51769, 51771, 51772, 51773, 51775, 51776, 51777, 51778, 51780,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51673, 51674, 51675, 51676, 51677, 51679, 51681, 51694, 51699, 51701,
                   51781, 51782, 51783, 51785, 51786, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51660, 51737, 51758, 51787, 51788, 51789, 51790, 51792, 51793, 51794,
                   51795, 51796, 51798, 51799, 51800, 51801, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0],
                  [51661, 51760, 51793, 51804, 51805, 51806, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                   0, 0]])
Calculated loss: 14.4347
```
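In the demo above, the targets are always item tokens (IDs of 51602 and up), so the next-token head never needs scores for the full vocabulary. A minimal numpy sketch of an item-space-only softmax head, with toy sizes and hypothetical names:

```python
import numpy as np

# Toy head for next-item prediction: the softmax runs over the item
# space only, never the full vocabulary. Sizes and names are illustrative.
J, D = 5, 4                          # number of items, hidden size
rng = np.random.default_rng(0)
item_emb = rng.normal(size=(J, D))   # item-ID embeddings, reused as the LM head
h = rng.normal(size=(D,))            # final hidden state at one position

logits = item_emb @ h                # (J,) scores, one per item
probs = np.exp(logits - logits.max())
probs /= probs.sum()                 # softmax over items only

# The target arrives as a global token ID; shift it into the item range,
# e.g. global ID N + I + 3 -> local item index 3.
target_local = 3
loss = -np.log(probs[target_local])  # cross-entropy for this position
print(probs.shape, loss > 0)         # (5,) True
```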
2.1.4. Content GPT Class:
ContentGPTForUserItemWithLMHeadBatch defines the content GPT that conducts language modeling on user/item content.
Taking Amazon review data as an example, it treats "user_i writes the following review for item_j" as the prompt, while conducting language modeling (i.e., next-token prediction) on the main review text.
In this case, when predicting the next token, the softmax is calculated over the original vocabulary, and only the review tokens contribute to the loss.
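Since language modeling is conducted on the review text given the prompt, the prompt positions are excluded from the loss. A minimal numpy sketch of this prompt-masked next-token loss (vocab size, token IDs, and the -100 ignore label are illustrative, following common LM pipelines):

```python
import numpy as np

# Sketch: next-token loss on the review text only, with the prompt
# "user_i writes the following review for item_j" masked out of the loss.
IGNORE = -100
V = 10                                      # toy vocabulary size
rng = np.random.default_rng(0)

token_ids = np.array([7, 2, 9, 4, 1, 3, 8, 5])   # 4 prompt + 4 review tokens
prompt_len = 4

labels = token_ids[1:].copy()               # position t predicts token t+1
labels[:prompt_len - 1] = IGNORE            # targets that are prompt tokens

logits = rng.normal(size=(len(labels), V))  # stand-in for GPT outputs
log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

keep = labels != IGNORE                     # only review positions are scored
loss = -log_probs[keep, labels[keep]].mean()
print(int(keep.sum()))                      # 4 positions contribute to the loss
```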
