HierGR
Code for "HierGR: Hierarchical Semantic Representation Enhancement for Generative Retrieval in Food Delivery Search". ACL 2025.
Install / Use
/learn @zhangfw123/HierGRREADME
Code for HierGR
Offline Training on MSMARCO
Dataset
MSMARCO dataset is obtained from https://github.com/liyongqi67/MINDER.
RQ-VAE
code for offline hierarchical RQ-VAE training.
bash train_tokenizer.sh
bash tokenize.sh
GR_train
bash train.sh
bash test.sh
Large-scale Deployment
online_deployment_version/RQ-VAE
This is a version that utilize pytorch DDP to train hierarchical RQ-VAE. The embeddings of all items can be divide into several npy files, named "semantic_emb_*.npy".
Online GR Model Training
We directly use LLama-Factory to train our GR model based on Qwen2.5-1.5B-Instruct. LLama-Factory can be accessed from https://github.com/hiyouga/LLaMA-Factory.
Related Skills
node-connect
339.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.9kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
339.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.9kCommit, push, and open a PR
