Sqlbpe
The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.
Install / Use
/learn @SamuelGabriel/SqlbpeREADME
The Code for Byte-Pair Encoding for Text-to-SQL Generation
This is the code we used to conduct the experiments for this paper.
It allows you to group tokens in SQL corpora, which are commonly co-located, for easier prediction. The tools even allow to restrict the grouping to neighbors in the AST. Below you can see an illustration of an encoded example sentence.
If you want to use BPE encodings for your own projects on SQL data, please check the little doc on it.
If you are using torchtext you might also find some functions in utils.py helpful.
Related Skills
feishu-drive
338.7k|
things-mac
338.7kManage Things 3 via the `things` CLI on macOS (add/update projects+todos via URL scheme; read/search/list from the local Things database)
clawhub
338.7kUse the ClawHub CLI to search, install, update, and publish agent skills from clawhub.com
yu-ai-agent
1.9k编程导航 2025 年 AI 开发实战新项目,基于 Spring Boot 3 + Java 21 + Spring AI 构建 AI 恋爱大师应用和 ReAct 模式自主规划智能体YuManus,覆盖 AI 大模型接入、Spring AI 核心特性、Prompt 工程和优化、RAG 检索增强、向量数据库、Tool Calling 工具调用、MCP 模型上下文协议、AI Agent 开发(Manas Java 实现)、Cursor AI 工具等核心知识。用一套教程将程序员必知必会的 AI 技术一网打尽,帮你成为 AI 时代企业的香饽饽,给你的简历和求职大幅增加竞争力。
