ALP
ALP: Adaptive Lossless Floating-Point Compression
Install / Use
/learn @cwida/ALPREADME
ALP: Adaptive Lossless Floating-Point Compression
Authors: Azim Afroozeh, Leonardo Kuffó, Peter Boncz
Conference: ACM SIGMOD 2024
<img src="https://github.githubassets.com/images/modules/logos_page/GitHub-Mark.png" alt="GitHub" width="32" style="vertical-align:middle;"> What is this repo?
This repository contains the source code and benchmarks for the paper ALP: Adaptive Lossless Floating-Point Compression, published at ACM SIGMOD 2024.
ALP is a state-of-the-art lossless compression algorithm designed for IEEE 754 floating-point data. It encodes data by exploiting two common patterns found in real-world floating-point values:
-
Decimal Floating-Point Numbers:
A large portion of floats/doubles in real-world datasets are decimals. ALP maps these values into integers by multiplying the number by a power of 10 and then compressing the result using a FastLanes variant of Frame-of-Reference encoding[^1], which is SIMD-friendly.
Example: the number10.12becomes1012and is then fed to the FastLanes encoder. -
High-Precision Floating-Point Numbers:
The remaining values are typically high-precision floats/doubles. ALP targets compression opportunities in only the left part of these values, which it compresses using FastLanes dictionary encoding. The right part is left uncompressed, as it is required to preserve high precision and is often highly random and incompressible.
📊 How does ALP perform?

These results highlight ALP’s superior performance across all three key metrics of a compression algorithm:
Decoding Speed, Compression Ratio, and Compression Speed—outperforming other schemes in every category.
🧪 How to Reproduce Results
Just run the following script:
./publication/script/master_script.sh
For more information on reproducing our benchmarks, refer to our guide here,
or read the official ACM reproducibility report:
https://dl.acm.org/doi/10.1145/3687998.3717057
🏅 ACM Artifacts & Awards
We are happy to share that we participated in the SIGMOD Availability & Reproducibility Initiative, and our paper earned all three badges:
<p align="center"> <img src="assets/artifacts_available_v1_1.png" alt="ACM Artifacts Available" height="100"/> <img src="assets/artifacts_evaluated_reusable_v1_1.png" alt="ACM Artifacts Evaluated" height="100"/> <img src="assets/results_reproduced_v1_1.png" alt="ACM Results Reproduced" height="100"/> </p>🎉 We're also proud to share that ALP won the SIGMOD Best Artifact Award!
<p align="center"> <img src="assets/trophy.png" alt="Trophy" height="100"/> </p>⏱️ Want to Benchmark Your Dataset?
Check out our guide: How to Benchmark Your Dataset
It explains how to run ALP on your own data.
🗂️ Repository Structure
src/: Core implementation of ALP and ALP_RDbenchmarks/: Benchmarking tools and datasetsinclude/: Header files for integrationscripts/: Utility scripts for data processingtest/: Unit testspublication/: Publications and supplementary materials
📚 Publications
-
Conference Paper:
ALP: Adaptive Lossless Floating-Point Compression, ACM SIGMOD 2024
https://dl.acm.org/doi/10.1145/3626717 -
Reproducibility Report:
Reproducibility Report for ACM SIGMOD 2024 Paper: 'ALP: Adaptive Lossless Floating-Point Compression'
https://dl.acm.org/doi/10.1145/3687998.3717057
📄 License
This project is licensed under the MIT License. See the LICENSE file for details.
📬 Contact
If you have questions, want to contribute, or just want to stay up to date with ALP and related projects, join our community on Discord:
🧩 Used By
ALP has been integrated into the following systems:
[^1]: Learn more about FastLanes here: https://github.com/cwida/fastlanes
Related Skills
node-connect
339.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.9kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
339.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.9kCommit, push, and open a PR
