Marconi
Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention]
Marconi: Prefix Caching for the Era of Hybrid LLMs
This repository contains the source code for the MLSys '25 paper Marconi: Prefix Caching for the Era of Hybrid LLMs.
Getting Started
Marconi is implemented in Python. We have tested Marconi on Ubuntu 22.04 with Python 3.11.9.
Detailed instructions on how to reproduce the main results from our MLSys paper are in artifact_evaluation.md.
References
@article{pan2024marconi,
title={Marconi: Prefix Caching for the Era of Hybrid LLMs},
author={Pan, Rui and Wang, Zhuang and Jia, Zhen and Karakus, Can and Zancato, Luca and Dao, Tri and Netravali, Ravi and Wang, Yida},
journal={arXiv preprint arXiv:2411.19379},
year={2024}
}
