93 skills found · Page 1 of 4
open-compress / Claw Compactor: 14-stage Fusion Pipeline for LLM token compression: reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.
cokeshao / Awesome Multimodal Token Compression: [TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198
xuyang-liu16 / Awesome Token Level Model Compression: 📚 Collection of token-level model compression resources.
Hannibal046 / XRAG: [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
yaolinli / MLLM Token Compression: Towards Efficient Multimodal Large Language Models: A Survey on Token Compression
S1LV4 / Th0th: 🏛️ Ancient knowledge keeper for modern code. Semantic search with 98% token reduction for AI assistants. Features: hybrid search, context compression, persistent memory.
HelgeSverre / Toon Php: Token-Oriented Object Notation - a compact data format for reducing token consumption when sending structured data to LLMs (PHP implementation)
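To illustrate the idea behind a token-oriented format like TOON (a minimal Python sketch of the general approach, not the PHP library's API; `encode_rows` is a hypothetical helper), a uniform array of objects can declare its field names once in a header instead of repeating JSON keys on every row:

```python
def encode_rows(key: str, rows: list[dict]) -> str:
    """Encode a uniform list of dicts in a TOON-like tabular form:
    field names appear once in the header, rows are comma-joined values."""
    fields = list(rows[0].keys())
    lines = [f"{key}[{len(rows)}]{{{','.join(fields)}}}:"]
    for row in rows:
        lines.append("  " + ",".join(str(row[f]) for f in fields))
    return "\n".join(lines)

users = [{"id": 1, "name": "Alice"}, {"id": 2, "name": "Bob"}]
print(encode_rows("users", users))
# users[2]{id,name}:
#   1,Alice
#   2,Bob
```

Compared with the equivalent JSON, the keys `id` and `name` are emitted once rather than once per row, which is where the token savings come from on large uniform arrays.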
HumanMLLM / LLaVA Scissor: The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
KD-TAO / DyCoke: [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
OpenGVLab / DiffRate: [ICCV 2023] An approach that enhances the efficiency of Vision Transformers (ViT) by concurrently employing token pruning and token merging, while incorporating a differentiable compression rate.
ppgranger / Token Saver: Content-aware output compression for AI coding assistants. Replaces blind truncation with intelligent strategies per file type: structural summaries for code, schema extraction for configs, error-focused filtering for logs, and smart sampling for CSVs. Saves tokens while preserving what the model actually needs.
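The per-file-type routing the entry above describes can be sketched as a dispatch table keyed on file extension (an illustrative Python sketch, not the repository's actual code; all function names here are hypothetical):

```python
from pathlib import Path

def compress_log(text: str, keep: int = 5) -> str:
    """Error-focused filtering: keep error lines, else the tail."""
    errors = [line for line in text.splitlines() if "error" in line.lower()]
    return "\n".join(errors or text.splitlines()[-keep:])

def compress_csv(text: str, head: int = 3) -> str:
    """Smart sampling: header plus the first few rows, with a row count."""
    lines = text.splitlines()
    sampled = lines[: head + 1]
    if len(lines) > head + 1:
        sampled.append(f"... ({len(lines) - 1} rows total)")
    return "\n".join(sampled)

STRATEGIES = {".log": compress_log, ".csv": compress_csv}

def compress(path: str, text: str) -> str:
    """Route content to a per-file-type strategy; fall back to truncation."""
    handler = STRATEGIES.get(Path(path).suffix)
    return handler(text) if handler else text[:2000]
```

The design point is that each strategy knows what the model actually needs from that file type (error lines from logs, schema from CSVs), so the fallback truncation is only ever a last resort.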
HVision-NKU / GlimpsePrune: Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"
opendilab / HH Codec: [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling
Huzaifa785 / Context Compressor: AI-powered text compression library for RAG systems and API calls. Reduces token usage by 50-60% while preserving semantic meaning with advanced compression strategies.
yaolinli / DeCo: Code for DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models
turingmotors / One D Piece: [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
MaxDevv / Un LOCC: Un-LOCC: Universal Lossy Optical Context Compression for Vision-Based Language Models. Achieves nearly 3x token compression at over 93% retrieval accuracy using existing vision-language models.
KD-TAO / OmniZip: OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
asahi417 / Lm Vocab Trimmer: Vocabulary Trimming (VT) is a model compression technique that reduces a multilingual LM's vocabulary to a target language by deleting irrelevant tokens. This repository contains a Python library, vocabtrimmer, which removes tokens irrelevant to the target language from a multilingual LM vocabulary.
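The core idea of vocabulary trimming can be sketched in a few lines (a toy Python sketch of the concept; the real vocabtrimmer library operates on actual tokenizers and model embedding matrices, and `trim_vocab` is a hypothetical name):

```python
def trim_vocab(vocab: dict[str, int], corpus: str) -> dict[str, int]:
    """Keep special tokens plus any subword piece that occurs in a
    target-language corpus, then reindex the survivors contiguously
    so the embedding matrix can be shrunk to match."""
    kept = [tok for tok in vocab if tok.startswith("<") or tok in corpus]
    return {tok: i for i, tok in enumerate(kept)}

vocab = {"<pad>": 0, "hel": 1, "lo": 2, "düs": 3}
print(trim_vocab(vocab, "hello world"))
# {'<pad>': 0, 'hel': 1, 'lo': 2}
```

Because embedding tables dominate parameter counts in multilingual models, deleting rows for tokens the target language never uses shrinks the model without touching its transformer layers.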
ilang-ai / Autocode: You say it. AutoCode builds it. 38 professional skills, persistent memory, 60%+ dev cost savings. Zero dependencies. Free forever.