77 skills found · Page 1 of 3
zilliztech / GPTCache: Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
olimorris / Onedarkpro.nvim: 🎨 Atom's iconic One Dark theme. Cacheable, fully customisable, with Tree-sitter and LSP semantic token support. Comes with variants.
codefuse-ai / ModelCache: An LLM semantic caching system that aims to enhance user experience by reducing response time via cached query-result pairs.
redis-developer / ArXivChatGuru: Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
edwinkys / Oasysdb: In-memory vector store with efficient read and write performance for semantic caching and retrieval systems.
thu-nics / C2C: [ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models".
messkan / Prompt Cache: Cut LLM costs by up to 80% and unlock sub-millisecond responses with intelligent semantic caching. A drop-in, provider-agnostic LLM proxy written in Go.
aqstack / Mimir: A drop-in proxy that caches LLM API responses using semantic similarity, reducing costs and latency for repeated or similar queries.
peva3 / SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.
colossus-lab / Openarg Backend: AI-powered analysis engine for Argentine government open data. Multi-agent pipeline (LangGraph) with 10 data connectors, NL2SQL, semantic caching, and real-time streaming. Built with FastAPI, PostgreSQL + pgvector, Celery, and Gemini/Claude LLMs.
sensoris / Semcache: Semantic caching layer for your LLM applications. Reuse responses and reduce token usage.
vcache-project / VCache: Reliable and efficient semantic prompt caching with vCache.
marmeladema / Clru Rs: An LRU cache implementation with constant-time operations and weighted semantics.
shivendrasoni / Vector Cache: A simple semantic cache implementation. It caches responses from an LLM based on semantic similarity.
winter2020 / Kleespectre: KLEESpectre is a symbolic execution engine with speculation semantics and cache modelling.
albertobadia / Zoocache: Semantic-dependency-based cache built with high performance and concurrency in mind.
AzozzALFiras / Claude Context Optimizer: MCP server that cuts Claude Code token usage by up to 98% through smart file caching, semantic reads, log compression, task checkpoints, and a context watchdog. Zero native dependencies.
databricks-industry-solutions / Semantic Caching: This project implements a caching system for Databricks, designed to improve response times and reduce cost for frequently asked questions or similar queries.
sjtu-zhao-lab / ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression (DAC'25).
Canonical-AI-Inc / Canonical: Context-aware semantic cache for conversational AI.
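Most of the LLM-focused entries above share one core mechanism: embed the incoming prompt, compare it against embeddings of previously seen prompts, and return the cached response when similarity clears a threshold. A minimal sketch of that idea, using a toy bag-of-words embedding in place of a real embedding model (the `SemanticCache` class and its `threshold` parameter are illustrative, not any listed project's API):

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy embedding: bag-of-words term counts.
    # Real systems use a neural embedding model here.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    def __init__(self, threshold: float = 0.8):
        self.threshold = threshold
        self.entries = []  # list of (embedding, prompt, response)

    def get(self, prompt: str):
        # Return the cached response of the most similar stored
        # prompt, if it clears the similarity threshold.
        q = embed(prompt)
        best = max(self.entries, key=lambda e: cosine(q, e[0]), default=None)
        if best is not None and cosine(q, best[0]) >= self.threshold:
            return best[2]  # cache hit on a near-duplicate prompt
        return None  # cache miss: caller falls through to the LLM

    def put(self, prompt: str, response: str):
        self.entries.append((embed(prompt), prompt, response))

cache = SemanticCache(threshold=0.8)
cache.put("what is the capital of France", "Paris")
print(cache.get("what is the capital of France ?"))  # near-duplicate -> hit
print(cache.get("tell me a joke"))                   # unrelated -> miss
```

Production systems replace the linear scan with an approximate-nearest-neighbour index (pgvector, Redis VSS, FAISS) and tune the threshold to trade hit rate against the risk of returning a stale or mismatched answer.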