diegosouzapw / OmniRoute: OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for reliable, cost-aware inference.
kubernetes-sigs / Gateway API Inference Extension
lightseekorg / Smg: Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across SGLang, vLLM, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history, tokenization caching, Responses API, embeddings, WASM plugins, MCP, and multi-tenant auth.
Nayjest / Lm Proxy: OpenAI-compatible HTTP LLM proxy/gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI; use it as a library or a standalone service.
inference-gateway / Inference Gateway: An open-source, cloud-native, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare, and DeepSeek.
NightmareAI / Cogflare: A flexible gateway for running ML inference jobs through cloud providers or your own GPU. Powered by Replicate and Cloudflare Workers.
modelgw / Modelgw: Gateway and load balancer for your LLM inference endpoints.
Cognipeer / Console: Operate inference, LLM gateways, vector stores, tracing, guardrails, RAG, config, and incident workflows behind one production-ready console with tenant isolation built in.
inference-gateway / Adk: An Agent Development Kit (ADK) for seamlessly building A2A-compatible agents in Go.
microsoft / InnerEye Gateway: The InnerEye-Gateway is a Windows service that acts as a DICOM endpoint to run inference on https://github.com/microsoft/InnerEye-DeepLearning models.
inference-gateway / Google Calendar Agent: An A2A agent server enabling Google Calendar scheduling, retrieval, and automation.
cameronking4 / Programmatic Tool Calling AI SDK: ⚡ Cut LLM inference costs 80% with Programmatic Tool Calling. Instead of N tool-call round-trips, generate JavaScript that orchestrates tools in a Vercel Sandbox. Supports Anthropic, OpenAI, and 100+ models via AI Gateway; a novel MCP Bridge handles external service integration.
alvaropaco / Haif: Production-ready microservices framework for AI inference over RPC. It provides a Gateway for client requests, an Orchestrator that schedules work, a Registry for model metadata, Workers that run inference, and a full observability stack (Prometheus, Grafana, Loki, Jaeger), all wired together with Docker Compose.
sofianhamiti / Amazon Sagemaker Pipelines Serverless Inference: Deploying a serverless inference service with Amazon SageMaker Pipelines, AWS Lambda, Amazon API Gateway, and the AWS CDK.
busthorne / Simp: A simple point of consumption for text inference providers, and an OpenAI-compatible gateway daemon.
forpublicai / Platform.publicai.co: The API gateway for the Public AI Inference Utility, based on Zuplo.
sofianhamiti / Aws Lambda R Inference: Deploying a serverless R inference service using AWS Lambda, Amazon API Gateway, and the AWS CDK.
savinims / DATAS Causal Discovery: Causal inference tutorials written as part of the Data Analysis Tools for Atmospheric Scientists (DATAS) Gateway.
sofianhamiti / Aws Lambda Multi Model Express Workflow: Deploying a multi-model inference service with AWS Lambda, Synchronous Express Workflows, Amazon API Gateway, and the AWS CDK.
forsterdan51 / Edge Vision AI: A secure, lightweight inference runtime optimized for ARM boards and IoT gateways. It enables real-time computer-vision processing without cloud dependencies.
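A common thread in this list is the OpenAI-compatible endpoint: gateways like OmniRoute, Smg, Lm Proxy, Simp, and Inference Gateway expose the same `/v1/chat/completions` shape, so a client switches providers by changing only the base URL and the gateway-issued key. A minimal sketch of that request shape, assuming a hypothetical gateway at `http://localhost:8080/v1` (the URL, model name, and key below are illustrative, not from any specific project):

```python
import json

# Hypothetical gateway address; any OpenAI-compatible gateway above would fit here.
GATEWAY_BASE_URL = "http://localhost:8080/v1"


def build_chat_request(model, prompt, api_key):
    """Build the URL, headers, and JSON body for an OpenAI-compatible
    chat-completions call routed through a gateway."""
    url = f"{GATEWAY_BASE_URL}/chat/completions"
    headers = {
        # Gateway-issued key; the gateway holds the real provider credentials.
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = json.dumps({
        # Gateways typically route, load-balance, or fall back based on this field.
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    })
    return url, headers, body


url, headers, body = build_chat_request("gpt-4o-mini", "Hello", "sk-gateway-key")
print(url)  # http://localhost:8080/v1/chat/completions
```

Because the wire format is fixed, the gateway can transparently add the retries, caching, and fallback behavior these projects advertise without any client-side changes.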