2,180 skills found · Page 1 of 73
huggingface / Transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models for text, vision, audio, and multimodal tasks, for both inference and training.

meta-llama / Llama
Inference code for Llama models

facebookresearch / Segment Anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

black-forest-labs / Flux
Official inference repo for FLUX.1 models

FunAudioLLM / CosyVoice
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capability.

facebookresearch / Sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

meta-llama / Llama Cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family on various provider services.

meta-llama / Codellama
Inference code for CodeLlama models

NVIDIA / TensorRT LLM
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

bentoml / OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.

huggingface / Text Generation Inference
Large Language Model Text Generation Inference

mistralai / Mistral Inference
Official inference library for Mistral models

Const-me / Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

xorbitsai / Inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop, all through one unified, production-ready inference API.

facebookresearch / Sam3
The repository provides code for running inference and fine-tuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

bentoml / BentoML
The easiest way to serve AI apps and models: build model inference APIs, job queues, LLM apps, multi-model pipelines, and more!

OptimalScale / LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

py-why / Dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.

google / Gemma.cpp
A lightweight, standalone C++ inference engine for Google's Gemma models.

allenai / OLMo
Modeling, training, eval, and inference code for OLMo
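Several servers in this list (OpenLLM, Xinference) expose OpenAI-compatible endpoints, so a client written against the standard chat-completions schema can talk to any of them. Below is a minimal sketch using only the Python standard library; the base URL, port, and model id are placeholder assumptions, not values taken from any listing above — substitute whatever your local server reports.

```python
import json
from urllib import request

# Assumed local endpoint of an OpenAI-compatible server
# (e.g. started via OpenLLM or Xinference); adjust host/port as needed.
BASE_URL = "http://localhost:3000/v1/chat/completions"

# Standard OpenAI chat-completions payload; the model id is hypothetical.
payload = {
    "model": "llama-3.1-8b-instruct",
    "messages": [
        {"role": "user", "content": "Summarize SAM 2 in one sentence."}
    ],
    "max_tokens": 128,
}

# Build the POST request without sending it, so the sketch runs offline.
req = request.Request(
    BASE_URL,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)

# Uncomment once a server is actually running:
# with request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the request schema is shared, swapping between the servers above usually only means changing `BASE_URL` and the model id.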