15 skills found
city96 / ComfyUI GGUF: GGUF quantization support for native ComfyUI models.
brontoguana / Krasis: Krasis is a hybrid LLM runtime focused on efficiently running larger models on consumer-grade, VRAM-limited hardware.
1038lab / ComfyUI JoyCaption: Joy Caption is a ComfyUI node using the LLaVA model to generate stylized image captions, supporting batch processing and GGUF models.
1038lab / ComfyUI MiniCPM: A custom ComfyUI node for MiniCPM vision-language models, supporting the v4, v4.5, and v4 GGUF formats, enabling high-quality image captioning and visual analysis.
ai-joe-git / ComfyUI Intel Arc Clean Install Windows Venv XPU: Fully automated installation scripts for ComfyUI optimized for Intel Arc GPUs (A-Series) and Intel Core Ultra iGPUs with XPU backend, Triton acceleration, and GGUF quantized model support.
airesearch-official / Z Image Turbo Windows: One-click Windows installer for Z-Image Turbo AI image generation. Optimized for low-VRAM GPUs (4GB+). Features Gradio web UI, automatic setup, and GGUF model support.
meganoob1337 / Llama Swap Vllm Boilerplate: Dynamic LLM model swapping system with Docker, vLLM integration, and GPU acceleration. Supports GGUF & Hugging Face models with automatic swapping and Traefik routing.
kantan-kanto / ComfyUI LLM Session: Local LLM session nodes for ComfyUI using GGUF and llama.cpp, supporting Llama, Mistral, Qwen, DeepSeek, GLM, Gemma, Phi, LLaVA, and gpt-oss, enabling both user–model chat and model-to-model dialogue without external runtimes like Ollama.
DevMaan707 / Llm Toolkit: A comprehensive Flutter SDK for running Large Language Models (LLMs) locally on mobile and desktop devices. Supports multiple inference engines, including Gemma (TFLite) and Llama (GGUF), with integrated model discovery, download, and chat capabilities.
nexusjuan12 / FLUX.1 Kontext Multi Image: Multi-image implementation of Flux.1-Kontext with quantized model support in GGUF format. Also includes an app that produces a series of portraits using the same model.
nareshis21 / Truelarge RT: Android inference engine running 20B+ parameter LLMs on devices with 4-8 GB of RAM. Features proprietary Layer-by-Layer (LBL) streaming, zero-copy mmap loading, and a native C++/Kotlin architecture.
kantan-kanto / ComfyUI MultiModal Prompt Nodes: Multimodal prompt generator nodes for ComfyUI, designed to generate prompts for QwenImageEdit and Wan2.2. Supports local LLM / local GGUF models (Qwen3.5, Qwen3-VL, and Qwen2.5-VL) and the Qwen API for image and video prompt generation and enhancement.
ml-rust / Blazr: Production-grade inference server for LLMs. Supports standard HuggingFace models (Llama, Mistral, Qwen, Phi, Gemma, DeepSeek) and custom hybrid architectures (Mamba2, MLA, MoE). Loads SafeTensors, AWQ, GPTQ, and GGUF formats.
Divith123 / LoRA The Second Brain: An open-source AI chatbot app that runs models locally using Ollama, supporting a wide variety of Small Language Models (SLMs) from Meta, Google, Alibaba, and others in GGUF and H2O-Danube formats.
duoyuncloud / ModelConverterTool: A CLI and API tool for converting, validating, and managing machine learning models across multiple formats. Supports ONNX, FP16, HuggingFace, TorchScript, GGUF, MLX, GPTQ, AWQ, and more.
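Nearly every entry above revolves around the GGUF container format. As a minimal illustration of what these loaders parse first, here is a hedged sketch that reads the fixed GGUF header fields (magic, version, tensor count, metadata key/value count) from a byte buffer. The field layout follows the public GGUF specification; the code and the synthetic demo buffer are illustrative and not taken from any of the listed projects.

```python
import struct

def parse_gguf_header(buf: bytes) -> dict:
    # Per the public GGUF spec, a file begins with the 4-byte magic "GGUF",
    # followed by a uint32 format version, a uint64 tensor count, and a
    # uint64 metadata key/value count, all little-endian (24 bytes total).
    magic, version, n_tensors, n_kv = struct.unpack_from("<4sIQQ", buf, 0)
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return {"version": version, "tensors": n_tensors, "metadata_kv": n_kv}

# Synthetic header for demonstration: version 3, 2 tensors, 5 metadata pairs.
demo = struct.pack("<4sIQQ", b"GGUF", 3, 2, 5)
print(parse_gguf_header(demo))  # {'version': 3, 'tensors': 2, 'metadata_kv': 5}
```

The metadata key/value section that follows the header is where tools like ComfyUI GGUF or ModelConverterTool read quantization type, architecture, and tokenizer details before touching any tensor data.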