Results for "gguf-quantization"

36 skills found · Page 1 of 2

city96 / ComfyUI GGUF

3.5k

GGUF Quantization support for native ComfyUI models

universal

Updated 2h ago

iuliaturc / Gguf Docs

409

Docs for GGUF quantization (unofficial)

universal

Updated 2d ago

matt-c1 / Llama 3 Quant Comparison

170

Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.

universal

Updated 3d ago

ai-joe-git / ComfyUI Intel Arc Clean Install Windows Venv XPU

137

Fully automated installation scripts for ComfyUI optimized for Intel Arc GPUs (A-Series) and Intel Core Ultra iGPUs with XPU backend, Triton acceleration, and GGUF quantized model support.

zed

Updated 5d ago

Thireus / GGUF Tool Suite

Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolchain will create a GGUF recipe tuned to your system within seconds — flexible model sizing and lowest achievable perplexity/kld for advanced users seeking precise and automated GGUF dynamic quant production.

universal

Updated 6h ago

xhedit / Quantkit

CLI tool to quantize GGUF, GPTQ, AWQ, HQQ and EXL2 models

universal

Updated 21d ago

jjang-ai / Jangq

JANG — GGUF for MLX. YOU MUST USE JANG_Q RUNTIME. Adaptive Mixed-Precision Quantization + Runtime for Apple Silicon

universal

apple-silicon · gguf · jang-quantization · +7

Updated 7h ago

IST-DASLab / Gptq Gguf Toolkit

Efficient non-uniform quantization with GPTQ for GGUF

universal

Updated 16d ago

AIAnytime / GGUF Quantization Of Any LLM

GGUF Quantization of any LLM.

universal

Updated 17d ago

jina-ai / Jina Embeddings V4 Gguf

A collection of GGUF and quantizations for jina-embeddings-v4

universal

Updated 6d ago

electroglyph / Quant Clone

Generate a llama-quantize command to copy the quantization parameters of any GGUF

universal

Updated 16d ago

3eeps / Cherry Py

Simple prompt script to convert HF/GGML files to GGUF and to quantize them

universal

Updated 7mo ago

magiccodingman / MagicQuant Wiki

Evolution process to find the best quant tensor weights to build the most optimal GGUF options for an AI model.

universal

ai · gguf · gguf-hybrid · +3

Updated 1mo ago

caiovicentino / Eoq Quantization

EOQ: Entropy-Optimal Quantization for LLMs. 11-41% smaller than GGUF Q4_K_M with near-FP16 perplexity.

universal

Updated 5h ago

XinYu-pumch / ZFusion

Z-Fusion: One-Click LoRA Merger & GGUF Quantizer

universal

Updated 22d ago

robbiemu / Llama Gguf Optimize

Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.

universal

Updated 7d ago

qskousen / Ggufy

CLI tool for efficient and easy safetensors and gguf model conversion

universal

comfyui · diffusion-models · ggml · +4

Updated 2d ago

r-vage / ComfyUI Eclipse

Comprehensive ComfyUI custom node suite featuring Smart Loaders (multi-format checkpoint support with Nunchaku/GGUF quantization), Smart Prompt system with wildcards, sophisticated pipe ecosystem, universal type converters, image/video utilities, and workflow helpers.

universal

Updated 7h ago

laelhalawani / Gguf Modeldb

A quick and optimized solution to manage llama-based GGUF quantized models: download GGUF files, retrieve message formatting, add more models from HF repos, and more. It's super easy to use and comes prepacked with the best preconfigured open-source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b.

zed

database · hugginface · inference · +6

Updated 1y ago

shettysach / CandleMist

Fullstack chatbot built using Rust. Made using Candle, Leptos, Actix, Tokio and Tailwind. Uses quantized Mistral 7B Instruct v0.1 GGUF models.

zed

actix · backend · candle · +11

Updated 17d ago