36 skills found · Page 1 of 2
city96 / ComfyUI GGUFGGUF Quantization support for native ComfyUI models
iuliaturc / Gguf DocsDocs for GGUF quantization (unofficial)
matt-c1 / Llama 3 Quant ComparisonComparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
ai-joe-git / ComfyUI Intel Arc Clean Install Windows Venv XPU Fully automated installation scripts for ComfyUI optimized for Intel Arc GPUs (A-Series) and Intel Core Ultra iGPUs with XPU backend, Triton acceleration, and GGUF quantized model support.
Thireus / GGUF Tool SuiteProduce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolchain will create a GGUF recipe tuned to your system within seconds — flexible model sizing and lowest achievable perplexity/kld for advanced users seeking precise and automated GGUF dynamic quant production.
xhedit / Quantkitcli tool to quantize gguf, gptq, awq, hqq and exl2 models
jjang-ai / JangqJANG — GGUF for MLX. YOU MUST USE JANG_Q RUNTIME. Adaptive Mixed-Precision Quantization + Runtime for Apple Silicon
IST-DASLab / Gptq Gguf ToolkitEfficient non-uniform quantization with GPTQ for GGUF
AIAnytime / GGUF Quantization Of Any LLMGGUF Quantization of any LLM.
jina-ai / Jina Embeddings V4 GgufA collection of GGUF and quantizations for jina-embeddings-v4
electroglyph / Quant CloneGenerate a llama-quantize command to copy the quantization parameters of any GGUF
3eeps / Cherry Pysimple prompt script to convert hf/ggml files to gguf, and to quantize
magiccodingman / MagicQuant WikiEvolution process to find the best quant tensor weights to build the most optimal GGUF options for an AI model.
caiovicentino / Eoq QuantizationEOQ: Entropy-Optimal Quantization for LLMs. 11-41% smaller than GGUF Q4_K_M with near-FP16 perplexity.
XinYu-pumch / ZFusionZ-Fusion: One-Click LoRA Merger & GGUF Quantizer
robbiemu / Llama Gguf OptimizeScripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
qskousen / GgufyCLI tool for efficient and easy safetensors and gguf model conversion
r-vage / ComfyUI EclipseComprehensive ComfyUI custom node suite featuring Smart Loaders (multi-format checkpoint support with Nunchaku/GGUF quantization), Smart Prompt system with wildcards, sophisticated pipe ecosystem, universal type converters, image/video utilities, and workflow helpers.
laelhalawani / Gguf ModeldbA quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more models from hf repos and more. It's super easy to use and comes prepacked with best preconfigured open source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b
shettysach / CandleMistFullstack chatbot built using Rust. Made using Candle, Leptos, Actix, Tokio and Tailwind. Uses quantized Mistral 7B Instruct v0.1 GGUF models.