Results for "model-quantization"

427 skills found · Page 4 of 15

PKULab1806 / Fairy Plus Minus I

124

Fairy±i (iFairy): Complex-valued Quantization Framework for Large Language Models

universal

Updated 10d ago

xiaoxiao0406 / VQ VLA

115

The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)

zed

Updated 6h ago

vinx13 / Tvm Cuda Int8 Benchmark

112

Benchmark of TVM quantized model on CUDA

zed

Updated 3mo ago

lum3on / ComfyUI ModelQuantizer

111

A repo to quantize diffusion models directly in ComfyUI

universal

Updated 3d ago

ictnlp / SLED TTS

110

Streamable Text-to-Speech model using a language modeling approach, without vector quantization

universal

speech-language-model · speech-synthesis · streaming-inference · +1

Updated 10d ago

MinusZoneAI / ComfyUI CogVideoX MZ

110

CogVideoX-5B 4-bit quantization model

universal

cogvideox · comfyui · comfyui-nodes · +1

Updated 7d ago

ModelTC / TFMQ DM

109

[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".

universal

cvpr · cvpr2024 · ddim · +8

Updated 12d ago

google-ai-edge / AI Edge Quantizer

107

AI Edge Quantizer: flexible post training quantization for LiteRT models.

universal

Updated 3d ago

ziplab / PTQD

103

The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models

universal

Updated 2mo ago

iconben / Z Image Studio

101

A CLI, a web UI, and an MCP server for the Z-Image-Turbo text-to-image generation model (the Tongyi-MAI/Z-Image-Turbo base model as well as quantized models)

zed · claude code · +1

ai · apple · apple-silicon · +13

Updated 1d ago

kingreza / Quantization

A deep dive into Apple's coremltools quantization and how to reduce the size of a Core ML model without losing accuracy and performance

universal

Updated 1mo ago

BrotherHappy / OSTQuant

[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting

universal

Updated 6d ago

Laicheng0830 / Pytorch Model Quantization

Static quantization, saving, and loading of OpenPose models with PyTorch

universal

Updated 1mo ago

Thireus / GGUF Tool Suite

Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolchain will create a GGUF recipe tuned to your system within seconds — flexible model sizing and lowest achievable perplexity/kld for advanced users seeking precise and automated GGUF dynamic quant production.

universal

Updated 2h ago