427 skills found · Page 4 of 15
PKULab1806 / Fairy Plus Minus IFairy±i (iFairy): Complex-valued Quantization Framework for Large Language Models
xiaoxiao0406 / VQ VLAThe offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
vinx13 / Tvm Cuda Int8 BenchmarkBenchmark of TVM quantized model on CUDA
lum3on / ComfyUI ModelQuantizerA repo to quantize diffusion models directly in ComfyUI
ictnlp / SLED TTSStreamable Text-to-Speech model using a language modeling approach, without vector quantization
MinusZoneAI / ComfyUI CogVideoX MZCogVideoX-5B 4-bit quantization model
ModelTC / TFMQ DM[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".
google-ai-edge / AI Edge QuantizerAI Edge Quantizer: flexible post training quantization for LiteRT models.
ziplab / PTQDThe official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models
A Cli, a webUI, and a MCP server for the Z-Image-Turbo text-to-image generation model (Tongyi-MAI/Z-Image-Turbo base model as well as quantized models)
kingreza / QuantizationA deep dive into Apple's coremltools quantization and how to reduce the size of a Core ML model without losing accuracy and performance
BrotherHappy / OSTQuant[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
Laicheng0830 / Pytorch Model QuantizationOpenPose uses Pytorch for static quantization, saving, and loading of models
Thireus / GGUF Tool SuiteProduce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolchain will create a GGUF recipe tuned to your system within seconds — flexible model sizing and lowest achievable perplexity/kld for advanced users seeking precise and automated GGUF dynamic quant production.
TropComplique / Image Classification Caltech 256Exploring CNNs and model quantization on Caltech-256 dataset
thu-nics / MBQThe code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"
tangchen2 / Model CompressionModel Compression 1. Pruning(BN Pruning) 2. Knowledge Distillation (Hinton) 3. Quantization (MNN) 4. Deployment (MNN)
xhedit / Quantkitcli tool to quantize gguf, gptq, awq, hqq and exl2 models
RodolfoFerro / Psychopathology Fer Assistant[WINNER! 🏆] Psychopathology FER Assistant. Because mental health matters. My project submission for #TFWorld TF 2.0 Challenge at Devpost.
pentilm / Torch QuantA PyTorch quantization tool for machine learning models