artidoro/qlora · QLoRA: Efficient Finetuning of Quantized LLMs
bitsandbytes-foundation/bitsandbytes · Accessible large language models via k-bit quantization for PyTorch.
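The k-bit idea behind libraries like bitsandbytes can be illustrated with symmetric absmax scaling: map floats into the signed int8 range by the tensor's largest magnitude. A minimal NumPy sketch of the arithmetic (illustrative only, not the library's kernels or API):

```python
import numpy as np

def absmax_quantize_int8(w):
    """Symmetric absmax quantization: scale weights into [-127, 127]."""
    scale = np.max(np.abs(w)) / 127.0
    q = np.round(w / scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from int8 codes."""
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.0, 0.25, 0.0], dtype=np.float32)
q, scale = absmax_quantize_int8(w)
w_hat = dequantize(q, scale)  # round-trip error is at most scale / 2
```

Real schemes refine this with per-block scales and outlier handling, but the store-low-bit, rescale-on-use pattern is the same.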
Lightning-AI/lit-llama · Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.
AutoGPTQ/AutoGPTQ · An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
lucidrains/vector-quantize-pytorch · Vector (and scalar) quantization, in PyTorch.
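At its core, the vector quantization that this family of methods builds on is a nearest-codebook-entry lookup: encode each vector as the index of its closest codebook row, decode by reading that row back. A hypothetical toy sketch in NumPy (not the package's API):

```python
import numpy as np

def vq_encode(x, codebook):
    """Map each row of x to the index of its nearest codebook vector (L2)."""
    # (n, 1, d) - (1, k, d) -> (n, k) squared distances
    d2 = np.sum((x[:, None, :] - codebook[None, :, :]) ** 2, axis=-1)
    return np.argmin(d2, axis=1)

def vq_decode(indices, codebook):
    """Reconstruct vectors by codebook lookup."""
    return codebook[indices]

codebook = np.array([[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]], dtype=np.float32)
x = np.array([[0.9, 1.1], [0.1, -0.2]], dtype=np.float32)
idx = vq_encode(x, codebook)     # each input snaps to its nearest entry
x_hat = vq_decode(idx, codebook)
```

Trainable versions add a straight-through estimator and codebook-update losses on top of this lookup, which is where the PyTorch implementations earn their keep.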
mit-han-lab/llm-awq · [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
city96/ComfyUI-GGUF · GGUF quantization support for native ComfyUI models
thu-ml/SageAttention · [ICLR 2025, ICML 2025, NeurIPS 2025 Spotlight] Quantized attention that achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.
qwopqwop200/GPTQ-for-LLaMa · 4-bit quantization of LLaMA using GPTQ
turboderp/exllama · A more memory-efficient rewrite of the HF Transformers implementation of Llama for use with quantized weights.
pytorch/ao · PyTorch-native quantization and sparsity for training and inference
intel/neural-compressor · SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) and sparsity; leading model compression techniques for PyTorch, TensorFlow, and ONNX Runtime
quic/aimet · AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
NVIDIA/TensorRT-Model-Optimizer · A unified library of SOTA model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, and vLLM to optimize inference speed.
Efficient-ML/Awesome-Model-Quantization · A curated list of papers, docs, and code about model quantization. The repo aims to collect resources for model quantization research and is continuously improved; PRs adding works (papers, repositories) the list has missed are welcome.
casper-hansen/AutoAWQ · AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
microsoft/Olive · Olive: simplify ML model fine-tuning, conversion, quantization, and optimization for CPUs, GPUs, and NPUs.
IST-DASLab/gptq · Code for the ICLR 2023 paper "GPTQ: Accurate Post-Training Quantization of Generative Pretrained Transformers".
666DZY666/micronet · micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT) at high bit-widths (>2-bit, DoReFa / "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference") and low bit-widths (≤2-bit ternary/binary: TWN/BNN/XNOR-Net), plus 8-bit post-training quantization (PTQ, TensorRT); (2) pruning: normal, regular, and group-convolutional channel pruning; (3) group-convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT with fp32/fp16/int8 (PTQ calibration), op adaptation (upsample), and dynamic shapes.
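The batch-norm fusion that micronet lists is a standard pre-quantization step: fold BN's per-channel affine transform into the preceding convolution's weights and bias so only one op needs quantizing. A hypothetical NumPy sketch of the algebra (not micronet's code):

```python
import numpy as np

def fuse_conv_bn(W, b, gamma, beta, mean, var, eps=1e-5):
    """Fold a BatchNorm layer into the preceding layer's weights and bias.

    W: (out_ch, ...) weights; b: (out_ch,) bias.
    gamma, beta, mean, var: (out_ch,) BatchNorm parameters.
    """
    s = gamma / np.sqrt(var + eps)                    # per-channel scale
    W_f = W * s.reshape(-1, *([1] * (W.ndim - 1)))    # scale each output channel
    b_f = (b - mean) * s + beta
    return W_f, b_f

# toy check: layer-then-BN equals the fused layer on a per-channel linear map
W = np.array([[2.0], [0.5]]); b = np.array([1.0, -1.0])
gamma = np.array([1.5, 0.8]); beta = np.array([0.1, 0.2])
mean = np.array([0.5, 0.0]); var = np.array([4.0, 1.0])
x = np.array([3.0])
y_ref = gamma * ((W @ x + b - mean) / np.sqrt(var + 1e-5)) + beta
W_f, b_f = fuse_conv_bn(W, b, gamma, beta, mean, var)
y_fused = W_f @ x + b_f   # identical output, one op fewer to quantize
```

The same folding applies to conv kernels, since BN acts per output channel there too.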
TimmyOVO/deepseek-ocr.rs · Rust multi-backend OCR/VLM engine (DeepSeek-OCR-1/2, PaddleOCR-VL, DotsOCR) with DSQ quantization and an OpenAI-compatible server and CLI; runs locally without Python.