4 skills found
intel / Neural Compressor: SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) and sparsity; leading model compression techniques for PyTorch, TensorFlow, and ONNX Runtime
mit-han-lab / SmoothQuant: [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
ModelTC / LightCompress: [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models, including LLMs, VLMs, and video generative models
AniZpZ / AutoSmoothQuant: An easy-to-use package for implementing SmoothQuant for LLMs
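Two of the repositories above implement SmoothQuant, whose core idea is to migrate activation outlier magnitude into the weights via a per-channel scale before quantization. The toy sketch below (with made-up tensor sizes and a synthetic outlier channel; not code from either repository) shows the mathematically equivalent rescaling: dividing activations by a scale `s` and folding `s` into the weights leaves the matmul output unchanged while shrinking the activation range.

```python
import numpy as np

# Toy illustration of SmoothQuant's scale migration (hypothetical sizes).
# Per input channel j: s_j = max|X_j|^alpha / max|W_j|^(1 - alpha).
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))    # activations: tokens x channels
X[:, 3] *= 50.0                # inject an outlier activation channel
W = rng.normal(size=(8, 16))   # weights: channels x output features
alpha = 0.5                    # migration strength (0.5 is the paper's default)

s = np.abs(X).max(axis=0) ** alpha / np.abs(W).max(axis=1) ** (1 - alpha)
X_smooth = X / s               # scale divided out of activations...
W_smooth = W * s[:, None]      # ...and folded into the weights

# The linear layer's output is preserved exactly,
# but the activation range is now much easier to quantize.
assert np.allclose(X @ W, X_smooth @ W_smooth)
print(np.abs(X).max(), np.abs(X_smooth).max())
```

The equivalence holds because `(X / s) @ (diag(s) @ W) = X @ W`; quantization error then drops because per-tensor activation scales no longer have to cover the outlier channel.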