138 repositories found · Page 1 of 5
RangiLyu / Nanodet: NanoDet-Plus ⚡ Super fast and lightweight anchor-free object detection model. 🔥 Only 980 KB (int8) / 1.8 MB (fp16), runs at 97 FPS on a cellphone 🔥
Lightning-AI / Lit Llama: Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, INT8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.
PINTO0309 / PINTO Model Zoo: A repository of models inter-converted between various frameworks. Supported frameworks: TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TF-TRT, TensorFlow Lite (Float32/16/INT8), EdgeTPU, CoreML.
intel / Neural Compressor: SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) and sparsity; leading model-compression techniques for PyTorch, TensorFlow, and ONNX Runtime.
ppogg / YOLOv5 Lite: 🍅 YOLOv5-Lite: evolved from YOLOv5; the model is only 900+ KB (int8) and 1.7 MB (fp16). Reaches 15 FPS on a Raspberry Pi 4B.
666DZY666 / Micronet: micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa; "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference") and low-bit (≤2b)/ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, regular, and group-convolution channel pruning; (3) group-convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT fp32/fp16/int8 (PTQ calibration), op adaptation (upsample), dynamic shape.
RWKV / Rwkv.cpp: INT4/INT5/INT8 and FP16 inference on CPU for the RWKV language model.
CaoWGG / TensorRT CenterNet: TensorRT 5, CenterNet, CenterFace, deformable convolution, INT8.
grimoire / Mmdetection To Tensorrt: Convert MMDetection models to TensorRT; supports FP16, INT8, batched input, dynamic shapes, etc.
DerryHub / BEVFormer Tensorrt: BEVFormer inference on TensorRT, including INT8 quantization and custom TensorRT plugins (float/half/half2/int8).
BUG1989 / Caffe Int8 Convert Tools: Generates a quantization-parameter file for INT8 inference with the ncnn framework.
intel / Neural Speed: An innovative library for efficient LLM inference via low-bit quantization.
AlexeyAB / Yolo2 Light: Light version of the YOLO v3 & v2 convolutional neural networks for object detection, with minimal dependencies (INT8 inference, BIT1-XNOR inference).
yaof20 / Flash RL: Implementation of FP8/INT8 rollout for RL training without a performance drop.
clancylian / Retinaface: Reimplementation of RetinaFace in C++ and TensorRT.
PINTO0309 / Tflite2tensorflow: Generates saved_model, TFJS, TF-TRT, EdgeTPU, CoreML, quantized TFLite, ONNX, OpenVINO, Myriad Inference Engine blob, and .pb files from a .tflite file. Supports building environments with Docker, and can directly access the host PC's GUI and camera to verify operation. NVIDIA GPU (dGPU) and Intel iHD GPU (iGPU) support. Supports inverse quantization of INT8-quantized models.
dseditor / QwenASRMiniTool: A lightweight QwenASR tool based on OpenVINO INT8 weights, for real-time speech recognition and subtitle conversion.
TNTWEN / OpenVINO YOLOV4: Implementation of YOLOv4, YOLOv4-relu, YOLOv4-tiny, YOLOv4-tiny-3l, Scaled-YOLOv4, and INT8 quantization in OpenVINO 2021.3.
Wulingtian / Yolov5 Tensorrt Int8 Tools: TensorRT INT8 quantization for YOLOv5 ONNX models.
maggiez0138 / Swin Transformer TensorRT: Explores deployment of Swin Transformer on TensorRT, including FP16 and INT8 test results.
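The common thread in the entries above is INT8 quantization. As a minimal sketch of the core idea (not taken from any listed repository), symmetric per-tensor quantization maps float values to 8-bit integers through a single scale factor, trading a small, bounded reconstruction error for a ~4x size reduction versus fp32:

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    """Symmetric per-tensor INT8 quantization: map floats to [-127, 127]."""
    scale = np.max(np.abs(x)) / 127.0  # one scale for the whole tensor
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximation of the original float tensor."""
    return q.astype(np.float32) * scale

x = np.array([0.1, -0.5, 2.0, -1.9], dtype=np.float32)
q, s = quantize_int8(x)
x_hat = dequantize_int8(q, s)
# Round-to-nearest bounds the per-element error by scale / 2.
```

Real toolchains (TensorRT PTQ calibration, OpenVINO, ncnn, Neural Compressor) refine this with per-channel scales, calibration datasets, and asymmetric zero-points, but the scale-and-round mapping is the same primitive.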