Results for "cutlass"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

62 skills found · Page 1 of 3

NVIDIA / Cutlass

9.5k

CUDA Templates and Python DSLs for High-Performance Linear Algebra

universal

cppcudadeep-learning+4

Updated 1h ago

bytedance / Flux

1.3k

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

universal

cudacutlassgpu+1

Updated 2d ago

NVlabs / Vibetensor

612

Our first fully AI generated deep learning system

universal

cudacutlassmachine-learning+2

Updated 9h ago

66RING / Tiny Flash Attention

497

flash attention tutorial written in python, triton, cuda, cutlass

universal

Updated 20h ago

coderonion / Awesome Cuda And Hpc

462

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

universal

awesomeblascublas+17

Updated 2d ago

DD-DuDa / Cute Learning

272

Examples of CUDA implementations by Cutlass CuTe

universal

cudacutlassgpu

Updated 4d ago

ColfaxResearch / Cutlass Kernels

261

No description available

universal

Updated 10d ago

MekkCyber / CutlassAcademy

254

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

universal

Updated 3d ago

gbprod / Cutlass.nvim

226

Plugin that adds a 'cut' operation separate from 'delete'

universal

neovimneovim-pluginnvim+1

Updated 2d ago

ArthurinRUC / Cutlass Notes

192

From Minimal GEMM to Everything

universal

Updated 1d ago

IST-DASLab / Qutlass

174

QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

zed

blackwellcudapost-training-quantization+1

Updated 2d ago

zach-adams / Cutlass Wp Theme

163

Cutlass is a Wordpress Starter Theme that incorporates the power of Laravel's Blade to make theme development even quicker and easier then before - http://cutlasswp.com

universal

Updated 1y ago

tgale96 / Grouped Gemm

147

PyTorch bindings for CUTLASS grouped GEMM.

universal

Updated 3d ago

leimao / CUTLASS Examples

134

CUTLASS and CuTe Examples

universal

cudacutlassdocker

Updated 6d ago

tlc-pack / Cutlass FpA IntB Gemm

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

zed

Updated 10d ago

weishengying / Cutlass Flash Atten Fp8

使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention

universal

Updated 14h ago

psmarter / CUDA Practice

CUDA编程练习项目-Hands-on CUDA kernels and performance optimization, covering GEMM, FlashAttention, Tensor Cores, CUTLASS, quantization, KV cache, NCCL, and profiling.

universal

cudacuda-kernelscutlass+12

Updated 2h ago

flashinfer-ai / Cutlass Viz

No description available

universal

Updated 4mo ago

weishengying / Tiny Flash Attention

使用 cutlass 实现 flash-attention 精简版，具有教学意义

universal

Updated 10d ago

andrewarrow / Cutlass

swiss army knife for generating fcpxml files

universal

fcpfcpxmlfinal-cut-pro

Updated 2d ago