Results for "spmm"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

40 skills found · Page 1 of 2

hgyhungry / Ge Spmm

113

No description available

universal

Updated 16d ago

linghaosong / Sextans

An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).

universal

Updated 1mo ago

ParCIS / Magicube

Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

zed

Updated 29d ago

owensgroup / Merge Spmm

Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018

universal

Updated 7mo ago

HPMLL / DTC SpMM ASPLOS24

No description available

universal

Updated 1mo ago

jinhojsk515 / SPMM

[Nat. Comm. 2024] Multimodal learning for chemical domain, with SMILES and properties.

universal

Updated 6d ago

ParCIS / FlashSparse

FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swap-and-Transpose mapping strategy. FlashSparse is accepted by PPoPP 2025.

universal

Updated 8d ago

spcl / Smat

Code for High Performance Unstructured SpMM Computation Using Tensor Cores

universal

Updated 7d ago

HipGraph / FusedMM

Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"

universal

fused-kernelgeneral-purpose-librarygraph-embedding+6

Updated 10mo ago

pnnl / S Blas

This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Triangular-Solve (SpTRSV), Sparse-Matrix-Transposition (SpTrans) and Sparse-Matrix-Matrix-Multiplication (SpMM) for Single-node Multi-GPU (scale-up) platforms such as NVIDIA DGX-1 and DGX-2.

universal

Updated 19d ago

lsq314 / SpMM TCAD

No description available

universal

Updated 10d ago

ddps-lab / Dos

Dense or Sparse : Optimal SPMM-as-a-Service for Big-Data Processing

universal

Updated 10mo ago

pku-liang / Hlcd Spmm Project

Course Project for High Level Chip Design （高层次芯片设计）

universal

coursehardware-designs

Updated 1mo ago

YusukeNagasaka / Batched SpMM

New batched algorithm for sparse matrix-matrix multiplication (SpMM)

universal

Updated 1y ago

loveSunning / FastCuda

FastCuda is a handwritten CUDA operator library featuring progressive GEMM and Reduce kernels, cuBLAS benchmarking, and C/C++/Python interfaces for learning, profiling, and performance optimization.

universal

cudacflash-attentionhgemm+7

Updated 14d ago