40 skills found · Page 1 of 2
hgyhungry / Ge SpmmNo description available
linghaosong / SextansAn FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).
ParCIS / MagicubeMagicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
owensgroup / Merge SpmmCode for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
HPMLL / DTC SpMM ASPLOS24No description available
jinhojsk515 / SPMM[Nat. Comm. 2024] Multimodal learning for chemical domain, with SMILES and properties.
ParCIS / FlashSparseFlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swap-and-Transpose mapping strategy. FlashSparse is accepted by PPoPP 2025.
spcl / SmatCode for High Performance Unstructured SpMM Computation Using Tensor Cores
HipGraph / FusedMMImplementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"
pnnl / S BlasThis package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Triangular-Solve (SpTRSV), Sparse-Matrix-Transposition (SpTrans) and Sparse-Matrix-Matrix-Multiplication (SpMM) for Single-node Multi-GPU (scale-up) platforms such as NVIDIA DGX-1 and DGX-2.
lsq314 / SpMM TCADNo description available
ddps-lab / DosDense or Sparse : Optimal SPMM-as-a-Service for Big-Data Processing
pku-liang / Hlcd Spmm ProjectCourse Project for High Level Chip Design (高层次芯片设计)
YusukeNagasaka / Batched SpMMNew batched algorithm for sparse matrix-matrix multiplication (SpMM)
loveSunning / FastCudaFastCuda is a handwritten CUDA operator library featuring progressive GEMM and Reduce kernels, cuBLAS benchmarking, and C/C++/Python interfaces for learning, profiling, and performance optimization.
Hyaloid / AccSpMMOfficial implementation of Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores.
ddps-lab / Spark Spmm ComputeNo description available
Guangxuan-Xiao / SPMM CUDANo description available
ZJU-DAILY / HC SpMMNo description available
YaqiXia / Voltrix SpMMRevitalizing Sparse Matrix-Matrix Multiplication on Tensor Cores with Asynchronous and Balanced Kernel Optimization