115 skills found · Page 1 of 4
openai / Sparse Attention - Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
svg-project / Sparse VideoGen - [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
Haiyang-W / DSVT - [CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
cschenxiang / DRSformer - Learning A Sparse Transformer Network for Effective Image Deraining (CVPR 2023)
thu-ml / SLA - SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
microsoft / Swin3D - A shift-window based transformer for 3D sparse tasks
VITA-Group / SLaK - [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers than Transformers for ConvNets?"
lucidrains / Sinkhorn Transformer - Practical implementation of Sparse Sinkhorn Attention
DerrickXuNu / CoBEVT - [CoRL2022] CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
microsoft / SwinBERT - Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
NimbleEdge / Sparse Transformers - Sparse inference for transformer-based LLMs
facebookresearch / Mixture Of Transformers - Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
ThomasVonWu / SparseEnd2End - Open-sourced end-to-end perception deployment solution based on the vision sparse transformer paradigm.
joshyZhou / AST - Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration
JIA-Lab-research / SparseTransformer - A fast and memory-efficient library for sparse transformers with varying token counts (e.g., 3D point clouds).
Ephemeral182 / UDR S2Former Deraining - [ICCV'23] Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks
hihihihiwsf / AST - Adversarial Sparse Transformer for Time Series Forecasting
kyegomez / SwitchTransformers - Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"
sharc-lab / Edge MoE - Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
kyegomez / SparseAttention - PyTorch implementation of the sparse attention from the paper "Generating Long Sequences with Sparse Transformers" (see the sketch after this list)
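
Several entries above (openai / Sparse Attention, kyegomez / SparseAttention) reference the strided attention pattern from "Generating Long Sequences with Sparse Transformers". Below is a minimal PyTorch sketch of that pattern, assuming a plain masked softmax attention; the function and parameter names are illustrative and do not come from any listed repository, and a dense boolean mask like this only shows the connectivity pattern, not the memory savings of the papers' blocked kernels.

import torch
import torch.nn.functional as F

def strided_sparse_mask(seq_len: int, stride: int) -> torch.Tensor:
    # Boolean mask: query i may attend to key j if j <= i (causal) and
    # either (i - j) < stride (local window) or (i - j) % stride == 0 (strided hops).
    i = torch.arange(seq_len).unsqueeze(1)  # query positions, shape (T, 1)
    j = torch.arange(seq_len).unsqueeze(0)  # key positions, shape (1, T)
    causal = j <= i
    local = (i - j) < stride
    strided = (i - j) % stride == 0
    return causal & (local | strided)

def sparse_attention(q, k, v, stride: int = 16):
    # q, k, v: (batch, heads, seq_len, head_dim). Disallowed pairs are masked to -inf
    # before the softmax; the dense mask keeps O(T^2) cost, so this illustrates the
    # pattern only and is not an efficient sparse kernel.
    seq_len, head_dim = q.shape[-2], q.shape[-1]
    scores = q @ k.transpose(-2, -1) / head_dim ** 0.5
    mask = strided_sparse_mask(seq_len, stride).to(q.device)
    scores = scores.masked_fill(~mask, float("-inf"))
    return F.softmax(scores, dim=-1) @ v

# Tiny usage example (hypothetical shapes)
q = k = v = torch.randn(1, 2, 64, 32)
out = sparse_attention(q, k, v, stride=8)
print(out.shape)  # torch.Size([1, 2, 64, 32])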