2,347 skills found · Page 1 of 79
Dao-AILab / Flash AttentionFast and memory-efficient exact attention
xmu-xiaoma666 / External Attention Pytorch🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
jadore801120 / Attention Is All You Need PytorchA PyTorch implementation of the Transformer model in "Attention is All You Need".
cmhungsteve / Awesome Transformer AttentionAn ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
fla-org / Flash Linear Attention🚀 Efficient implementations for emerging model architectures
thu-ml / SageAttention[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
MoonshotAI / Attention ResidualsNo description available
MenghaoGuo / Awesome Vision AttentionsSummary of related papers on visual attention. Related code will be released based on Jittor gradually.
philipperemy / Keras AttentionKeras Attention Layer (Luong and Bahdanau scores).
heykeetae / Self Attention GANPytorch implementation of Self-Attention Generative Adversarial Networks (SAGAN)
Jongchan / Attention ModuleOfficial PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"
ozan-oktay / Attention Gated NetworksUse of Attention Gates in a Convolutional Neural Network / Medical Image Classification and Segmentation
openai / Sparse AttentionExamples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
peteanderson80 / Bottom Up AttentionBottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
szagoruyko / Attention TransferImproving Convolutional Networks via Attention Transfer (ICLR 2017)
bojone / Attentionsome attention implements
wouterkool / Attention Learn To RouteAttention based model for learning to solve different routing problems
bloc97 / CrossAttentionControlUnofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
pprp / Awesome Attention Mechanism In CvAwesome List of Attention Modules and Plug&Play Modules in Computer Vision
mjun0812 / Flash Attention Prebuild WheelsProvide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions