2 skills found
kyegomez / MultiQueryAttentionThis is a simple torch implementation of the high performance Multi-Query Attention
kyegomez / MGQAThe open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints"