SkillAgentSearch skills...

Hedgehog

Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"

Install / Use

/learn @kyegomez/Hedgehog

README

Multi-Modality

HedgeHog

Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry". This paper implements MLPs to mimic the softmax of a transformer. Suppodesly hits SOTA on wikitext for sub quadratic models. I've too been thinking about replacing softmax with MLPs. This past month we saw doezens of papers on mamba and convolutions but MLPs might have undiscovered powers.

License

MIT

Related Skills

View on GitHub
GitHub Stars15
CategoryDevelopment
Updated2mo ago
Forks0

Languages

Python

Security Score

95/100

Audited on Feb 4, 2026

No findings