LightRelu
Customized PyTorch implementation of LiSHT (linear scaled hyperbolic tangent) activation function for deep learning
Install / Use
/learn @lessw2020/LightReluREADME
LightRelu
Customized PyTorch implementation of LiSHT (linear scaled hyperbolic tangent) activation function for deep learning, with mean shift and clamping.
Original paper here:
#LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks https://arxiv.org/abs/1901.05894
Activation map comparison: <img src='images/lisht-activation-curve.jpg' width=50% height=50% />
MNIST - Relu vs Lisht: <img src='images/mnist-activation-compare.jpg' width=50% height=50% />
LightRelu = customized LiSHT in PyTorch, with mean shift and clamp:
I implemented using Pytorch and wrapped it with a clamp and mean shift.(.46 and 7.5).
More testing in progress, but so far looks very promising!
Note - cut your learning rates in half vs ReLU, it learns very rapidly.
Comparisons of LightRelu vs ReLU and General Relu
(GeneralRelu is an upcoming Relu with leakiness, mean shift and clamp):
ReLU:
<img src='images/means-stds-relu.jpg' width=70% height=70% />LightRelU:
<img src='images/means-stds-lightrelu.jpg' width=70% height=70% />Histogram of activations (smoother is better) - General ReLU vs LightRelu...and in last place, ReLU:
GeneralReLU:
<img src='images/general-relu-histo.jpg' width=80% height=80% />LightRelU:
<img src='images/lightrelu-histo.jpg' width=80% height=80% />ReLU:
<img src='images/relu-histo.jpg' width=80% height=80% />Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
groundhog
400Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
last30days-skill
19.1kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
