LightRelu

Customized PyTorch implementation of LiSHT (linear scaled hyperbolic tangent) activation function for deep learning, with mean shift and clamping.

Original paper here:

LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks https://arxiv.org/abs/1901.05894

Activation map comparison: <img src='images/lisht-activation-curve.jpg' width=50% height=50% />

MNIST - Relu vs Lisht: <img src='images/mnist-activation-compare.jpg' width=50% height=50% />

LightRelu = customized LiSHT in PyTorch, with mean shift and clamp:

I implemented LiSHT in PyTorch and wrapped it with a clamp and a mean shift (.46 and 7.5).
More testing is in progress, but so far it looks very promising!
Note: cut your learning rate in half relative to ReLU, because it learns very rapidly.
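
For readers who want to see the shape of the idea, here is a minimal sketch of such a module. It is not necessarily the code in this repo. LiSHT itself is `x * tanh(x)`, per the paper. Everything else is an assumption made for illustration: that 0.46 is the mean shift subtracted from the output, that 7.5 is the upper clamp, and the parameter names `sub` and `maxv`.

```python
import torch
import torch.nn as nn


class LightRelu(nn.Module):
    """LiSHT (x * tanh(x)) followed by an optional mean shift and upper clamp.

    Assumption: `sub` (0.46) is subtracted from the output and `maxv` (7.5)
    caps it. The README gives the two numbers but not this exact mapping,
    and the parameter names are hypothetical.
    """

    def __init__(self, sub=0.46, maxv=7.5):
        super().__init__()
        self.sub = sub
        self.maxv = maxv

    def forward(self, x):
        out = x * torch.tanh(x)  # LiSHT: non-negative and symmetric in x
        if self.sub is not None:
            out = out - self.sub  # mean shift
        if self.maxv is not None:
            out = out.clamp(max=self.maxv)  # upper clamp
        return out


if __name__ == "__main__":
    act = LightRelu()
    x = torch.tensor([-10.0, -1.0, 0.0, 1.0, 10.0])
    print(act(x))
```

Under these assumptions the module can be dropped in wherever `nn.ReLU()` would go, for example `nn.Sequential(nn.Linear(784, 50), LightRelu(), nn.Linear(50, 10))`. Because `x * tanh(x)` is symmetric, large negative inputs produce large positive activations, which the clamp then caps.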

Comparisons of LightRelu vs ReLU and GeneralRelu

(GeneralRelu is an upcoming ReLU variant with leakiness, mean shift, and clamp):

ReLU:

LightRelu:

Histogram of activations (smoother is better): GeneralRelu vs LightRelu, and in last place, ReLU:

GeneralRelu:

LightRelu:

ReLU:
