SkillAgentSearch skills...

Mish

Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]

Install / Use

/learn @digantamisra98/Mish

README

<h1 align="center">Mish: Self Regularized <br> Non-Monotonic Activation Function</h1> <p align="center"> <a href="LICENSE" alt="License"> <img src="https://img.shields.io/badge/License-MIT-brightgreen.svg" /></a> <a href="https://arxiv.org/abs/1908.08681v3" alt="ArXiv"> <img src="https://img.shields.io/badge/Paper-arXiv-blue.svg" /></a> <a href="https://scholar.googleusercontent.com/scholar.bib?q=info:j0C1gbodjP4J:scholar.google.com/&output=citation&scisdr=CgX0hbDMEOzUo74J6TM:AAGBfm0AAAAAX1QM8TNcu4tND6FEofKsXzM3cs1uCAAW&scisig=AAGBfm0AAAAAX1QM8Y5elaJ1IW-BKOuU1zFTYNp-QaNQ&scisf=4&ct=citation&cd=-1&hl=en" alt="Cite"> <img src="https://img.shields.io/badge/Cite-BibTex-blue.svg" /></a> <a href=" " alt="Citations"> <img src="https://img.shields.io/badge/Google Scholar-2855-lightgrey.svg" /></a> <a href="https://www.bmvc2020-conference.com/conference/papers/paper_0928.html" alt="Publication"> <img src="https://img.shields.io/badge/BMVC-2020-red.svg" /></a> <a href="https://console.paperspace.com/github/digantamisra98/Mish/blob/master/Layers_Acc.ipynb"> <img src="https://assets.paperspace.io/img/gradient-badge.svg" alt="Run on Gradient"/> </a> </p> <p align="center">BMVC 2020 <a href="https://www.bmvc2020-conference.com/assets/papers/0928.pdf" target="_blank">(Official Paper)</a></p> <br> <br> <details> <summary>Notes: (Click to expand)</summary>
  • A considerably faster version based on CUDA can be found here - Mish CUDA (All credits to Thomas Brandon for the same)
  • Memory Efficient Experimental version of Mish can be found here
  • Faster variants for Mish and H-Mish by Yashas Samaga can be found here - ConvolutionBuildingBlocks
  • Alternative (experimental improved) variant of H-Mish developed by Páll Haraldsson can be found here - H-Mish (Available in Julia)
  • Variance based initialization method for Mish (experimental) by Federico Andres Lois can be found here - Mish_init
</details> <details> <summary>Changelogs/ Updates: (Click to expand)</summary> </details>

News/ Media Coverage:

<p float="center"> &emsp; &emsp; <a href="https://podcasts.apple.com/hu/podcast/mish-activation-function-with-diganta-misra-007/id1490681799?i=1000464407163" alt="Apple Podcasts"> <img src="podcast_logo/applepodcasts.png" width="150"/></a> <a href="https://open.spotify.com/episode/4sT9sxjSbAKtvJ6hTFg9zc" alt="Spotify"> <img src="https://github.com/digantamisra98/Mish/blob/master/podcast_logo/spotify.png" width="150"/></a> </p>
  • (02/2020): Talk on Mish and Non-Linear Dynamics at Sicara is out now. Watch on:
<p float="center"> &emsp; &emsp; <a href="https://youtu.be/T2CRFROKcLM" alt="YouTube"> <img src="podcast_logo/yt1.png" width="100"/></a> </p>
  • (07/2020): CROWN: A comparison of morphology for Mish, Swish and ReLU produced in collaboration with Javier Ideami. Watch on:
<p float="center"> &emsp; &emsp; <a href="https://www.youtube.com/watch?v=XRGu23hfzaQ" alt="YouTube"> <img src="podcast_logo/yt1.png" width="100"/></a> </p> <p float="center"> &emsp; &emsp; <a href="https://youtu.be/whOdg-yrgdI" alt="YouTube"> <img src="podcast_logo/yt1.png" width="100"/></a> </p>
  • (12/2020): Talk on From Smooth Activations to Robustness to Catastrophic Forgetting at Weights & Biases Salon is out now. Watch on:
<p float="center"> &emsp; &emsp; <a href="https://www.youtube.com/watch?v=1U-7TWysqIg" alt="YouTube"> <img src="podcast_logo/yt1.png" width="100"/></a> <br> </p> <br> <details> <summary><a href="https://dlrl.ca/"><b>MILA/ CIFAR 2020 DLRLSS</b></a> (Click on arrow to view)</summary> <div style="text-align:center"><img src ="poster_landscape-1.png" width="1000"/></div> </details> <br> <details> <summary><b>Contents</b>: (Click to expand)</summary>
  1. Mish <br> a. Loss landscape
  2. ImageNet Scores
  3. MS-COCO
  4. Variation of Parameter Comparison<br> a. MNIST<br> b. CIFAR10<br>
  5. Significance Level <br>
  6. Results<br> a. Summary of Results (Vision Tasks)<br> b. [Summary of Results (Language Tasks)](https://github.com/digantamisra98/Mish#summary-of-results-language-tas
View on GitHub
GitHub Stars1.3k
CategoryEducation
Updated2d ago
Forks128

Languages

Jupyter Notebook

Security Score

100/100

Audited on Apr 4, 2026

No findings