MHE

Implementation for <Learning towards Minimum Hyperspherical Energy> in NIPS'18.

Generate Convert Improve

Install / Use

/learn @wy1iu/MHE

About this skill

Quality Score

0/100

README

Learning towards Minimum Hyperspherical Energy

By Weiyang Liu*, Rongmei Lin*, Zhen Liu*, Lixin Liu*, Zhiding Yu, Bo Dai, Le Song

License

MHE and SphereFace+ are released under the MIT License (refer to the LICENSE file for details).

Introduction
Citation
Requirements
Usage
Results
SphereFace+ for face recognition
Generative Adversarial Networks with MHE
Note

Introduction

The repository contains the tensorflow implementation of Minimum Hyperspherical Energy (MHE) and Caffe implementation of SphereFace+. Inspired by a well-known Physics problem (Thomson problem), MHE is formulated as an energy on a hypersphere. Minimizing it can achieve better generalization in various applications. In general, MHE serves as a simple plug-in regularization for neural networks, which enchances the neuron diversity on hyperspheres (i.e., angular diversity) and is able to consistently improve the network's generalization ability in various application. Our paper is published in NIPS 2018 and is available at arXiv.

Citation

If you find our work useful in your research, please consider to cite:

@article{LiuNIPS18,
  title={Learning towards Minimum Hyperspherical Energy},
  author={Liu, Weiyang and Lin, Rongmei and Liu, Zhen and Liu, Lixin and Yu, Zhiding and Dai, Bo and Song, Le},
  journal={NIPS},
  year={2018}
  }

Requirements

Python 3.6 (Python 2.7 needs to modify the unpickle funtion in train.py)
TensorFlow
Keras (for MHE-GAN)
numpy

Usage

Part 1: Clone the repositary

Clone the repositary.

git clone https://github.com/wy1iu/MHE.git

Part 2: Download CIFAR-100 training and testing data

For the current code, we use our own data pre-processing code, and it is the standard pre-processing for CIFAR-100. The training and testing data can be downloaded via Google Drive. Users should put the downloaded data to the MHE_ROOT directory.

Part 3: CIFAR-100

Train and test the model with the following code:

cd $MHE_ROOT/code
python train.py --model_name=mhe --power_s=0

The parameters include model_name: [baseline, mhe, half_mhe] and power_s: [0, 1, 2, a0, a1, a2]. mhe denotes the full-space MHE regularization, half_mhe represents the half-space MHE, and power_s is the parameter s (see equation 1 in the paper) to control the formulation of MHE (0,1,2 are the Euclidean distance, and a0,a1,a2 use the angles).
For different datasets, you may need adjuest to hyperparameter for the entire MHE regularizaiton.

Results

See the log folder for the expected results.

SphereFace-plus

SphereFace+ is an improved deep face recognition algorithm based on SphereFace. We incorporate the MHE regularization to the original SphereFace and further boost the recognition accuracy. SphereFace+ is implemented in Caffe and therefore is independently host here. An PyTorch version is available in OpenSphere.

MHE-GAN

To train a model from scratch, simply run python train.py. To test the best models we provide, run python test.py.

For the model trained with both MHE (magnitude 1e-7) and spectral normalization (located in MHE-GAN/sn-model), we obtained the inception score of 7.68, which is much higher than we originally reported in the paper. In contrast, the same GAN with only spectral normalization yields 7.4 inception score and the baseline GAN only achieves 6.64.

For the model trained with only MHE (magnitude 1e-6, located in MHE-GAN/non-sn-model), we obtained the inception score of 7.48, which is also much higher than we originally reported in the paper.

The weight files of the MHE+SN model with inception score 7.68 and the MHE-only model with inception score 7.48 are provided at Google Drive. Please place the model files under the snapshots folder in the corresponding experiment folders.

The following table summarizes the results:

| Model | Inception Score | Weights | | ---------- | :-----------: | :-----------: | | Baseline GAN | 6.64 | N/A | | GAN w/ Spectral Norm |7.40 | N/A | | GAN w/ MHE | 7.48 | Google Drive | | GAN w/ Spectral Norm & MHE | 7.68 | Google Drive |

Some random samples of the generated images from MHE-GAN (MHE+SN) is shown as follows:

Note

The code was tested on tensorflow of versions >= 1.10. For earlier versions, the code for computing inception score might report wrong inception scores due to bugs in tf.contrib.gan.
Surprisingly, we found that using Titan Xp to run our code gives better performance on CIFAR-10 than using GTX 1080. Note that, the experiments in the paper are mostly run by Titan Xp.

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

last30days-skill

13.8k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

000-main-rules

Project Context - Name: Interactive Developer Portfolio - Stack: Next.js (App Router), TypeScript, React, Tailwind CSS, Three.js - Architecture: Component-driven UI with a strict separation of conce

wy1iu

View profile

View on GitHub

GitHub Stars81

CategoryEducation

Updated10mo ago

Forks7

wy1iu/MHE

Languages

Python

Security Score

92/100

Audited on May 26, 2025

No findings

MHE