Multibranch
Codes for the paper "Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex"
Install / Use
/learn @hongyanz/MultibranchREADME
Deep-Neural-Networks-with-Multi-Branch-Architectures-Are-Less-Non-Convex
This is the code for the paper Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex in AISTATS 2019 by Hongyang Zhang, Junru Shao, and Ruslan Salakhutdinov.
The code is written in python and requires numpy, torch, torchvision and the tqdm library.
Install
This code depends on python 3.6, pytorch 0.3.1 and numpy. We suggest to install the dependencies using Anaconda or Miniconda. Here is an exemplary command:
$ wget https://repo.anaconda.com/archive/Anaconda3-5.1.0-Linux-x86_64.sh
$ bash Anaconda3-5.1.0-Linux-x86_64.sh
$ source ~/.bashrc
$ conda install pytorch=0.3.1
Get started
To get started, cd into the directory. Then run the scripts:
- fully_connected.py is a demo on plotting the landscape of multi-branch neural network where each sub-network is a full-connected network with ReLU activation functions and the dataset is synthetic.
- VGG.py is for running the multi-branch neural network based on VGG-9 on CIFAR-10 dataset.
Using the code
The command python fully_connected.py --help gives the help information about how to run the code that produces landscape, and python VGG.py --help explains how to run the multi-branch neural network based on VGG-9.
Running Example
<p align="center"> <img src="example.png" width="800"\> </p>Reference
For technical details and full experimental results, see the paper.
@inproceedings{Zhang2019deep,
author = {Hongyang Zhang and Junru Shao and Ruslan Salakhutdinov},
title = {Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex},
booktitle = {International Conference on Artificial Intelligence and Statistics},
year = {2019}
}
Contact
Please contact hongyanz@cs.cmu.edu if you have any question on the codes.
Related Skills
proje
Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
flutter-tutor
Flutter Learning Tutor Guide You are a friendly computer science tutor specializing in Flutter development. Your role is to guide the student through learning Flutter step by step, not to provide d
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
