SMDL
(IJCAI 2019) Submodular Batch Selection for Training Deep Neural Networks
Mini-batch gradient descent based methods are the de facto algorithms for training neural network architectures today. We introduce a mini-batch selection strategy based on submodular function maximization. Our novel submodular formulation captures the informativeness of each sample and diversity of the whole subset. We design an efficient, greedy algorithm which can give high-quality solutions to this NP-hard combinatorial optimization problem. Our extensive experiments on standard datasets show that the deep models trained using the proposed batch selection strategy provide better generalization than Stochastic Gradient Descent as well as a popular baseline sampling strategy across different learning rates, batch sizes, and distance metrics.
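The idea of greedily picking a batch that is both informative and diverse can be illustrated with a small, self-contained sketch. This is not the objective or the code from the paper or from smdl.py. It assumes a generic monotone submodular score (a per-sample informativeness term plus a facility-location style coverage term), and the names `similarity`, `greedy_select`, and `diversity_weight` are hypothetical, introduced only for this example. Informativeness values are assumed to be non-negative.

```python
import math


def similarity(a, b):
    """Similarity in (0, 1] from Euclidean distance (an illustrative choice)."""
    dist = math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return 1.0 / (1.0 + dist)


def greedy_select(features, informativeness, batch_size, diversity_weight=1.0):
    """Greedily pick indices that maximize an assumed submodular score.

    Score of a set S (an assumption for this sketch, not the paper's formula):
        sum of informativeness over S
        + diversity_weight * sum over all points j of max similarity(j, S)
    """
    n = len(features)
    selected = []
    # coverage[j] = best similarity between point j and anything selected so far
    coverage = [0.0] * n

    for _ in range(min(batch_size, n)):
        best_idx, best_gain = None, -1.0
        for i in range(n):
            if i in selected:
                continue
            # Marginal gain in coverage if sample i were added to the batch.
            cov_gain = sum(
                max(similarity(features[i], features[j]) - coverage[j], 0.0)
                for j in range(n)
            )
            gain = informativeness[i] + diversity_weight * cov_gain
            if gain > best_gain:
                best_idx, best_gain = i, gain
        selected.append(best_idx)
        coverage = [
            max(coverage[j], similarity(features[best_idx], features[j]))
            for j in range(n)
        ]
    return selected


# Two tight clusters; the first cluster has the more informative samples.
features = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
informativeness = [1.0, 0.9, 0.2, 0.1]

print(greedy_select(features, informativeness, batch_size=2))
print(greedy_select(features, informativeness, batch_size=2, diversity_weight=0.0))
```

With the diversity term enabled, the second pick comes from the other cluster, while setting `diversity_weight=0.0` reduces the selection to ranking by informativeness alone. For monotone submodular objectives of this kind, the greedy procedure is the standard way to get an approximate solution without enumerating all subsets.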
Setup
Python: 2.7.6, PyTorch 0.4.1
Clone the repository and install dependencies from requirements.txt
git clone https://github.com/VamshiTeja/SMDL
cd SMDL
pip install -r requirements.txt
Run
All the configurations are in a single place: ./config/smdl.yml
Modify the configurations as needed, then run smdl.py:
python smdl.py
The code is well documented and should be straightforward to follow. If you run into trouble, please raise an issue here and we will respond promptly.
Citation
@inproceedings{joseph2019Submodular,
title = {Submodular Batch Selection for Training Deep Neural Networks},
author={K J, Joseph and Vamshi Teja, R and Krishnakant, Singh and Vineeth, N Balasubramanian},
booktitle={Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI, Macao, China.},
organization={International Joint Conferences on Artificial Intelligence Organization},
year={2019}}
