EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization
Official PyTorch implementation of "EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization"
<p align="center"><img src='resources/EvoGradDiagram.png' width=600></p>

Summary
Gradient-based meta-learning and hyperparameter optimization have seen significant progress recently, enabling practical end-to-end training of neural networks together with many hyperparameters. Nevertheless, existing approaches are relatively expensive, as they need to compute second-order derivatives and store a longer computational graph. This cost prevents scaling them to larger network architectures. We present EvoGrad, a new approach to meta-learning that draws upon evolutionary techniques to more efficiently compute hypergradients. EvoGrad estimates hypergradients with respect to hyperparameters without calculating second-order gradients or storing a longer computational graph, leading to significant improvements in efficiency. We evaluate EvoGrad on three substantial recent meta-learning applications, namely cross-domain few-shot learning with feature-wise transformations, noisy label learning with Meta-Weight-Net, and low-resource cross-lingual learning with meta representation transformation. The results show that EvoGrad significantly improves efficiency and enables scaling meta-learning to bigger architectures, for example from ResNet10 to ResNet34.
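To illustrate the core idea, here is a minimal, hedged sketch (not the authors' code) of how an EvoGrad-style hypergradient can be computed: sample a few random perturbations of the model weights, softmax-combine them according to their training fitness, and backpropagate the validation loss through that combination to the hyperparameters. The function name, signatures, and default values below are illustrative assumptions.

```python
# Sketch of an EvoGrad-style hypergradient estimate (illustrative, not the
# official implementation). No second-order gradients are needed: the
# hyperparameters only enter through the training losses, which weight the
# candidate parameters via a softmax.
import torch

def evograd_hypergrad(model_params, hyperparams, train_loss_fn, val_loss_fn,
                      k=2, sigma=0.01, temperature=1.0):
    """model_params: list of weight tensors (treated as fixed here);
    hyperparams: tensor with requires_grad=True;
    train_loss_fn(params, hyperparams) -> scalar loss;
    val_loss_fn(params) -> scalar loss."""
    candidates, losses = [], []
    for _ in range(k):
        # Sample a perturbed copy of the current weights.
        perturbed = [p + sigma * torch.randn_like(p) for p in model_params]
        candidates.append(perturbed)
        # Training loss depends on the hyperparameters, so gradients can
        # flow to them through this term.
        losses.append(train_loss_fn(perturbed, hyperparams))
    # Fitness weights: lower training loss -> higher weight.
    w = torch.softmax(-torch.stack(losses) / temperature, dim=0)
    # Combine the candidates into an updated parameter estimate.
    new_params = [sum(w[i] * candidates[i][j] for i in range(k))
                  for j in range(len(model_params))]
    # Validation loss on the combined weights; backprop reaches the
    # hyperparameters only through the (first-order) softmax weights.
    val_loss = val_loss_fn(new_params)
    return torch.autograd.grad(val_loss, hyperparams)[0]
```

Because the graph only spans the perturbation, softmax, and combination steps, memory and compute stay close to ordinary first-order training.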
This repository provides implementations of the experiments described in the paper:
- Illustration using a 1-dimensional problem
- Rotation transformation
- Cross-domain few-shot classification via learned feature-wise transformation
- Label noise with MetaWeightNet
- Low-resource cross-lingual learning with MetaXL
Usage
The following libraries are generally needed for the experiments:
- PyTorch
- higher
- matplotlib
- NumPy
- scikit-learn
- tqdm
Required versions and any additional libraries depend on the sub-problem; further details are given in each sub-problem's directory.
Citation
If you find this useful for your research, please consider citing:
    @inproceedings{bohdal2021evograd,
      title={EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization},
      author={Bohdal, Ondrej and Yang, Yongxin and Hospedales, Timothy},
      booktitle={Advances in Neural Information Processing Systems},
      year={2021}
    }
Acknowledgments
This work was supported in part by the EPSRC Centre for Doctoral Training in Data Science, funded by the UK Engineering and Physical Sciences Research Council (grant EP/L016427/1) and the University of Edinburgh.