Cutmix

a Ready-to-use PyTorch Extension of Unofficial CutMix Implementations with more improved performance.

Generate Convert Improve

Install / Use

/learn @ildoonet/Cutmix

About this skill

Quality Score

0/100

README

cutmix

a Ready-to-use PyTorch Extension of Unofficial CutMix Implementations.

This re-implementation is improved in some parts,

Fixing issue #1 in the original repository
issue #3 : Random crop regions are randomly chosen, even within the same batch.
issue #4 : Different lambda values(sizes of crop regions) are randomly chosen, even within the same batch.
Images to be cropped are randomly chosen in the whole dataset. Original implementation selects images only inside the same batch(shuffling).
Easy to install and use on your existing project.
With additional augmentations(fast-autoaugment), the performances are improved further.

Hence, there may be slightly-improved training results also.

Requirements

python3
torch >= 1.1.0

Install

This repository is pip-installable,

$ pip install git+https://github.com/ildoonet/cutmix

or you can copy 'cutmix' folder to your project to use it.

Usage

Our CutMix is inhereted from the PyTorch Dataset class so you can wrap your own dataset(eg. cifar10, imagenet, ...). Also we provide CutMixCrossEntropyLoss, soft version of cross-entropy loss, which accept soft-labels required by cutmix.

from cutmix.cutmix import CutMix
from cutmix.utils import CutMixCrossEntropyLoss
...

dataset = datasets.CIFAR100(args.cifarpath, train=True, download=True, transform=transform_train)
dataset = CutMix(dataset, num_class=100, beta=1.0, prob=0.5, num_mix=2)    # this is paper's original setting for cifar.
...

criterion = CutMixCrossEntropyLoss(True)
for _ in range(num_epoch):
    for input, target in loader:    # input is cutmixed image's normalized tensor and target is soft-label which made by mixing 2 or more labels.
        output = model(input)
        loss = criterion(output, target)
    
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

Result

PyramidNet-200 + ShakeDrop + CutMix \w CIFAR-100

| | Top-1 Error(@300epoch) | Top-1 Error(Best) | Model File | |---------------------------------|------------:|------------|------------| | Paper's Reported Result | N/A | 13.81 | N/A | | Our Re-implementation | 13.68 | 13.15 | Download(12.88) | | + Fast AutoAugment | 13.3 | 12.95 | |

We ran 6 indenpendent experiments with our re-implemented codes and got top-1 errors of 13.09, 13.29, 13.27, 13.24, 13.15 and 12.88, using below command. (Converged at 300epoch with the top-1 errors of 13.55, 13.66, 13.95, 13.9, 13.8 and 13.32.)

$ python train.py -c conf/cifar100_pyramid200.yaml

ResNet + CutMix \w ImageNet

| | | Top-1 Error<br/>(@300epoch) | Top-1 Error<br/>(Best) | Model File | |------------|---------------------------------|------------:|----------:|-----------:| | ResNet18 | Reported Result \wo CutMix | N/A | 30.43 | | | Ours | 29.674 | 29.56 | | ResNet34 | Reported Result \wo CutMix | N/A | 26.456 | | | | Ours | 24.7 | 24.57 | Download | | ResNet50 | Paper's Reported Result | N/A | 21.4 | N/A | | | Author's Code(Our Re-run) | 21.768 | 21.586 | N/A | | | Our Re-implementation | 21.524 | 21.340 | Download(21.25) | | ResNet200 | Our Re-implementation | | | + Fast AutoAugment | 19.058 | 18.858 |

$ python train.py -c conf/imagenet_resnet50.yaml

We ran 5 independent experiments on ResNet50.

Author's codes
- 300epoch : 21.762, 21.614, 21.762, 21.644, 21.810
- best : 21.56, 21.556, 21.666, 21.498, 21.648
Our Re-implementation
- 300epoch : 21.53, 21.408, 21.55, 21.4, 21.73
- best : 21.392, 21.328, 21.386, 21.256, 21.34

Reference

Official
- Paper : https://arxiv.org/abs/1905.04899
- Implementation : https://github.com/clovaai/CutMix-PyTorch
ShakeDrop
- https://github.com/owruby/shake-drop_pytorch
Fast AutoAugment
- https://github.com/kakaobrain/fast-autoaugment

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

last30days-skill

16.5k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

sec-edgar-agentkit

AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.

ildoonet

View profile

View on GitHub

GitHub Stars168

CategoryEducation

Updated5d ago

Forks29

ildoonet/cutmix

Languages

Python

Security Score

100/100

Audited on Mar 25, 2026

No findings