LCCN

Latent Class-Conditional Noise Model

Generate Convert Improve

Install / Use

/learn @Sunarker/LCCN

About this skill

Quality Score

0/100

README

Latent Class-Conditional Noise Model

Abstract

Learning with noisy labels has become imperative in the Big Data era, which saves expensive human labors on accurate annotations. Previous noise-transition-based methods have achieved theoretically-grounded performance under the Class-Conditional Noise model (CCN). However, these approaches builds upon an ideal but impractical anchor set available to pre-estimate the noise transition. Even though subsequent works adapt the estimation as a neural layer, the ill-posed stochastic learning of its parameters in back-propagation easily falls into undesired local minimums. We solve this problem by introducing a Latent Class-Conditional Noise model (LCCN) to parameterize the noise transition under a Bayesian framework. By projecting the noise transition into the Dirichlet space, the learning is constrained on a simplex characterized by the complete dataset, instead of some ad-hoc parametric space wrapped by the neural layer. We then deduce a dynamic label regression method for LCCN, whose Gibbs sampler allows us efficiently infer the latent true labels to train the classifier and to model the noise. Our approach safeguards the stable update of the noise transition, which avoids previous arbitrarily tuning from a mini-batch of samples. We further generalize LCCN to different counterparts compatible with open-set noisy labels, semi-supervised learning as well as cross-model training. A range of experiments demonstrate the advantages of LCCN and its variants over the current state-of-the-art methods.

Get Started

Environments

The project is tested under the following environment settings:

OS: Ubuntu 18.04.5
GPU: NVIDIA GeForce RTX 3090
Python: 3.7.10
PyTorch: 1.7.1
Torchvision: 0.8.2
Cudatoolkit: 11.0.221
Numpy: 1.21.2
Tensorflow: 1.8.0

Code Structure

We summarize the codes as the following structure

├── README.md
├──                                              # Tensorflow code for LCCN
├── data/cifar10/cifar-10-batches-bin/*          # data files
│── cifar10_input.py                             # read data
│── cifar10.py                                   # backbone (training and inference)
│── cifar10_train.py                             # CE         
│── cifar10_train_bootstrapping.py               # CE with the bootstrapped labels
│── cifar10_train_T.py                           # CE with the transition matrix
│── cifar10_train_varT.py                        # CE with the adapted transition matrix
│── cifar10_train_varC.py                        # CE with LCCN
│── cifar10_train_T.py                           # CE with the transition matrix
│── cifar10_train_T.py                           # CE with the transition matrix
│── cifar10_eval.py                              # evaluation
├──                                              # Torch code for DivideLCCN
│── PreResNet.py                                 # Resnet Backbone 1
│── resnet_model.py                              # Resnet Backbone 2
│── dataloader_cifar.py                          # read data
│── divide_train_varC.py                         # DivideLCCN

Construct the noisy dataset

  python dataset.py

Baselines

Train DNNs directly with the cross-entropy loss (CE).

  python cifar10_train.py --train_dir results/events_ce/cifar10_train --noise_rate 0.3 # You can train other models like this one

Train Bootstrapping

  python cifar10_train_bootstrapping.py --train_dir results/events_bootstrapping/cifar10_train --noise_rate 0.3 # You can train other models like this one

Train Forward

  python cifar10_train_T.py --init_dir results/events_ce/cifar10_train --train_dir results/events_T/cifar10_train --noise_rate 0.3 # You can train other models like this one

Train S-adaptation

  python cifar10_train_varT.py --init_dir results/events_ce/cifar10_train --train_dir results/events_varT/cifar10_train --noise_rate 0.3 # You can train other models like this one

Train LCCN

Train LCCN

  python cifar10_train_varC.py --init_dir results/events_ce/cifar10_train --train_dir results/events_varC/cifar10_train --noise_rate 0.3 # You can train other models like this one

Train DivideLCCN

To train DivideLCCN on CIFAR-10/CIFAR-100 with different noisy types/ratios, simply run:

Train DivideLCCN, sym noise, CIFAR-10

  python divide_train_varC.py --noise_mode sym --r 0.2 --dataset cifar10 --num_class 10 --data_dir ${data_dir} # You can train other models like this one

Train DivideLCCN, asym noise, CIFAR-10

  python divide_train_varC.py --noise_mode asym --r 0.1 --dataset cifar10 --num_class 10 --data_dir ${data_dir} # You can train other models like this one

Train DivideLCCN, sym noise, CIFAR-100

  python divide_train_varC.py --noise_mode sym --r 0.2 --dataset cifar100 --num_class 100 --data_dir ${data_dir} # You can train other models like this one

Train DivideLCCN, asym noise, CIFAR-100

  python divide_train_varC.py --noise_mode asym --r 0.1 --dataset cifar100 --num_class 100 --data_dir ${data_dir} # You can train other models like this one

Results (please refer to the paper for more results)

Citation

@article{yao2023latent,
  title={Latent Class-Conditional Noise Model},
  author={Yao, Jiangchao and Han, Bo and Zhou, Zhihan and Zhang, Ya and Tsang, Ivor W},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2023},
  publisher={IEEE}
}

Acknowledgement

We borrow some codes from LCCN and DivideMix.

Related Skills

node-connect

343.1k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

90.0k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

343.1k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

343.1k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。