RECOMBINER

Pytorch Implementation of INR-based codec RECOMBINER (Robust and Enhanced Compression with Bayesian Implicit Neural Representations)

Generate Convert Improve

Install / Use

/learn @cambridge-mlg/RECOMBINER

About this skill

Quality Score

0/100

README

RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations

Official PyTorch implementation of the INR-based codec RECOMBINER (published in ICLR 2024 :star2:). It sets a new SOTA on CIFAR-10 at low bitrates and achieves strong performance on other modalities comparing to other INR-based codecs. This repo provides implementations of RECOMBINER across modalities, including image (Kodak, CIFAR-10), audio, video, and protein 3D structure.

Examples

Examples of video and protein structures compressed with RECOMBINER.

Installation

We suggest using the following commands.

conda create --name $ENV_NAME
conda activate $ENV_NAME
conda install pytorch=2.1.0 torchvision=0.16.0 torchaudio=2.1.0 pytorch-cuda=11.8 -c pytorch -c nvidia
pip3 install -r requirements.txt

Dataset Preparation

Please refer to ./data/Readme.md.

Execution

Training RECOMBINER

python main_prior_training.py [--seed] [--train_dir] [--train_size] [--dataset] [--device] [--max_bitrate] [--saving_dir]

Compression Test Data Points

python main_compression.py [--seed] [--test_dir] [--test_idx] [--dataset] [--device] [--prior_path] [--save_dir]

Hyperparameters

You can also adjust the hyperparameters by modifying config.py

Citation

Please consider citing the following paper if you use this repo.

@inproceedings{he2024recombiner,
      title={RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations}, 
      author={Jiajun He and Gergely Flamich and Zongyu Guo and José Miguel Hernández-Lobato},
      booktitle={International Conference on Learning Representations},
      year={2024},
}

Related Skills

node-connect

348.5k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

109.1k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

348.5k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

348.5k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。