
SBAC

Facebear's minimal implementation of SBAC (Soft Behavior Regularized Actor Critic, NeurIPS 2022 Offline RL Workshop)

README

Facebear's RL implementations:

Offline_RL:
SBAC (Soft Behavior Regularized Actor Critic)
TD3+BC (A Minimalist Approach to Offline RL)
BCQ (Batch-Constrained Q-learning)
BEAR (Bootstrapping Error Accumulation Reduction)

Online_RL:
PPO (Proximal Policy Optimization)
TD3 (Twin Delayed Deep Deterministic Policy Gradient)
SAC (Soft Actor-Critic)

