DRL
Deep Reinforcement Learning
Install / Use
/learn @wangshusen/DRLREADME
Deep Reinforcement Learning
-
Overview.
-
Reinforcement Learning [slides] [lecture note] [Video (in Chinese)].
-
Value-Based Learning [slides] [Video (in Chinese)].
-
Policy-Based Learning [slides] [Video (in Chinese)].
-
Actor-Critic Methods [slides] [Video (in Chinese)].
-
AlphaGo [slides] [Video (in Chinese)].
-
-
TD Learning.
-
Sarsa [slides] [Video (in Chinese)].
-
Q-learning [slides] [Video (in Chinese)].
-
Multi-Step TD Target [slides] [Video (in Chinese)].
-
-
Advanced Topics on Value-Based Learning.
-
Experience Replay (ER) & Prioritized ER [slides] [Video (in Chinese)].
-
Overestimation, Target Network, & Double DQN [slides] [Video (in Chinese)].
-
Dueling Networks [slides] [Video (in Chinese)].
-
-
Policy Gradient with Baseline.
-
Policy Gradient with Baseline [slides] [Video (in Chinese)].
-
REINFORCE with Baseline [slides] [Video (in Chinese)].
-
Advantage Actor-Critic (A2C) [slides] [Video (in Chinese)].
-
REINFORCE versus A2C [slides] [Video (in Chinese)].
-
-
Advanced Topics on Policy-Based Learning.
-
Trust-Region Policy Optimization (TRPO) [slides] [Video (in Chinese)].
-
Partial Observation and RNNs.
-
-
Dealing with Continuous Action Space.
-
Discrete versus Continuous Control [slides] [Video (in Chinese)].
-
Deterministic Policy Gradient (DPG) for Continuous Control [slides] [Video (in Chinese)].
-
Stochastic Policy Gradient for Continuous Control [slides] [Video (in Chinese)].
-
-
Multi-Agent Reinforcement Learning.
-
Basics and Challenges [slides] [Video (in Chinese)].
-
Centralized VS Decentralized [slides] [Video (in Chinese)].
-
-
Imitation Learning.
-
Inverse Reinforcement Learning.
-
Generative Adversarial Imitation Learning (GAIL).
-
Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
API
A learning and reflection platform designed to cultivate clarity, resilience, and antifragile thinking in an uncertain world.
openclaw-plugin-loom
Loom Learning Graph Skill This skill guides agents on how to use the Loom plugin to build and expand a learning graph over time. Purpose - Help users navigate learning paths (e.g., Nix, German)
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
Security Score
Audited on Mar 23, 2026
