DDQN

Deep Double Q-Learning implementation introduced by Hasselt et al in this paper: https://arxiv.org/abs/1509.06461. It's interfacing with openAI Gym. WIP.

Generate Convert Improve

Install / Use

/learn @DavidSanwald/DDQN

About this skill

Quality Score

0/100

README

Watch it in action at the Gym here:

https://gym.openai.com/evaluations/eval_GFtDBmuyRjCzcAkBibwYWQ#reproducibility

The algorithm is based on the great research of such great minds like David Silver, Hado van Hasselt, Vlad Minh and many more in particular (bust not exclusively) on Double DQN.

I also wrote about the algorithm on my blog, if you want to know more:

https://davidsanwald.github.io/2016/12/11/Double-DQN-interfacing-OpenAi-Gym.html

If you want to reproduce the exact results from the Gym please use the one file Gist, sometimes I feel like doing stupid things with every master branch I can get my hands on (;

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

last30days-skill

18.3k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

sec-edgar-agentkit

AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.

DavidSanwald

View profile

View on GitHub

GitHub Stars30

CategoryEducation

Updated4mo ago

Forks6

DavidSanwald/DDQN

Languages

Python

Security Score

72/100

Audited on Nov 14, 2025

No findings