DDQN
Deep Double Q-Learning implementation introduced by Hasselt et al in this paper: https://arxiv.org/abs/1509.06461. It's interfacing with openAI Gym. WIP.
Install / Use
/learn @DavidSanwald/DDQNREADME
Watch it in action at the Gym here:
https://gym.openai.com/evaluations/eval_GFtDBmuyRjCzcAkBibwYWQ#reproducibility
The algorithm is based on the great research of such great minds like David Silver, Hado van Hasselt, Vlad Minh and many more in particular (bust not exclusively) on Double DQN.
I also wrote about the algorithm on my blog, if you want to know more:
https://davidsanwald.github.io/2016/12/11/Double-DQN-interfacing-OpenAi-Gym.html
If you want to reproduce the exact results from the Gym please use the one file Gist, sometimes I feel like doing stupid things with every master branch I can get my hands on (;
Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
last30days-skill
18.3kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
sec-edgar-agentkit
10AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.
