Results for "supervised-reinforcement-learning"

93 skills found · Page 1 of 4

Ceruleanacg / Personae

1.4k

📈 Personae is a repository of implementations and environments for Deep Reinforcement Learning & Supervised Learning in Quantitative Trading.

universal

paper reinforcement-learning stock +5

Updated 8d ago

lnmangione / Halite III

502

In this paper, we apply machine learning to create bots for Halite III, @twosigma's annual A.I. competition. We develop one classifier using a Support Vector Machine with Supervised Learning, and one using a Deep Neural Network with Reinforcement Learning.

universal

Updated 3d ago

tongjingqi / AI Can Learn Scientific Taste

373

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.

universal

agent ai-innovator ai-scientists +1

Updated 11h ago

accel-brain / Accel Brain Code

323

The purpose of this repository is to make prototypes as case studies in the context of proof of concept (PoC) and research and development (R&D) that I have written about on my website. The main research topics are auto-encoders in relation to representation learning, statistical machine learning for energy-based models, generative adversarial networks (GANs), deep reinforcement learning such as Deep Q-Networks, semi-supervised learning, and neural network language models for natural language processing.

universal

auto-encoder automatic-summarization combinatorial-optimization +16

Updated 12d ago

Jerry-XDL / AIDoctor

254

AIDoctor trains a medical GPT model with the ChatGPT training pipeline: implementation of Pretraining, Supervised Finetuning, RLHF (Reward Modeling and Reinforcement Learning) and DPO (Direct Preferenc…

universal

Updated 23h ago

lebrice / Sequoia

197

The Research Tree - A playground for research at the intersection of Continual, Reinforcement, and Self-Supervised Learning.

universal

Updated 2mo ago

yaqingwang / WeFEND AAAI20

139

Dataset for paper "Weak Supervision for Fake News Detection via Reinforcement Learning" published in AAAI'2020.

universal

aaai2020 dataset deep-learning +4

Updated 1mo ago

rainarch / DSNER

132

Distantly Supervised NER with Partial Annotation Learning and Reinforcement Learning

universal

Updated 7mo ago

InternLM / Spatial SSRL

124

[CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"

universal

3d-understanding large-language-models large-vision-language-models +4

Updated 7d ago

synlp / ChiMed GPT

104

ChiMed-GPT is a Chinese medical large language model (LLM) built by continually training Ziya-v2 on Chinese medical data, where pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) are comprehensively performed on it.

universal

Updated 2mo ago

YanjieZe / Rl3d

[RA-L 2023 & IROS 2023] Visual Reinforcement Learning with Self-Supervised 3D Representations

universal

Updated 4d ago

jon--lee / Decision Pretrained Transformer

Implementation of the Decision-Pretrained Transformer (DPT) from the paper "Supervised Pretraining Can Learn In-Context Reinforcement Learning".

universal

Updated 15d ago

AI-MOO / IBM Machine Learning Professional Certificate

Machine Learning, Time Series & Survival Analysis. Develop working skills in the main areas of Machine Learning: Supervised Learning, Unsupervised Learning, Deep Learning, and Reinforcement Learning. Also gain practice in specialized topics such as Time Series Analysis and Survival Analysis.

zed

coursera deep-learning ibm +4

Updated 19d ago

NVlabs / NFT

Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasoning"

universal

llm math reasoning +1

Updated 7d ago

scottemmons / Rvs

Reinforcement Learning via Supervised Learning

universal

Updated 4mo ago

Panda0406 / Reinforcement Learning Distant Supervision RE

Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning

universal

Updated 1y ago

michaelnny / InstructLLaMA

Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT, but on a much smaller scale.

universal

4bit-fine-tune instructgpt llam2 +3

Updated 2mo ago

StateOfTheArt-quant / Sharpe

sharpe is a unified, interactive, general-purpose environment for backtesting or applying machine learning (supervised learning and reinforcement learning) in the context of quantitative trading.

universal

algorithm-trading backtesting-trading-strategies quantitative-finance +3

Updated 3mo ago

kblomdahl / Dream Go

Artificial Go player based on reinforcement and supervised learning.

universal

ai baduk cuda +8

Updated 21d ago

StateOfTheArt-quant / Trading Gym

A unified environment for supervised learning and reinforcement learning in the context of quantitative trading.

universal

ddpg deep-learning gym-environment +4

Updated 3mo ago