LiamConnell / deep-algotrading

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

reinforcement-learning deep-learning neural-network tensorflow lstm policy-gradient

Updated Jun 10, 2024
Jupyter Notebook

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

pytorch dqn policy-gradient rl cql atari ddpg imitation-learning sac drl npg double-dqn trpo mujoco ppo a2c td3 bcq transferlab

Updated Jun 10, 2024
Python

chengxi600 / RLStuff

A collection of reinforcement learning algorithm implementations

machine-learning reinforcement-learning genetic-algorithm q-learning policy-gradient actor-critic

Updated Jun 10, 2024
Jupyter Notebook

datawhalechina / easy-rl

A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/

reinforcement-learning deep-reinforcement-learning q-learning dqn policy-gradient sarsa a3c ddpg imitation-learning double-dqn dueling-dqn ppo td3 easy-rl

Updated Jun 9, 2024
Jupyter Notebook

WorldEditor50 / snakeAI

Testing MLP, DQN, PPO, SAC, and policy gradient on the Snake game

reinforcement-learning lstm dqn policy-gradient sac ppo snakeai

Updated Jun 8, 2024
C++

notjedi / tuxkart-ai

[WIP] RL agent for the SuperTuxKart game.

reinforcement-learning deep-reinforcement-learning pytorch policy-gradient autoencoder vae rl ppo vae-pytorch

Updated Jun 7, 2024
Python

kkm24132 / ReinforcementLearning

Focuses on Reinforcement Learning related concepts, use cases, and learning approaches

reinforcement-learning q-learning policy-gradient sarsa multi-armed-bandits montecarlo linear-function-approximation exploration-exploitation temporal-difference-algorithms

Updated Jun 5, 2024
Jupyter Notebook

MarioFiorino / Tutorial-Reinforcement-Learning-ITA-Python

This repository contains a collection of tutorials on the basics of Reinforcement Learning, developed in Python and written entirely in Italian.

reinforcement-learning openai-gym q-learning policy-gradient sarsa ita tensorflow2 tutorial-italiano off-policy-monte-carlo programmazione-dinamica teoria-controllo-ottimale fondamenti-teorici-rl on-policy-first-visit-monte-carlo-control n-step-td semi-gradient-one-step-sarsa gradient-monte-carlo-target-control

Updated Jun 5, 2024
Jupyter Notebook

Allenpandas / Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

reinforcement-learning deep-reinforcement-learning q-learning artificial-intelligence dqn policy-gradient imitation-learning aaai ijcai reinforcement-learning-papers hierarchical-reinforcement-learning icml multi-agent-reinforcement-learning neurips meta-reinforcement-learning offline-reinforcement-learning rl-papers reinforcement-learning-conferences reinforcement-learning-paper reinforcement-learning-conferences-papers

Updated May 30, 2024

TomGoesGitHub / Spinning-Up-in-Reinforcement-Learning

Several RL-agents are tested on classical environments and benchmarked against their stable-baselines implementation.

reinforcement-learning q-learning policy-gradient markov-decision-processes actor-critic

Updated May 25, 2024
Python

oliverc1623 / DRIVE-Sim

A PyTorch-based framework to conduct deep reinforcement learning research in multiple autonomous vehicle simulators

simulator reinforcement-learning neural-networks policy-gradient autonomous-vehicles

Updated May 25, 2024
Jupyter Notebook

callmespring / RL-short-course

Reinforcement Learning Short Course

reinforcement-learning q-learning ridesharing policy-gradient dynamic-programming deep-q-network markov-decision-processes policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning model-based-rl policy-based-method fitted-q-iteration off-policy-evaluation offline-rl order-dispatch-recommendation

Updated May 23, 2024
Jupyter Notebook

CodeName-Detective / Prompt-to-Song-Generation-using-Large-Language-Models

This project uses LLMs to generate music from text by understanding prompts, creating lyrics, determining genre, and composing melodies. It harnesses LLM capabilities to create songs based on text inputs through a multi-step approach.

natural-language-processing deep-learning transformers deep-reinforcement-learning policy-gradient genre-classification seq-to-seq llms rlhf flan-t5 llama3

Updated May 21, 2024
Jupyter Notebook

CodeName-Detective / A2C-Exploring-OpenAI-Gym-Environments-and-Enhancing-Actor-Critic-Algorithms-for-Optimal-Performance

This project provides a comprehensive understanding of reinforcement learning, focusing on Actor Critic Algorithms. It involves exploring the OpenAI Gym library, implementing the A2C algorithm from DeepMind's seminal paper, and enhancing the A2C algorithm for improved performance and stability.

reinforcement-learning deep-reinforcement-learning policy-gradient actor-critic a2c open-ai-gym

Updated May 21, 2024
Jupyter Notebook

MarcoMeter / episodic-transformer-memory-ppo

Clean baseline implementation of PPO using an episodic TransformerXL memory

deep-reinforcement-learning pytorch transformer policy-gradient pomdp actor-critic proximal-policy-optimization ppo on-policy episodic-memory transformer-xl gtrxl trxl gated-transformer-xl memory-gym

Updated May 13, 2024
Python

markhliu / AlphaGoSimplified

Book repository for AlphaGo Simplified (CRC Press 2024). Implement ideas behind Deep Blue (rule-based AI) and AlphaGo (rule-based AI + Deep Learning) in three simple games: Last Coin Standing, Tic Tac Toe, and Connect Four.