A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
-
Updated
Jun 10, 2024 - Jupyter Notebook
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
An elegant PyTorch deep reinforcement learning library.
A collection of reinforcement learning algorithm implementations
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
testing MLP, DQN, PPO, SAC, policy-gradient by snake
[WIP] RL agent for the SuperTuxKart game.
Focuses on Reinforcement Learning related concepts, use cases, and learning approaches
In questa repository una collezione di tutorial sulle basi del Reinforcement Learning, sviluppati in Python, interamente in italiano.
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL),including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
Several RL-agents are tested on classical environments and benchmarked against their stable-baselines implementation.
A PyTorch-based framework to conduct deep reinforcement learning research in multiple autonomous vehicle simulators
Reinforcement Learning Short Course
This project uses LLMs to generate music from text by understanding prompts, creating lyrics, determining genre, and composing melodies. It harnesses LLM capabilities to create songs based on text inputs through a multi-step approach.
This project provides a comprehensive understanding of reinforcement learning, focusing on Actor Critic Algorithms. It involves exploring the OpenAI Gym library, implementing the A2C algorithm from DeepMind's seminal paper, and enhancing the A2C algorithm for improved performance and stability.
Clean baseline implementation of PPO using an episodic TransformerXL memory
Book repository for AlphaGo Simplified (CRC Press 2024). Implement ideas behind Deep Blue (rule-based AI) and AlphaGo (rule-based AI + Deep Learning) in three simple games: Last Coin Standing, Tic Tac Toe, and Connect Four.
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
DEEp Reinforcement learning framework
Baseline implementation of recurrent PPO using truncated BPTT
Simple maze solver by reinforcement learning
Add a description, image, and links to the policy-gradient topic page so that developers can more easily learn about it.
To associate your repository with the policy-gradient topic, visit your repo's landing page and select "manage topics."