Title: Proximal_Policy_Optimization

Description: Reinforcement learning methods can be divided into value-based and policy-based approaches, according to how the policy is learned. In deep reinforcement learning, the DQN algorithm combines deep learning with the value-based Q-learning algorithm. By using an experience replay buffer and a target network, DQN successfully brings deep learning into reinforcement learning.
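The two DQN ingredients named in the description, the experience replay buffer and the target network, can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not code taken from the files in this package. The names `ReplayBuffer`, `td_target`, and `sync_target` are hypothetical, and network parameters are represented as a plain dict purely for illustration.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size pool of past transitions, sampled uniformly at random."""

    def __init__(self, capacity):
        # A deque with maxlen drops the oldest transition once it is full.
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling breaks the correlation between consecutive steps.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def td_target(reward, next_q_values, done, gamma=0.99):
    """Q-learning target; next_q_values are assumed to come from the target network."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def sync_target(online_params, target_params):
    """Hard update: copy the online parameters into the target network (assumed dicts)."""
    target_params.clear()
    target_params.update(online_params)
```

In a training loop, transitions would typically be pushed after every environment step, a batch sampled once the buffer holds enough data, and `sync_target` called every fixed number of steps so that the targets change slowly.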
Uploader: 小人物0104
File list:
File name | Size | Last updated |
---|---|---|
Proximal_Policy_Optimization | 0 | 2019-04-08 |
Proximal_Policy_Optimization\discrete_DPPO.py | 8808 | 2019-01-21 |
Proximal_Policy_Optimization\DPPO.py | 8270 | 2019-01-21 |
Proximal_Policy_Optimization\simply_PPO.py | 6458 | 2019-01-21 |