Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment
Journal article   Open access  Peer reviewed

Weimin Chen, Kelvin Kian Loong Wong, Sifan Long and Zhili Sun
Entropy (Basel, Switzerland), Vol.24(4), p.440
22/03/2022
PMID: 35455103

Abstract

Keywords: approximation theory; correct proximal policy optimization; entropy; policy gradient; reinforcement learning; optimization
DOI: https://doi.org/10.3390/e24040440
Published (Version of record) Open
