Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment
Journal article   Open access   Peer reviewed

Weimin Chen, Kelvin Kian Loong Wong, Sifan Long and Zhili Sun
Entropy (Basel, Switzerland), Vol.24(4), p.440
22/03/2022
PMID: 35455103

Abstract

Keywords: approximation theory; correct proximal policy optimization; entropy; policy gradient; reinforcement learning; Optimization
url
https://doi.org/10.3390/e24040440
Published (Version of record) Open
