Search
Now showing items 1-1 of 1
Potential-Based Reward Shaping Preserves Pareto Optimal Policies
(2017-05)
Reward shaping is a well-established family of techniques
that have been successfully used to improve the performance
and learning speed of Reinforcement Learning agents in singleobjective
problems. Here we extend the ...