Now showing items 1-1 of 1

    • Potential-Based Reward Shaping Preserves Pareto Optimal Policies 

      Mannion, Patrick; Devlin, Sam; Karl, Mannion; Duggan, Jim (2017-05)
      Reward shaping is a well-established family of techniques that have been successfully used to improve the performance and learning speed of Reinforcement Learning agents in singleobjective problems. Here we extend the ...