A survey of preference-based reinforcement learning methods
A survey of
preference-based reinforcement learning
methods
Christian Wirth
,
Riad Akrour
,
Gerhard Neumann
,
Johannes Fürnkranz
Journal of Machine Learning Research
18 (2017) 1-46
https://jmlr.org/papers/volume18/16-634/16-634.pdf