Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm
Crossref DOI link: https://doi.org/10.1007/s10994-014-5458-8
Published Online: 2014-07-02
Published Print: 2014-12
Busa-Fekete, Róbert
Szörényi, Balázs
Weng, Paul
Cheng, Weiwei
Hüllermeier, Eyke
Text and Data Mining valid from 2014-07-02