Type of publication: | Artikel in einem Konferenzbericht |
Publikationsstatus: | Akzeptiert |
Jahr: | 2020 |
Verlag: | Proceedings of International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI) |
Ort: | Berlin, Germany |
URL: | https://cps.unileoben.ac.at/wp... |
DOI: | https://cps.unileoben.ac.at/wp/ASPAI2020Xue.pdf |
Abriss: | Gradient-free reinforcement learning algorithms often fail to scale to high dimensions and require a large number of rollouts. In this paper, we propose learning a predictor model that allows simulated rollouts in a rank-based black-box optimizer Covariance Matrix Adaptation Evolutional Strategy (CMA-ES) to achieve higher sample-efficiency. We validated the performance of our new approach on different benchmark functions where our algorithm shows a faster convergence compared to the standard CMA-ES. As a next step, we will evaluate our new algorithm in a robot cup flipping task. |
Schlagworte: | |
Autoren: | |
Anhänge
|
|
[Bib|RIS] | |
|