Reinforcement Learning from Scarce Experience via Policy Search | Kaicus Deutschland