Reinforcement learning with limited prior knowledge in long-term environments