Deep vs. Shallow Reinforcement Learning for Low-Dimensional Continuous Control Tasks