Bounding Box Improvement with Reinforcement Learning