Optimal Reservoir Operations For Riverine Water Quality Improvement: A Reinforcement Learning Strategy