Learning What To Remember: Strategies For Selective External Memory In Online Reinforcement Learning Agents