Predictive Scene Representations For Embodied Visual Search