Learning Semantic Representations and Visual Navigation in Indoor Scenes