Reasoning Across Language and Vision in Machines and Humans