Evaluating visual conversational agents via cooperative human-AI games