GQA: A New Dataset for Real-World Visual Reasoning and Compositional QuestionAnswering
GQA is a new dataset designed to challenge and evaluate AI systems in visual reasoning and compositional question answering. It aims to advance scene understanding and multimodal interaction in real-world scenarios.