In recent years, Visual Question Answering (VQA) has become an active research field. The task requires jointly understanding the visual content of the image and the textual content of the question.