ViCAN: Co-attention network for Vietnamese visual question answering