We present the method of the HCMUS team participating in the Multimodal Person Discovery in Broadcast TV task at the MediaEval Challenge 2016. Our method consists of two main tasks. First, we identify a list of potential characters of interest from all video clips. Each potential character is defined as a pair consisting of a face track (a sequence of face patches) and a name. We use OCR results and face detection to find potential characters.
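As an illustration of the definition above, a potential character can be represented as a face track paired with a name. The following is a minimal sketch, not the team's implementation: the class names, the field layout, and in particular the rule that pairs a face track with an OCR name when their frame ranges overlap are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class FacePatch:
    frame: int                       # frame index in the video
    box: Tuple[int, int, int, int]   # (x, y, width, height), assumed layout


@dataclass
class FaceTrack:
    patches: List[FacePatch]         # face patches ordered by frame

    @property
    def start(self) -> int:
        return self.patches[0].frame

    @property
    def end(self) -> int:
        return self.patches[-1].frame


@dataclass
class OcrName:
    text: str                        # name read from an on-screen overlay
    start: int                       # first frame where the text is visible
    end: int                         # last frame where the text is visible


@dataclass
class PotentialCharacter:
    track: FaceTrack
    name: str


def find_potential_characters(
    tracks: List[FaceTrack], names: List[OcrName]
) -> List[PotentialCharacter]:
    """Pair each face track with every OCR name whose frame range overlaps it.

    The overlap rule is a hypothetical stand-in for the pairing step.
    """
    candidates = []
    for track in tracks:
        for name in names:
            if track.start <= name.end and name.start <= track.end:
                candidates.append(PotentialCharacter(track, name.text))
    return candidates
```

For example, a track covering frames 10 to 12 would be paired with a name shown during frames 11 to 20, but not with one shown during frames 30 to 40.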