In this paper, we present our effort to build a Vietnamese speech recognition system for customer service call center. Various techniques such as time delay deep neural network (TDNN), data augmentation are applied to achieve a low word error rate at for this challenging task. |