Recent years have witnessed strong growth in Automatic Speech Recognition (ASR) research, driven by its wide range of applications. However, little effort has been devoted to the Vietnamese language. This paper introduces an end-to-end approach for Vietnamese ASR systems that uses Conformer, a combination of Transformer and Convolutional Neural Network, together with pseudo-labeling.