Scientific report: "Reduced n-gram models for English and Chinese corpora"