Báo cáo khoa học: "Japanese OCR Error Correction using Character Shape Similarity and Statistical Language Model "