A clustering technique for the Vietnamese word categorization