A new method based on clustering improves the efficiency of imbalanced data classification