phát hiện ra quy tắc kinh doanh được biết đến có thể là đặc biệt hữu ích, nó không, tuy nhiên, nhấn mạnh hiệu quả của cây quyết định về vấn đề định hướng quy tắc. Rất nhiều tên miền khác nhau, từ di truyền học để quá trình công nghiệp, thực sự không có quy tắc cơ bản, mặc dù chúng có thể khá phức tạp và che khuất bởi các dữ liệu ồn ào. | f w Team-Ffỉj Table The 95 Percent Confidence Interval Bounds for the champion Group 1 RESPONSE SIZE SEP 95 CONF 95 CONF SEP LOWER UPPER 1 900 000 900 000 .96 900 000 .96 900 000 900 000 .96 900 000 900 000 .96 900 000 .96 900 000 .96 900 000 .96 900 000 .96 Response rates vary from o to o. The bounds for the 95 o confidence level are calculated usingl .96 standard deviations from the mean. Chapter 5 The Lure of Statistics Data Mining Using Familiar Tools 143 Based on these possible response rates it is possible to tell if the confidence bounds overlap. The 95 percent confidence bounds for the challenger model were from about percent to percent. These bounds overlap the confidence bounds for the champion model when its response rates are percent percent or percent. For instance the confidence interval for a response rate of percent goes from percent to percent this does overlap percent percent. Using the overlapping bounds method we would consider these statistically the same. Comparing Results Using Difference of Proportions Overlapping bounds is easy but its results are a bit pessimistic. That is even though the confidence intervals overlap we might still be quite confident that the difference is not due to chance with some given level of confidence. Another approach is to look at the difference between response rates rather than the rates themselves. Just as there is a formula for the standard error of a proportion there is a formula for the standard error of