Data Mining and Knowledge Discovery Handbook, 2 Edition part 28

Data Mining and Knowledge Discovery Handbook, 2 Edition part 28. Knowledge Discovery demonstrates intelligent computing at its best, and is the most desirable and interesting end-product of Information Technology. To be able to discover and to extract knowledge from data is a task that many researchers and practitioners are endeavoring to accomplish. There is a lot of hidden knowledge waiting to be discovered – this is the challenge created by today’s abundance of data. Data Mining and Knowledge Discovery Handbook, 2nd Edition organizes the most current concepts, theories, standards, methodologies, trends, challenges and applications of data mining (DM) and knowledge discovery. | 250 Jerzy W. Grzymala-Busse independent variables and the decision is a dependent variable. A very simple example of such a table is presented as Table in which attributes are Temperature Headache Weakness Nausea and the decision is Flu. The set of all cases labeled by the same decision value is called a concept. For Table case set 1 2 4 5 is a concept of all cases affected by flu for each case from this set the corresponding value of Flu is yes . Table . An Example of a Dataset. Case Attributes Temperature Headache Weakness Nausea Decision Flu 1 veryJiigli yes yes no yes 2 high yes no yes yes 3 normal no no no no 4 normal yes yes yes yes 5 high no yes no yes 6 high no no no no 7 normal no yes no no Note that input data may be affected by errors. An example of such a data set is presented in Table . The case 7 has value for Weakness an obvious error since the attribute Weakness is symbolic with possible values yes and no. Such errors must be corrected before rule induction. Table . An Example of an Erroneous Dataset Case Attributes Temperature Headache Weakness Nausea Decision Flu 1 veryJiigli yes yes no yes 2 high yes no yes yes 3 normal no no no no 4 normal yes yes yes yes 5 high no yes no yes 6 high no no no no 7 normal no no no Another problem is caused by numerical attributes for example Temperature may be represented by real numbers as in Table . Obviously numerical attributes must be converted into symbolic attributes before or during rule induction. The process of converting numerical attributes into symbolic attributes is called discretization or quantization . 13 Rule Induction 251 Table . An Example of a Dataset with a Numerical Attribute. Case Attributes Temperature Headache Weakness Nausea Decision Flu 1 yes yes no yes 2 yes no yes yes 3 no no no no 4 yes yes yes yes 5 no yes no yes 6 no no no no 7 no yes no no Input data may be incomplete . some attributes may have missing .

Không thể tạo bản xem trước, hãy bấm tải xuống
TỪ KHÓA LIÊN QUAN
TÀI LIỆU MỚI ĐĂNG
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.