Data Preparation for Data Mining- P1: Ever since the Sumerian and Elam peoples living in the Tigris and Euphrates River basin some 5500 years ago invented data collection using dried mud tablets marked with tax records, people have been trying to understand the meaning of, and get use from, collected data. More directly, they have been trying to determine how to use the information in that data to improve their lives and achieve their objectives. | Data Preparation for Data Mining Dorian Pyle Senior Editor Diane D. Cerra Director of Production Manufacturing Yonie Overton Production Editor Edward Wade Editorial Assistant Belinda Breyer Cover Design Wall-To-Wall Studios Cover Photograph 1999 PhotoDisc Inc. Text Design Composition Rebecca Evans Associates Technical Illustration Dartmouth Publishing Inc. Copyeditor Gary Morris Proofreader Ken DellaPenta Indexer Steve Rath Printer Courier Corp. Designations used by companies to distinguish their products are often claimed as trademarks or registered trademarks. In all instances where Morgan Kaufmann Publishers Inc. is aware of a claim the product names appear in initial capital or all capital letters. Readers however should contact the appropriate companies for more complete information regarding trademarks and registration. Morgan Kaufmann Publishers Inc. Editorial and Sales Office 340 Pine Street Sixth Floor San Francisco CA 94104-3205 USA Telephone 415-392-2665 Facsimile 415-982-2665 Email mkp@ WWW http www. mkp. com Order toll free 800-745-7323 1999 by Morgan Kaufmann Publishers Inc. All rights reserved Please purchase PDF Split-Merge on to remove this watermark. No part of this publication may be reproduced stored in a retrieval system or transmitted in any form or by any means electronic mechanical photocopying or otherwise without the prior written permission of the publisher. Dedication To my dearly beloved Pat without whose love encouragement and support this book and very much more would never have come to be Please purchase PDF Split-Merge on to remove this watermark. Table of Contents Data Preparation for Data Mining Preface Introduction Chapter 1 - Data Exploration as a Process Chapter 2 - The Nature of the World and Its Impact on Data Preparation Chapter 3 - Data Preparation as a Process Chapter 4 - Getting the Data Basic Preparation Chapter 5 - Sampling Variability and Confidence Chapter 6 - Handling .