Data Streams Models and Algorithms- P3: In recent years, the progress in hardware technology has made it possible for organizations to store and record large streams of transactional data. Such data sets which continuously and rapidly grow over time are referred to as data streams. In addition, the development of sensor technology has resulted in the possibility of monitoring many events in real time. | 44 DATA STREAMS MODELSAND ALGORITHMS Technique Definition Pros Cons Sampling Choosing a data subset for analysis Error Bounds Guaranteed Poor for anomaly detection Load Shedding Ignoring a chunk of data Efficient for queries Very poor for anomaly detection Sketching Random projection on feature set Extremely Efficient May ignore Relevant features Synopsis Structure Quick Transformation Analysis Task Independent Not sufficient for very fast stream Aggregation Compiling summary statistics Analysis Task Independent May ignore Relevant features Table . Data Based Techniques Technique Definition Pros Cons Approximation Algorithms Algorithms with Error Bounds Efficient Resource adaptivity with data rates not always possible Sliding Window Analyzing most recent streams General Ignores part of stream Algorithm Output Granularity Highly Resource aware technique with memory and fluctuating data rates General Cost overhead of resource aware component Table . Task Based Techniques the task-based techniques. Each table provides a definition advantages and disadvantages of each technique. While the methods in Tables and provide an overview of the broad methods which can be used to adapt conventional methods to classification it is more useful to study specific techniques which are expressly designed for the purpose of classification. In the next section we will provide a review of these methods. 4. Classification Techniques This section reviews the state-of-the-art of data stream classification techniques. We have provided an overview of some of the key methods how well they address the research problems discussed earlier. lease purchase PDF Split-Merge on to remove this watermark. A Survey of Classification Methods in Data Streams 45 Ensemble Based Classification Wang et al. 30 have proposed a generic framework for mining concept drifting data streams. The framework is based on the observation that many data stream mining algorithms do not .