0% found this document useful (0 votes)

265 views55 pages

Chap 3

This document discusses data preprocessing techniques. It explains that real-world data is often dirty, incomplete, noisy, inconsistent and needs to be cleaned before analysis. The major tasks of data preprocessing are described as data cleaning, integration, transformation, reduction and discretization. Specific techniques for handling missing data, outliers, inconsistencies are also outlined.

Uploaded by

api-27259648

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

265 views55 pages

Chap 3

Uploaded by

api-27259648

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 55

Chapter 3: Data Preprocessing

 Why preprocess the data?

 Data in the real world is dirty

 incomplete: lacking attribute values, lacking

certain attributes of interest, or containing

only aggregate data
 noisy: containing errors or outliers

 inconsistent: containing discrepancies in codes

or names
 No quality data, no quality mining results!
 Quality decisions must be based on quality

data
 Data warehouse needs consistent integration

of quality data
October 16, 2008 Data Mining: Concepts and Techniques 2
Multi-Dimensional Measure of Data
Quality

 A well-accepted multidimensional view:

 Accuracy

 Completeness

 Consistency

 Timeliness

 Believability

 Value added

 Interpretability

 Accessibility

 Broad categories:
 intrinsic, contextual, representational, and

accessibility.
October 16, 2008 Data Mining: Concepts and Techniques 3
Major Tasks in Data
Preprocessing

 Data cleaning
 Fill in missing values, smooth noisy data, identify or
remove outliers, and resolve inconsistencies
 Data integration
 Integration of multiple databases, data cubes, or files
 Data transformation
 Normalization and aggregation
 Data reduction
 Obtains reduced representation in volume but produces
the same or similar analytical results
 Data discretization
 Part of data reduction but with particular importance,
especially for numerical data
October 16, 2008 Data Mining: Concepts and Techniques 4
Forms of data
preprocessing
Chapter 3: Data Preprocessing

 Why preprocess the data?

 Data cleaning tasks

 Fill in missing values
 Identify outliers and smooth out noisy
data
 Correct inconsistent data

October 16, 2008 Data Mining: Concepts and Techniques 7

Missing Data
 Data is not always available
 E.g., many tuples have no recorded value for several
attributes, such as customer income in sales data
 Missing data may be due to
 equipment malfunction
 inconsistent with other recorded data and thus deleted
 data not entered due to misunderstanding
 certain data may not be considered important at the
time of entry
 not register history or changes of the data
 Missing data may need to be inferred.

October 16, 2008 Data Mining: Concepts and Techniques 8

How to Handle Missing
Data?
 Ignore the tuple: usually done when class label is missing
(assuming the tasks in classification—not effective when the
percentage of missing values per attribute varies
considerably.
 Fill in the missing value manually: tedious + infeasible?
 Use a global constant to fill in the missing value: e.g.,
“unknown”, a new class?!
 Use the attribute mean to fill in the missing value
 Use the attribute mean for all samples belonging to the
same class to fill in the missing value: smarter
 Use the most probable value to fill in the missing value:
October 16, 2008 Data Mining: Concepts and Techniques 9
Noisy Data

 Noise: random error or variance in a measured

variable
 Incorrect attribute values may due to

 faulty data collection instruments

 data entry problems

 data transmission problems

 technology limitation

 inconsistency in naming convention

 Other data problems which requires data cleaning

 duplicate records

 incomplete data

October16,inconsistent
2008 data
Data Mining: Concepts and Techniques 10
How to Handle Noisy Data?
 Binning method:
 first sort data and partition into (equi-depth)

bins
 then one can smooth by bin means, smooth by

bin median, smooth by bin boundaries, etc.

 Clustering

 detect and remove outliers

 Combined computer and human inspection

 detect suspicious values and check by human

 Regression

 smooth by fitting the data into regression

functions
October 16, 2008 Data Mining: Concepts and Techniques 11
Simple Discretization Methods:
Binning
 Equal-width (distance) partitioning:
 It divides the range into N intervals of equal

size: uniform grid

 if A and B are the lowest and highest values of

the attribute, the width of intervals will be: W =

(B-A)/N.
 The most straightforward

 But outliers may dominate presentation

 Skewed data is not handled well.

 Equal-depth (frequency) partitioning:

 It divides the range into N intervals, each

containing approximately same number of

samples
October
16, 2008 Data Mining: Concepts and Techniques 12
Binning Methods for Data
Smoothing
• Sorted data for price (in dollars):
• 13,15,16,
16,19,20,20,21,22,22,25,25,25,33,35,35,35,35,36,40,45,
• 46,52,70
• * Partition into (equi-depth) bins:
- Bin 1: 13,15,16
- Bin 2: 16,19,20
- Bin 3: 20,21,22
- Bin 4: 22,25,25
- Bin 5: 25,25,33
- Bin 6: 33,35,35
- Bin 7 35,35,36
- Bin 8: 40, 45,46
- Bin 9: 52,70
*
October 16, 2008 Data Mining: Concepts and Techniques 13
Smoothing by bin means:

- Bin 1: 14,14,14
- Bin 2: 18,18,18
- Bin 3: 21,21,21
- Bin 4: 24,24,24
- Bin 5: 27,27,27
- Bin 6: 34,34,34
- Bin 7: 35,35,35
- Bin 8: 43,43,43
- Bin 9: 61,62

October 16, 2008 Data Mining: Concepts and Techniques 14

Smoothing by bin medians:

- Bin 1: 15,15,15
- Bin 2: 19,19,19
- Bin 3: 21,21,21
- Bin 4: 25,25,25
- Bin 5: 25,25,25
- Bin 6: 35,35,35
- Bin 7: 35,35,35
- Bin 8: 45,45,45
- Bin 9: 61,61

October 16, 2008 Data Mining: Concepts and Techniques 15

Smoothing by bin boundaries:

- Bin 1: 13,16,16
- Bin 2: 16,20,20
- Bin 3: 20,22,22
- Bin 4: 22,25,25
- Bin 5: 25,25,33
- Bin 6: 33,35,35
- Bin 7: 35,35,36
- Bin 8: 40,46,46
- Bin 9: 52,70

October 16, 2008 Data Mining: Concepts and Techniques 16

 Inconsistent Data

Inconsistencies may due to data entry

This may be corrected manually by performing paper trace

Knowledge Engineering tools may also be used to detect the

violation of known data constraints.

May due to data integration, where a given attributes can

have different names in different databases.

October 16, 2008 Data Mining: Concepts and Techniques 17

Cluster Analysis
Regression
y

Y1’ y=x+1

X1 x
Chapter 3: Data Preprocessing

 Why preprocess the data?

coherent store
 Schema integration
 integrate metadata from different sources

 Entity identification problem: identify real world

entities from multiple data sources, e.g., A.cust-

id ≡ B.cust-#
 Detecting and resolving data value conflicts
 for the same real world entity, attribute values

from different sources are different

 possible reasons: different representations,

different scales, e.g., metric vs. British units

October 16, 2008 Data Mining: Concepts and Techniques 21
Data in Data
Integration
 Redundant data occur often when integration of
multiple databases
 The same attribute may have different names
in different databases
 One attribute may be a “derived” attribute in
another table, e.g., annual revenue
 Redundant data may be able to be detected by
correlational analysis
 Careful integration of the data from multiple
sources may help reduce/avoid redundancies and
inconsistencies and improve mining speed and
Octoberquality
16, 2008 Data Mining: Concepts and Techniques 22
Data
Transformation

 Smoothing: remove noise from data

 Aggregation: summarization, data cube
construction
 Generalization: concept hierarchy climbing
 Normalization: scaled to fall within a small,
specified range
 min-max normalization
 z-score normalization
 normalization by decimal scaling
 Attribute/feature construction
 New attributes
October 16, 2008 constructed
Data Mining: from the given
Concepts and Techniques 23
Data Transformation:
Normalization
 min-max normalization
v − minA
v' = (new _ maxA − new _ minA) + new _ minA
maxA − minA
 z-score normalization
v −meanA
v' =
stand _ devA
 normalization by decimal scaling
v
v' = j Where j is the smallest integer such that Max(| v ' |)<1
10
Chapter 3: Data Preprocessing

 Why preprocess the data?

 Data cleaning
 Data integration and transformation
 Data reduction
 Discretization and concept hierarchy
generation
 Summary
October 16, 2008 Data Mining: Concepts and Techniques 25
Data Reduction
Strategies
 Warehouse may store terabytes of data: Complex
data analysis/mining may take a very long time to
run on the complete data set
 Data reduction
 Obtains a reduced representation of the data set

that is much smaller in volume but yet produces

the same (or almost the same) analytical results
 Data reduction strategies
 Data cube aggregation

 Dimensionality reduction

 Numerosity reduction

 Discretization and concept hierarchy generation

October 16, 2008 Data Mining: Concepts and Techniques 26

Data Cube Aggregation
 The lowest level of a data cube
 the aggregated data for an individual entity of
interest
 e.g., a customer in a phone calling data
warehouse.
 Multiple levels of aggregation in data cubes
 Further reduce the size of data to deal with
 Reference appropriate levels
 Use the smallest representation which is enough
to solve the task
October
 16, 2008 Data Mining: Concepts and Techniques 27
Dimensionality Reduction
 Feature selection (i.e., attribute subset selection):
 Select a minimum set of features such that the

probability distribution of different classes given

the values for those features is as close as
possible to the original distribution given the
values of all features
 reduce # of patterns in the patterns, easier to

understand
 Heuristic methods (due to exponential # of choices):
 step-wise forward selection

 step-wise backward elimination

 combining forward selection and backward

October 16, 2008 Data Mining: Concepts and Techniques 28

Example of Decision Tree Induction

Initial attribute set:

{A1, A2, A3, A4, A5, A6}
A4 ?

A1? A6?

Class 1 Class 2 Class 1 Class 2

> Reduced attribute set: {A1, A4, A6}

Heuristic Feature Selection
Methods
 There are 2d possible sub-features of d features
 Several heuristic feature selection methods:
 Best single features under the feature

independence assumption: choose by

significance tests.
 Best step-wise feature selection:

 The best single-feature is picked first

 Then next best feature condition to the first,

...
 Step-wise feature elimination:


Repeatedly eliminate the worst feature
 Best combined feature selection and

elimination:
October 16, 2008 Data Mining: Concepts and Techniques 30
Data Compression
 String compression
 There are extensive theories and well-tuned

algorithms
 Typically lossless

 But only limited manipulation is possible without

expansion
 Audio/video compression
 Typically lossy compression, with progressive

refinement
 Sometimes small fragments of signal can be

reconstructed without reconstructing the whole

 Time sequence is not audio
October 16,
 2008 Data Mining: Concepts and Techniques 31
Data Compression

Original Data Compressed

Data
lossless

s s y
lo
Original Data
Approximated
Wavelet Transforms
Haar2 Daubechie4
 Discrete wavelet transform (DWT): linear signal
processing
 Compressed approximation: store only a small
fraction of the strongest of the wavelet coefficients
 Similar to discrete Fourier transform (DFT), but
better lossy compression, localized in space
 Method:
 Length, L, must be an integer power of 2 (padding with 0s,
when necessary)
 Each transform has 2 functions: smoothing, difference
 Applies to pairs of data, resulting in two set of data of
length L/2
October 16,
 2008 Data Mining: Concepts and Techniques 33
Principal Component Analysis

 Given N data vectors from k-dimensions, find c

<= k orthogonal vectors that can be best used
to represent data
 The original data set is reduced to one
consisting of N data vectors on c principal
components (reduced dimensions)
 Each data vector is a linear combination of the c
principal component vectors
 Works for numeric data only
 Used when the number of dimensions is large
October 16, 2008 Data Mining: Concepts and Techniques 34
Principal Component Analysis

Y1
Y2

X1
Numerosity Reduction

 Parametric methods
 Assume the data fits some model, estimate
model parameters, store only the parameters,
and discard the data (except possible outliers)
 Log-linear models: obtain value at a point in
m-D space as the product on appropriate
marginal subspaces
 Non-parametric methods
 Do not assume models
 Major families: histograms, clustering,
sampling
October 16, 2008 Data Mining: Concepts and Techniques 36
Regression and Log-Linear
Models

 Linear regression: Data are modeled to fit a

straight line
 Often uses the least-square method to fit the
line
 Multiple regression: allows a response variable Y
to be modeled as a linear function of
multidimensional feature vector
 Log-linear model:
October 16, 2008
approximates discrete
Data Mining: Concepts and Techniques 37
Regress Analysis and Log-
Linear Models
 Linear regression: Y = α + β X
 Two parameters , α and β specify the line and

are to be estimated by using the data at hand.

 using the least squares criterion to the known

values of Y1, Y2, …, X1, X2, ….

 Multiple regression: Y = b0 + b1 X1 + b2 X2.
 Many nonlinear functions can be transformed

into the above.

 Log-linear models:
 The multi-way table of joint probabilities is

approximated by a product of lower-order

tables.

Probability: p(a, b, c, d) = αab βacχad δbcd
October 16, 2008 Data Mining: Concepts and Techniques
Histograms

 A popular data 40
reduction technique 35
 Divide data into 30
buckets and store
25
average (sum) for
each bucket 20
 Can be constructed 15
optimally in one 10
dimension using
5
dynamic
programming 0
 Related to

90000
20000

40000

70000
50000
10000

30000

60000

80000

100000
quantization
Clustering

 Partition data set into clusters, and one can store

cluster representation only
 Can be very effective if data is clustered but not
if data is “smeared”
 Can have hierarchical clustering and be stored in
multi-dimensional index tree structures
 There are many choices of clustering definitions
and clustering algorithms, further detailed in
Chapter 8
October 16, 2008 Data Mining: Concepts and Techniques 40
Sampling

 Allow a mining algorithm to run in complexity that

is potentially sub-linear to the size of the data
 Choose a representative subset of the data

 Simple random sampling may have very poor

performance in the presence of skew

 Develop adaptive sampling methods

 Stratified sampling:

 Approximate the percentage of each class (or

subpopulation of interest) in the overall

database
 Used in conjunction with skewed data

 Sampling may not reduce database I/Os (page at a

October 16, 2008 Data Mining: Concepts and Techniques 41
Sampling

W O R
SRS le random
i m p h o u t
( s e wi t
l
samp ment)
p l a c e
re

SRSW
R

Raw Data
Sampling

Raw Data Cluster/Stratified Sample

Hierarchical Reduction
 Use multi-resolution structure with different
degrees of reduction
 Hierarchical clustering is often performed but tends

to define partitions of data sets rather than

“clusters”
 Parametric methods are usually not amenable to

hierarchical representation
 Hierarchical aggregation

 An index tree hierarchically divides a data set

into partitions by value range of some attributes

 Each partition can be considered as a bucket

 Thus an index tree with aggregates stored at

each node is Data

October 16, 2008 a hierarchical histogram
Mining: Concepts and Techniques 44
Chapter 3: Data Preprocessing

 Why preprocess the data?

 Ordinal — values from an ordered set

 Continuous — real numbers

 Discretization:
☛ divide the range of a continuous attribute into

intervals
 Some classification algorithms only accept

categorical attributes.
 Reduce data size by discretization

 Prepare for further analysis

October 16, 2008 Data Mining: Concepts and Techniques 46

Discretization and Concept
hierachy

 Discretization
 reduce the number of values for a given
continuous attribute by dividing the range of
the attribute into intervals. Interval labels can
then be used to replace actual data values.
 Concept hierarchies
 reduce the data by collecting and replacing
low level concepts (such as numeric values for
the attribute age) by higher level concepts
(such as young, middle-aged, or senior).

October 16, 2008 Data Mining: Concepts and Techniques 47

Discretization and concept
hierarchy generation for numeric
data

 Binning (see sections before)

 Histogram analysis (see sections before)

 Clustering analysis (see sections before)

 Entropy-based discretization

 Segmentation by natural partitioning

October 16, 2008 Data Mining: Concepts and Techniques 48

Entropy-Based Discretization
 Given a set of samples S, if S is partitioned into
two intervals S1 and S2 using boundary T, the
entropy after partitioning
|S | is | S |
E (S , T ) = 1 Ent ( ) + 2 Ent ( )
|S| S1 | S | S2
 The boundary that minimizes the entropy function
over all possible boundaries is selected as a binary
discretization.
 The process is recursively applied to partitions
obtained until some
Ent ( S stopping
) − E (T , S )criterion
>δ is met, e.g.,

 Experiments show that it may reduce data size

and improve classification accuracy
Segmentation by natural
partitioning

3-4-5 rule can be used to segment numeric data

into
relatively uniform, “natural” intervals.
* If an interval covers 3, 6, 7 or 9 distinct values at
the most significant digit, partition the range
into 3 equi-width intervals
* If it covers 2, 4, or 8 distinct values at the most
significant digit, partition the range into 4
intervals
* If it covers 1, 5,
October 16, 2008
or 10 distinct values at the most
Data Mining: Concepts and Techniques 50
Example of 3-4-5 rule
count

Step 1: -$351 -$159 profit $1,838 $4,700

Min Low (i.e, 5%-tile) High(i.e, 95%-0 tile) Max
Step 2: msd=1,000 Low=-$1,000 High=$2,000

(-$1,000 - $2,000)
Step 3:

(-$1,000 - 0) (0 -$ 1,000) ($1,000 - $2,000)

(-$4000 -$5,000)
Step 4:

($2,000 - $5, 000)

(-$400 - 0) (0 - $1,000) ($1,000 - $2, 000)
(0 -
($1,000 -
(-$400 - $200)
$1,200) ($2,000 -
-$300) $3,000)
($200 -
($1,200 -
$400)
(-$300 - $1,400)
($3,000 -
-$200)
($400 - ($1,400 - $4,000)
(-$200 - $600) $1,600) ($4,000 -
-$100) ($600 - ($1,600 - $5,000)
$800) ($800 - ($1,800 -
$1,800)
(-$100 - $1,000) $2,000)
0)
Concept hierarchy generation for
categorical data

 Specification of a partial ordering of attributes

explicitly at the schema level by users or experts
 Specification of a portion of a hierarchy by
explicit data grouping
 Specification of a set of attributes, but not of
their partial ordering
 Specification of only a partial set of attributes

October 16, 2008 Data Mining: Concepts and Techniques 52

Specification of a set of
attributes
Concept hierarchy can be automatically
generated based on the number of distinct
values per attribute in the given attribute set.
The attribute with the most distinct values is
placed at the lowest level of the hierarchy.

country 15 distinct values

province_or_ state 65 distinct

values
city 3567 distinct values

street 674,339 distinct values

Chapter 3: Data Preprocessing

 Why preprocess the data?

 Data preparation is a big issue for both

warehousing and mining
 Data preparation includes
 Data cleaning and data integration
 Data reduction and feature selection
 Discretization
 A lot a methods have been developed but still an
active area of research
October 16, 2008 Data Mining: Concepts and Techniques 55

Chapter 3 - Data Preparation - Data Mining Concepts and Techniques Han and Kamber
No ratings yet
Chapter 3 - Data Preparation - Data Mining Concepts and Techniques Han and Kamber
61 pages
L6 Data Preprocessing
No ratings yet
L6 Data Preprocessing
9 pages
Lecture 3 and 4 - Data Preprocessing
No ratings yet
Lecture 3 and 4 - Data Preprocessing
25 pages
2.3 Data Cleaning
No ratings yet
2.3 Data Cleaning
24 pages
Solutions Manual To Accompany Digital Signal Processing: A Computer Based Approach 3rd Edition 9780073048376 Instant Download
100% (4)
Solutions Manual To Accompany Digital Signal Processing: A Computer Based Approach 3rd Edition 9780073048376 Instant Download
41 pages
Analisis Data 2
No ratings yet
Analisis Data 2
40 pages
Discrete Systems and Digital Signal Processing With MATLAB Second Edition Elali - Download The Ebook in PDF With All Chapters To Read Anytime
No ratings yet
Discrete Systems and Digital Signal Processing With MATLAB Second Edition Elali - Download The Ebook in PDF With All Chapters To Read Anytime
37 pages
UNIT 3 Data Preprocessing
No ratings yet
UNIT 3 Data Preprocessing
22 pages
Data Mining and Data Warehousing CSPC-308
No ratings yet
Data Mining and Data Warehousing CSPC-308
51 pages
A Novel Approach To Saturation Characteristics Modeling and Its Impact On Synchronous Machine Transient Stability Analysis
No ratings yet
A Novel Approach To Saturation Characteristics Modeling and Its Impact On Synchronous Machine Transient Stability Analysis
12 pages
Digital Signal Processing Using MATLAB 2nd Edition Vinay K. Ingle New Release 2025
No ratings yet
Digital Signal Processing Using MATLAB 2nd Edition Vinay K. Ingle New Release 2025
136 pages
UNIT-2 Data Pre-Processing
No ratings yet
UNIT-2 Data Pre-Processing
57 pages
ELEN3013 4 Digital Filters (Ch12)
No ratings yet
ELEN3013 4 Digital Filters (Ch12)
114 pages
Data Mining - Preprocessing
No ratings yet
Data Mining - Preprocessing
77 pages
3 Preprocessing
No ratings yet
3 Preprocessing
27 pages
Lect 4
No ratings yet
Lect 4
30 pages
3 Prep
No ratings yet
3 Prep
53 pages
3 Prep
No ratings yet
3 Prep
50 pages
Chapter2 Data Preprocssing
No ratings yet
Chapter2 Data Preprocssing
70 pages
MCQ For EXCEL Part1
No ratings yet
MCQ For EXCEL Part1
12 pages
02 Data Warehouse
No ratings yet
02 Data Warehouse
18 pages
Chapter 3 - For Class
No ratings yet
Chapter 3 - For Class
52 pages
Concepts and Techniques: Data Mining
No ratings yet
Concepts and Techniques: Data Mining
99 pages
DWDM Unit-Ii
No ratings yet
DWDM Unit-Ii
18 pages
Week 4 - 5 - Data Preprocessing
No ratings yet
Week 4 - 5 - Data Preprocessing
67 pages
Unit 2 - Data Preprocessing
No ratings yet
Unit 2 - Data Preprocessing
42 pages
Chapter 2 - Data Preprocessing
No ratings yet
Chapter 2 - Data Preprocessing
15 pages
Module2 DataPreprocessing
No ratings yet
Module2 DataPreprocessing
27 pages
Laboratory Exercise 3: Discrete-Time Signals: Frequency-Domain Representations
No ratings yet
Laboratory Exercise 3: Discrete-Time Signals: Frequency-Domain Representations
49 pages
AI351 Lecture 1
No ratings yet
AI351 Lecture 1
32 pages
Unit 2
No ratings yet
Unit 2
37 pages
Data Mining & Data Warehousing
No ratings yet
Data Mining & Data Warehousing
62 pages
BDA Class1
No ratings yet
BDA Class1
33 pages
CS-DM Module-2
No ratings yet
CS-DM Module-2
30 pages
M-3420-IB-08MC Instruction Book
100% (1)
M-3420-IB-08MC Instruction Book
187 pages
Application Format 126 GDOC
No ratings yet
Application Format 126 GDOC
2 pages
Pentium
No ratings yet
Pentium
1 page
CS-DM Module-2
No ratings yet
CS-DM Module-2
29 pages
ECE Curriculum Overview
No ratings yet
ECE Curriculum Overview
81 pages
Study Material I
No ratings yet
Study Material I
140 pages
Data Preprocessing: Why Preprocess The Data?
No ratings yet
Data Preprocessing: Why Preprocess The Data?
51 pages
Swetha Unit 1 Part 2 Data Preprocessing
No ratings yet
Swetha Unit 1 Part 2 Data Preprocessing
74 pages
DataPreprocessing 2
No ratings yet
DataPreprocessing 2
68 pages
Spectral Audio Signal Processing
0% (1)
Spectral Audio Signal Processing
9 pages
ICS 2408 - Lecture 2 - Data Preprocessing
No ratings yet
ICS 2408 - Lecture 2 - Data Preprocessing
29 pages
K X N X K X N X: 1. Periodicity
No ratings yet
K X N X K X N X: 1. Periodicity
13 pages
Data Preprocessing - DWM
No ratings yet
Data Preprocessing - DWM
42 pages
DSP Lab
No ratings yet
DSP Lab
70 pages
Data Preprocessing Essentials
No ratings yet
Data Preprocessing Essentials
59 pages
Quick Question42
No ratings yet
Quick Question42
51 pages
Data Preprocessing - Data Cleaning
100% (2)
Data Preprocessing - Data Cleaning
29 pages
Discrete-Time Fourier Transform
No ratings yet
Discrete-Time Fourier Transform
5 pages
Data Cleaning and Datamining
No ratings yet
Data Cleaning and Datamining
54 pages
FFT Lab for Engineering Students
No ratings yet
FFT Lab for Engineering Students
15 pages
Data Mining for Tech Enthusiasts
No ratings yet
Data Mining for Tech Enthusiasts
61 pages
MATLAB Short Tutorial
No ratings yet
MATLAB Short Tutorial
28 pages
Data Pre-Processing Guide
No ratings yet
Data Pre-Processing Guide
33 pages
Data Mining: Concepts and Techniques: - Chapter 3
No ratings yet
Data Mining: Concepts and Techniques: - Chapter 3
52 pages
Intel 80186 Pin Out
No ratings yet
Intel 80186 Pin Out
20 pages
Chap 6
No ratings yet
Chap 6
77 pages
Data Mining: Concepts and Techniques: September 16, 2020 1
No ratings yet
Data Mining: Concepts and Techniques: September 16, 2020 1
46 pages
Unit 2 Final
No ratings yet
Unit 2 Final
46 pages
Chapter3 - Frequency-Domain Representation of Signals
No ratings yet
Chapter3 - Frequency-Domain Representation of Signals
28 pages
Que Es Datamin
No ratings yet
Que Es Datamin
52 pages
Introduction To Random Processes Handouts PDF
No ratings yet
Introduction To Random Processes Handouts PDF
116 pages
02
No ratings yet
02
78 pages
Digital Signal Processing Basics
No ratings yet
Digital Signal Processing Basics
16 pages
"Association Rules": Market Baskets Frequent Itemsets A-Priori Algorithm
No ratings yet
"Association Rules": Market Baskets Frequent Itemsets A-Priori Algorithm
30 pages
USA Comparison5
No ratings yet
USA Comparison5
30 pages
Hotter Dryer
No ratings yet
Hotter Dryer
27 pages
SAR Image Processing Guide
No ratings yet
SAR Image Processing Guide
66 pages
Appendix A
No ratings yet
Appendix A
21 pages
Concepts and Techniques: Data Mining
No ratings yet
Concepts and Techniques: Data Mining
80 pages
Data Preprocessing Essentials
No ratings yet
Data Preprocessing Essentials
42 pages
Electrical Power and Energy Systems: Ricardo Caneloi Dos Santos, Eduardo Cesar Senger
No ratings yet
Electrical Power and Energy Systems: Ricardo Caneloi Dos Santos, Eduardo Cesar Senger
10 pages
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 2 &3
No ratings yet
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 2 &3
36 pages
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 3
No ratings yet
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 3
53 pages
Digital Signal Processing Problems
No ratings yet
Digital Signal Processing Problems
5 pages
bf609DSP Tutesheet Amizone
No ratings yet
bf609DSP Tutesheet Amizone
10 pages
DIT Radix-2
No ratings yet
DIT Radix-2
7 pages
Analysis of Stock Market Data by Using Dynamic Fourier and Wavelets Teknik
No ratings yet
Analysis of Stock Market Data by Using Dynamic Fourier and Wavelets Teknik
13 pages
Data Preprocessing Essentials
No ratings yet
Data Preprocessing Essentials
52 pages
Unix 4 and 5
No ratings yet
Unix 4 and 5
46 pages
Chap 7
No ratings yet
Chap 7
71 pages
Data Pre Processing
No ratings yet
Data Pre Processing
48 pages
Concepts and Techniques: Data Mining
No ratings yet
Concepts and Techniques: Data Mining
78 pages
Advantages of FFT Spectrum Analyzer Technology
No ratings yet
Advantages of FFT Spectrum Analyzer Technology
2 pages
Greenhouse Effect2
No ratings yet
Greenhouse Effect2
8 pages
3datapreprocessing ppt3
No ratings yet
3datapreprocessing ppt3
46 pages
01 Data Pre Processing
No ratings yet
01 Data Pre Processing
46 pages
Concepts and Techniques: Data Mining
No ratings yet
Concepts and Techniques: Data Mining
51 pages
Chap 1
No ratings yet
Chap 1
32 pages
Distance Relay
No ratings yet
Distance Relay
7 pages
Creating Night Effects
No ratings yet
Creating Night Effects
5 pages
Data Mining Chapter3 0
No ratings yet
Data Mining Chapter3 0
32 pages
Signals and Systems
No ratings yet
Signals and Systems
1 page
Virtual Memory
No ratings yet
Virtual Memory
5 pages
DSP Ai
No ratings yet
DSP Ai
113 pages
Engineering Student Performance Data
No ratings yet
Engineering Student Performance Data
2 pages
Campus Data Format HCL - xlsMCA
No ratings yet
Campus Data Format HCL - xlsMCA
2 pages
Time Table-A
No ratings yet
Time Table-A
2 pages
Data Pre Processing
No ratings yet
Data Pre Processing
35 pages
Why Data Preprocessing?: Incomplete
No ratings yet
Why Data Preprocessing?: Incomplete
17 pages
Data Mining: Concepts and Techniques
No ratings yet
Data Mining: Concepts and Techniques
50 pages
DATA MINING Chapter 1 and 2 Lect Slide
No ratings yet
DATA MINING Chapter 1 and 2 Lect Slide
47 pages
To Whom It May Concern
No ratings yet
To Whom It May Concern
1 page
Assign I P 050209
100% (2)
Assign I P 050209
2 pages
CTS Topper Shopper MCA
No ratings yet
CTS Topper Shopper MCA
1 page
Flash Mask
No ratings yet
Flash Mask
1 page
FFT VHDL Code
No ratings yet
FFT VHDL Code
4 pages
Flash
100% (1)
Flash
9 pages
Programmable Keyboard-Display Interface - 8279
100% (2)
Programmable Keyboard-Display Interface - 8279
4 pages