
CLUSTERING IN BIOINFORMATICS

CSE/BIMM/BENG 181 MAY 24, 2011 SERGEI L KOSAKOVSKY POND [[email protected]]


OVERVIEW

Define the clustering problem

Motivation: gene expression and microarrays

Types of clustering

Clustering algorithms

Other applications of clustering



THE CLUSTERING PROBLEM
Motivation: Find patterns in a sea of data

Input:

A (large) number of data points: N

A measure of distance d(i, j) between any two data points

Output:

A grouping (clustering) of the elements into K 'similarity' classes (K can be user-specified or determined automatically)

Sometimes there is also an objective measure that the obtained clustering seeks to minimize.



A MARQUEE APPLICATION: MICROARRAY ANALYSIS
What do newly sequenced genes do?

Simply comparing new gene sequences to known DNA sequences does not necessarily reveal the function of a gene: for 40% of sequenced genes, function cannot be ascertained by comparison to sequences of other known genes alone

Genes whose function is similar or complementary to that of known (reference) genes will be expressed (transcribed) at high levels together with those known genes

Genes that perform antagonistic functions (e.g. down-regulation) may be expressed at high levels at an
earlier or later time point when compared to known genes

E.g. what happens to gene expression in cancer cells?

Expression level is estimated by measuring the amount of mRNA for that particular gene

A gene is active if it is being transcribed

More mRNA usually indicates more gene activity



A MICROARRAY EXPERIMENT
Produce cDNA from mRNA (cDNA is more stable)

Label cDNA with a fluorescent dye or biotin for detection

Different color labels are available to compare many samples at once

Wash cDNA over the microarray, which contains thousands of high-density probes that hybridize to complementary strands in the sample and immobilize them on the surface

For biotin-labeled samples, stain with a biotin-specific fluorescently labeled antibody

Read the microarray, using a laser or a high-resolution CCD

Illumination reveals transcribed/co-expressed genes



HTTP://UPLOAD.WIKIMEDIA.ORG/WIKIPEDIA/COMMONS/0/0E/MICROARRAY2.GIF

Green: expressed only in control

Red: expressed only in an experimental cell

Yellow: equally expressed in both samples

Black: NOT expressed in either control or sample


HTTP://UPLOAD.WIKIMEDIA.ORG/WIKIPEDIA/EN/C/C8/MICROARRAY-SCHEMA.JPG



Track one sample over a period of time to observe changes in gene expression

Track two samples under the same conditions to look for differential expression

Each box represents one gene's expression over time



MICROARRAY DATA

Microarray data are usually transformed into a (relative, normalized) intensity matrix

Can also be represented as a bit matrix (log2 of relative intensity)

The intensity matrix allows biologists to infer correlations between different genes (even if they are dissimilar) and to understand how gene functions might be related

Care must be taken to normalize the data appropriately, e.g. different time points can come from different arrays



INTENSITY TABLE

Gene   Time 1   Time 2   Time 3
1      10       8        10
2      10       0        9
3      4        8.5      3
4      9.5      0.5      8.5
5      4.5      8.5      3
6      10.5     9        12
7      5        8.5      11
8      2.7      8.7      2
9      9.7      2        9
10     10.2     1        9.2

Which genes are similar? What defines co-expression? How do we measure the distance/similarity?

EUCLIDEAN DISTANCE IN D DIMENSIONS

D(x, y) = \sqrt{\sum_{i=1}^{d} (x_i - y_i)^2}
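A minimal Python sketch of this distance (function and variable names are illustrative); applied to genes 1 and 6 from the intensity table above, it reproduces the 2.3 entry of the pairwise distance matrix on the next slide:

import math

def euclidean(x, y):
    # Distance between two d-dimensional expression profiles.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

gene1 = (10, 8, 10)      # expression of gene 1 at times 1-3
gene6 = (10.5, 9, 12)    # expression of gene 6 at times 1-3
print(round(euclidean(gene1, gene6), 1))   # -> 2.3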



FINDING SIMILAR GENES

PAIRWISE DISTANCES

        1     2     3     4     5     6     7     8     9    10
  1     -   8.1   9.2   7.7   8.9   2.3   5.1  10.9   6.1   7.0
  2   8.1     -  12.0   0.9  11.8   9.5  10.1  13.3   2.0   1.0
  3   9.2  12.0     -  11.2   0.5  11.1   8.1   1.7  10.5  11.5
  4   7.7   0.9  11.2     -  10.9   9.2   9.5  12.5   1.6   1.1
  5   8.9  11.8   0.5  10.9     -  10.8   8.0   2.1  10.3  11.3
  6   2.3   9.5  11.1   9.2  10.8     -   5.6  12.7   7.7   8.5
  7   5.1  10.1   8.1   9.5   8.0   5.6     -   9.3   8.3   9.3
  8  10.9  13.3   1.7  12.5   2.1  12.7   9.3     -  12.0  12.9
  9   6.1   2.0  10.5   1.6  10.3   7.7   8.3  12.0     -   1.1
 10   7.0   1.0  11.5   1.1  11.3   8.5   9.3  12.9   1.1     -

[Dendrogram: the genes fall into three groups, {1, 6, 7}, {2, 4, 9, 10}, and {3, 5, 8}]

REARRANGED DISTANCES (rows and columns reordered so that similar genes are adjacent)

        1     6     7     2     4     9    10     3     5     8
  1   0.0   2.3   5.1   8.1   7.7   6.1   7.0   9.2   8.9  10.9
  6   2.3   0.0   5.6   9.5   9.2   7.7   8.5  11.1  10.8  12.7
  7   5.1   5.6   0.0  10.1   9.5   8.3   9.3   8.1   8.0   9.3
  2   8.1   9.5  10.1   0.0   0.9   2.0   1.0  12.0  11.8  13.3
  4   7.7   9.2   9.5   0.9   0.0   1.6   1.1  11.2  10.9  12.5
  9   6.1   7.7   8.3   2.0   1.6   0.0   1.1  10.5  10.3  12.0
 10   7.0   8.5   9.3   1.0   1.1   1.1   0.0  11.5  11.3  12.9
  3   9.2  11.1   8.1  12.0  11.2  10.5  11.5   0.0   0.5   1.7
  5   8.9  10.8   8.0  11.8  10.9  10.3  11.3   0.5   0.0   2.1
  8  10.9  12.7   9.3  13.3  12.5  12.0  12.9   1.7   2.1   0.0
CLUSTERING PRINCIPLES

Homogeneity: elements of the same cluster are maximally close to each other

Separation: elements in separate clusters are maximally far apart from each other

One is actually implied by the other (in many cases)

Generally speaking, this is a hard problem.


 
\min_{\text{clustering}} \left[ \alpha \sum_{x, y \in \text{same cluster}} d(x, y) \; - \; \beta \sum_{x, y \in \text{different clusters}} d(x, y) \right]

The weights \alpha and \beta set the relative importance of homogeneity versus separation.
Because

\sum_{x, y \in \text{same cluster}} d(x, y) \; + \; \sum_{x, y \in \text{different clusters}} d(x, y) \; = \; \sum_{x, y} d(x, y) \; = \; D \; = \; \text{const},

we can substitute D minus the intra-cluster sum for the inter-cluster sum, and simplify

\min_{\text{clustering}} \left[ \alpha \sum_{x, y \in \text{same cluster}} d(x, y) - \beta \sum_{x, y \in \text{different clusters}} d(x, y) \right]

to the equivalent expression

\min_{\text{clustering}} \; (\alpha + \beta) \sum_{x, y \in \text{same cluster}} d(x, y) \; - \; \beta D,

which depends only on intra-cluster distances.



POOR CLUSTERING EXAMPLE

This clustering violates both principles:

Points in the same cluster are far apart

Points in different clusters are close together



BETTER CLUSTERING EXAMPLE

This clustering appears sensible.

But we need an objective metric to optimize cluster assignment.



CLUSTERING TECHNIQUES

Agglomerative: start with every element in its own cluster, and iteratively join clusters together

Divisive: start with one cluster and iteratively divide it into smaller clusters

Hierarchical: organize elements into a tree; leaves represent genes, and the length of the paths between leaves represents the distances between genes. Similar genes lie within the same subtrees

Generally, finding the exact solution to a clustering problem is NP-hard.



K-MEANS CLUSTERING

A technique to partition a set of N points into K clusters

Each cluster is represented by a mean (a centroid) – hence 'K-means'

Input: a set V of N points (v_1, v_2, ..., v_N), the desired number of clusters K, and a distance measure d(v, w) between any two points

Output: a set X of K cluster centers that minimizes the squared error distortion D(V, X) over all possible choices of X

D(V, X) = \frac{1}{N} \sum_{i=1}^{N} \min_{k} \, d(v_i, x_k)^2
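A minimal sketch of this distortion in Python, assuming points and centers are equal-length numeric tuples and d is the Euclidean distance (names are illustrative):

def squared_error_distortion(points, centers):
    # D(V, X): mean, over all points, of the squared distance
    # to the nearest center.
    total = 0.0
    for v in points:
        total += min(sum((a - b) ** 2 for a, b in zip(v, x))
                     for x in centers)
    return total / len(points)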



K-MEANS CLUSTERING

For K = 1, the problem is trivial: for Euclidean distances, the centroid of all points is the solution:

x = \frac{1}{N} \sum_{i} v_i

For K ≥ 2 the problem becomes NP-complete

An efficient heuristic exists: Lloyd's algorithm



LLOYD'S ALGORITHM

1. Arbitrarily assign the K cluster centers (this can significantly influence the outcome)

2. While cluster centers keep changing:

   A. Compute the distance from each data point to the current cluster centers C_i (1 ≤ i ≤ K) and assign the point to the nearest cluster

   B. After the assignment of all data points, compute a new center for each cluster by taking the centroid of all the points in that cluster

3. Output cluster centers and assignments
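A minimal Python sketch of these steps (illustrative names; for simplicity, the initial centers are random data points, ties go to the lowest-index cluster, and an empty cluster keeps its old center):

import random

def lloyd_kmeans(points, k, max_iter=100):
    # 1. Arbitrary initial centers.
    centers = random.sample(points, k)
    clusters = [[] for _ in range(k)]
    for _ in range(max_iter):
        # 2A. Assign each point to the nearest current center.
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda c: dist2(p, centers[c]))
            clusters[j].append(p)
        # 2B. Move each center to the centroid of its cluster.
        new_centers = [centroid(cl) if cl else centers[j]
                       for j, cl in enumerate(clusters)]
        if new_centers == centers:    # centers stopped changing
            break
        centers = new_centers
    # 3. Output centers and assignments.
    return centers, clusters

def dist2(p, q):
    # Squared Euclidean distance between two points.
    return sum((a - b) ** 2 for a, b in zip(p, q))

def centroid(cluster):
    # Coordinate-wise mean of a non-empty list of points.
    return tuple(sum(c) / len(cluster) for c in zip(*cluster))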



K-MEANS EXECUTION EXAMPLE, STEPS 1-6

[Six scatter plots (x, y in [0, 2]) trace Lloyd's algorithm with K = 2: two centers are placed arbitrarily, each point is labeled with the nearest center, the centers then move to the centroids of their clusters, and the labels are updated; the process repeats until the centers stop moving]


K-MEANS EXECUTION EXAMPLE

[Three scatter plots compare final clusterings for K = 2, K = 3, and K = 3 with different starting points; different initial centers can lead to different final clusterings]



HOW TO CHOOSE K?

The simplest approach is to start with K = 1 and increase K until the squared error distortion (SED) stops decreasing

The problem is that K = N always achieves a distortion of 0 (each point is its own cluster), so this criterion keeps favoring larger K

Generally, we need to add further constraints (e.g. on model complexity) to obtain non-trivial results
[Plot: SED versus K for K = 1 to 20; the distortion decreases monotonically as K grows]
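To make the stopping rule concrete, here is a sketch that reuses the lloyd_kmeans and squared_error_distortion functions sketched earlier; the relative-improvement cutoff tol is an illustrative assumption (the slides only say that extra constraints are needed):

def choose_k(points, k_max=20, tol=0.05):
    # Increase K until the relative drop in SED falls below tol;
    # without some such cutoff, SED reaches 0 at K = N.
    prev = None
    for k in range(1, k_max + 1):
        centers, _ = lloyd_kmeans(points, k)
        sed = squared_error_distortion(points, centers)
        if prev is not None and (prev - sed) < tol * prev:
            return k - 1
        prev = sed
    return k_max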
CONSERVATIVE K-MEANS ALGORITHM

The Lloyd algorithm is fast, but in each iteration it moves many data points at once, which does not necessarily improve convergence

A more conservative method moves one point at a time, and only if the move improves the overall clustering cost

The smaller the clustering cost of a partition of the data points, the better that clustering is

Different measures (e.g. the squared error distortion) can be used for this clustering cost



K-MEANS "GREEDY" ALGORITHM

ProgressiveGreedyK-Means(k)
  Select an arbitrary partition P into k clusters
  while forever
    bestChange ← 0
    for every cluster C
      for every element i not in C
        if cost(P) – cost(P_i→C) > bestChange
          bestChange ← cost(P) – cost(P_i→C)
          i* ← i
          C* ← C
    if bestChange > 0
      Change partition P by moving i* to C*
    else
      return P
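As a sketch, the same scheme in Python (illustrative; cost is any user-supplied partition cost, e.g. the squared error distortion of the induced centroids, and the starting partition is random):

import random

def progressive_greedy_kmeans(points, k, cost):
    # Start from an arbitrary partition (lists of point indices), then
    # repeatedly apply the single best point move; stop when no move
    # reduces the cost.
    partition = [[] for _ in range(k)]
    for i in range(len(points)):
        partition[random.randrange(k)].append(i)
    while True:
        base = cost(partition)
        best_change, best_move = 0.0, None
        for c in range(k):
            for i in range(len(points)):
                src = next(j for j, cl in enumerate(partition) if i in cl)
                if src == c:
                    continue
                trial = [list(cl) for cl in partition]  # P with i moved to C
                trial[src].remove(i)
                trial[c].append(i)
                change = base - cost(trial)
                if change > best_change:
                    best_change, best_move = change, (i, src, c)
        if best_move is None:
            return partition
        i, src, c = best_move
        partition[src].remove(i)
        partition[c].append(i)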



CONCLUSION: LLOYD'S IS MORE EFFICIENT, BOTH IN RUN TIME AND IN BEST FOUND SED



HTTP://WWW.SPRINGERLINK.COM/CONTENT/K474381227655563/

Euclidean distance is not necessarily the best measure for co-expression.



HIERARCHICAL CLUSTERING

Instead of grouping elements into discrete clusters, produces a 'classification' tree, also called a dendrogram

A more intuitive example probably comes from molecular sequence data (an early application of clustering)

We have a collection of aligned nucleotide sequences from different species, and wish to construct their evolutionary hierarchy/history – a phylogeny.

HTTP://WWW.SCIENCEMAG.ORG/CGI/REPRINT/310/5750/979.PDF



HIERARCHICAL CLUSTERING

Consider the following distance matrix on 5 nucleotide (partial mitochondrial genome) sequences. The values are p-distances, defined as the number of nucleotide differences normalized by the length of the sequence.
Human Chimpanzee Gorilla Orangutan Gibbon

Human - 0.0882682 0.102793 0.159598 0.179688

Chimpanzee - - 0.106145 0.170759 0.1875

Gorilla - - - 0.166295 0.1875

Orangutan - - - - 0.188616

Gibbon - - - - -
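As a sketch, a p-distance between two aligned sequences could be computed like this (illustrative; gap handling is omitted):

def p_distance(seq1, seq2):
    # Fraction of aligned sites at which the two sequences differ.
    assert len(seq1) == len(seq2), "sequences must be aligned"
    return sum(a != b for a, b in zip(seq1, seq2)) / len(seq1)

print(p_distance("ACGTACGT", "ACGTTCGA"))   # -> 0.25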



CLUSTERING PROCEDURE

At each step, we select the two closest sequences and join them to form a clade.

We then replace the two just-joined sequences with their ancestor

This reduces the size of the distance matrix by one

We need to compute the distances from the new ancestor to the remaining sequences
Human Chimpanzee Gorilla Orangutan Gibbon
Human - 0.0882682 0.102793 0.159598 0.179688
Chimpanzee - - 0.106145 0.170759 0.1875
Gorilla - - - 0.166295 0.1875
Orangutan - - - - 0.188616
Gibbon - - - - -



UPDATING DISTANCES

There are multiple strategies for computing the distances from the new 'ancestral' sequence a, which joins sequences m and n, to every other sequence x:

Single linkage: d(x, a) = min[d(x, m), d(x, n)]

Complete linkage: d(x, a) = max[d(x, m), d(x, n)]

WPGMA (Weighted Pair Group Method with Arithmetic Mean): d(x, a) = [d(x, m) + d(x, n)] / 2

UPGMA (Unweighted Pair Group Method with Arithmetic Mean): d(x, a) = [s(m) d(x, m) + s(n) d(x, n)] / [s(m) + s(n)], where s(n) counts the number of actual sequences represented by node n, so that every sequence contributes equally
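The joining procedure can be sketched in Python (illustrative names; dist maps frozensets of node labels to distances, and linkage selects one of the update rules above):

def hierarchical_cluster(names, dist, linkage="complete"):
    # Agglomerative clustering: repeatedly join the closest pair of
    # active nodes and compute distances to the new 'ancestor'.
    dist = dict(dist)               # do not mutate the caller's matrix
    active = list(names)
    size = {n: 1 for n in names}    # s(n): sequences under node n
    while len(active) > 1:
        m, n = min(((a, b) for i, a in enumerate(active)
                           for b in active[i + 1:]),
                   key=lambda pair: dist[frozenset(pair)])
        anc = (m, n)                # nesting the tuples records the tree
        size[anc] = size[m] + size[n]
        for x in active:
            if x in (m, n):
                continue
            dm, dn = dist[frozenset((x, m))], dist[frozenset((x, n))]
            if linkage == "single":
                d = min(dm, dn)
            elif linkage == "complete":
                d = max(dm, dn)
            elif linkage == "upgma":    # size-weighted average
                d = (size[m] * dm + size[n] * dn) / (size[m] + size[n])
            else:                       # "wpgma": plain average
                d = (dm + dn) / 2
            dist[frozenset((x, anc))] = d
        active = [x for x in active if x not in (m, n)] + [anc]
    return active[0]

Run with the five primate names and the p-distance matrix above under complete linkage, this should reproduce the sequence of joins shown on the following slides: human with chimpanzee, then gorilla, then orangutan, then gibbon.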



EXAMPLE CONTINUED
Use complete linkage. Joining human and chimp...
Human Chimpanzee Gorilla Orangutan Gibbon
Human - 0.0882682 0.102793 0.159598 0.179688
Chimpanzee - - 0.106145 0.170759 0.1875
Gorilla - - - 0.166295 0.1875
Orangutan - - - - 0.188616
Gibbon - - - - -

Human-Chimpanzee Gorilla Orangutan Gibbon

Human-Chimpanzee - 0.106145 0.170759 0.1875

Gorilla - - 0.166295 0.1875

Orangutan - - - 0.188616

Gibbon - - - -
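For example, the new distance d(Human-Chimpanzee, Gorilla) = max(0.102793, 0.106145) = 0.106145 is carried into the reduced matrix above.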



Human-Chimpanzee Gorilla Orangutan Gibbon
Human-Chimpanzee - 0.106145 0.170759 0.1875
Gorilla - - 0.166295 0.1875
Orangutan - - - 0.188616
Gibbon - - - -

Human-Chimpanzee-Gorilla Orangutan Gibbon


Human-Chimpanzee-Gorilla - 0.170759 0.1875
Orangutan - - 0.188616
Gibbon - - -



                          Orangutan  Gibbon
Human-Chimpanzee-Gorilla  0.170759   0.1875
Orangutan                 -          0.188616
Gibbon                    -          -

                     Gibbon
Hum-Chimp-Gor-Orang  0.188616
Gibbon               -



A NOTE ON UPGMA
Gorilla Orangutan Gibbon
Human-Chimpanzee 0.104469 0.165179 0.183594
Gorilla - 0.166295 0.1875
Orangutan - - 0.188616
Gibbon - - -

Orangutan Gibbon
Human-Chimpanzee-Gorilla 0.165551 0.184896
Orangutan - 0.188616
Gibbon - -

d(HCG, Orang) = [2 d(HC, Orang) + d(Gor, Orang)] / 3
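Numerically: d(HC, Orang) = (0.159598 + 0.170759) / 2 = 0.165179, and d(HCG, Orang) = (2 × 0.165179 + 0.166295) / 3 = 0.165551, matching the tables above.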



BACK TO MICROARRAYS...

Clustering plots can be interpreted as a gene/condition hierarchy

HTTP://UPLOAD.WIKIMEDIA.ORG/WIKIPEDIA/COMMONS/4/48/HEATMAP.PNG



A FEW OTHER APPLICATIONS



Use clustering of similar sequences in protein databases to reduce
complexity and speed up comparisons. Each cluster of similar
sequences is represented by a single sequence.

Complexity reduction is an important application of clustering



The structure of protein interactions can be represented by a graph

Nodes = proteins, edges = interactions

Look for clusters (densely connected components) in such graphs



Hierarchical clustering can improve protein structure prediction by merging the predictions made by a large number of alternative conformation models



FURTHER READING...



Defines the concept of 'an element belongs to a partition with a probability'

Build a minimum spanning tree and delete the longest edges to create partitions

