Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
8 views4 pages

Assignment7 2023

The document contains a series of questions and answers related to clustering techniques in data mining, including supervised vs. unsupervised learning, specific clustering methods like DBSCAN and K-Means, and concepts such as distance metrics and dendograms. It provides multiple-choice questions with correct answers indicated for each. Key topics covered include hierarchical clustering, single linkage, complete linkage, and the application of clustering algorithms on given sets of points.

Uploaded by

Indu Borra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views4 pages

Assignment7 2023

The document contains a series of questions and answers related to clustering techniques in data mining, including supervised vs. unsupervised learning, specific clustering methods like DBSCAN and K-Means, and concepts such as distance metrics and dendograms. It provides multiple-choice questions with correct answers indicated for each. Key topics covered include hierarchical clustering, single linkage, complete linkage, and the application of clustering algorithms on given sets of points.

Uploaded by

Indu Borra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Data Mining: Assignment Week 7: Clustering

1. Which of the following statement is NOT true about clustering?

A. It is a supervised learning technique

B. It is an unsupervised learning technique

C. It is also known as exploratory data analysis

D. It groups data into homogeneous groups

Answer: A

2. Which of the following clustering technique start with the points as individual
clusters and, at each step, merge the closest pair of clusters

A. K-Means clustering

B. DBSCAN

C. Divisive clustering

D. Agglomerative clustering

Answer: D

3. DBSCAN is a___________ algorithm

A. Partitional clustering

B. Hierarchical clustering

C. Fuzzy clustering

D. Complete clustering

Answer: A
4. The Euclidean distance matrix between four 2-dimensional points, p1, p2, p3, and p4,
is shown below. A possible set of co-ordinate values of these points are:

p1 p2 p3 p4
p1 0 1.00 1.41 1.00
0 4 0
p2 1.00 0 1.00 1.41
0 0 4
p3 1.41 1.00 0 1.00
4 0 0
p4 1.00 1.41 1.00 0
0 4 0

A. p1=(0, 0), p2=(0, 1), p3=(1, 0), p4=(1, 1)

B. p1=(0, 0), p2=(1, 0), p3=(1, 1), p4=(0, 1)

C. p1=(1, 0), p2=(0, 0), p3=(1, 1), p4=(0, 1)

D. p1=(0, 0), p2=(1, 1), p3=(1, 0), p4=(0, 1)

Answer : B

5. The leaves of a dendogram in hierarchical clustering represent?

A. Individual data points

B. Clusters of multiple data points

C. Distances between data points

D. Cluster membership of the data points

Answer : A

6. Distance between two clusters in complete linkage clustering is defined as:

A. Distance between the closest pair of points between the clusters

B. Distance between the furthest pair of points between the clusters

C. Distance between the most centrally located pair of points in the clusters

D. None of the above

Answer : B
7. Consider a set of five 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1),
and p5=(0, 0.5). Euclidean distance is the distance function. Single linkage clustering
is used to cluster the points into two clusters. The clusters are:
A. {p1, p2, p3} {p4, p5}

B. {p1, p4, p5} {p2, p3}

C. {p1, p2, p5} {p3, p4}

D. {p1, p2, p4} {p3, p5}

Answer : B

8. Consider a set of five 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1),
and p5=(0, 0.5). Euclidean distance is the distance function. Complete linkage cluster-
ing is used to cluster the points into two clusters. The clusters are:
A. {p1, p4, p5} {p2, p3}

B. {p1, p2, p3} {p4, p5}

C. {p1, p2, p5} {p3, p4}

D. {p1, p2, p4} {p3, p5}

Answer : A

9. Consider a set of five 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1),
and p5=(0, 0.5). Euclidean distance is the distance function. The k-means algorithm is
used to cluster the points into two clusters. The initial cluster centers are p1 and p5.
The clusters after two iterations of k-means are:
A. {p1, p4, p5} {p2, p3}

B. {p1, p2, p3} {p4, p5}

C. {p3, p4, p5} {p1, p2}

D. {p1, p2, p4} {p3, p5}

Answer: A
10. Given a set of seven 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1),
p5=(0, 0.5), p6=(0, 9), and p7=(5.5, 1). Euclidean distance is the distance function. The
DBSCAN algorithm is used to cluster the points. Epsilon = 1, and MinPts = 2 is used for
DBSCAN. The clusters and outliers obtained are:
A. Clusters: {p1, p3, p4, p5} {p2, p7}; Outlier: p6

B. Clusters: {p1, p2, p3} {p4, p5, p6}; Outlier: p7

C. Clusters: {p1, p4, p5} {p2, p3, p7}; Outlier: p6

D. Clusters: {p1, p4, p5} {p2, p3, p6}; Outlier: p7

Answer: C

You might also like