Q. What is a limitation of using K-Means for clustering?
-
A.
It can only cluster numerical data
-
B.
It assumes clusters are of equal size and density
-
C.
It is not scalable to large datasets
-
D.
It requires a distance metric
Solution
K-Means assumes that clusters are spherical and of similar size and density, which may not be true for all datasets.
Correct Answer:
B
— It assumes clusters are of equal size and density
Learn More →
Q. Which clustering algorithm is based on the concept of density?
-
A.
K-Means
-
B.
Hierarchical Clustering
-
C.
DBSCAN
-
D.
Gaussian Mixture Model
Solution
DBSCAN (Density-Based Spatial Clustering of Applications with Noise) identifies clusters based on the density of data points.
Correct Answer:
C
— DBSCAN
Learn More →
Q. Which clustering algorithm is particularly effective for identifying clusters of varying shapes and densities?
-
A.
K-means
-
B.
Hierarchical clustering
-
C.
DBSCAN
-
D.
Gaussian Mixture Models
Solution
DBSCAN is effective for identifying clusters of varying shapes and densities, as it does not assume spherical clusters.
Correct Answer:
C
— DBSCAN
Learn More →
Q. Which of the following algorithms is commonly used for hierarchical clustering?
-
A.
K-means
-
B.
DBSCAN
-
C.
Agglomerative clustering
-
D.
Gaussian Mixture Models
Solution
Agglomerative clustering is a popular method for hierarchical clustering, where clusters are formed by merging smaller clusters.
Correct Answer:
C
— Agglomerative clustering
Learn More →
Q. Which of the following clustering methods can handle non-spherical clusters?
-
A.
K-Means
-
B.
Hierarchical Clustering
-
C.
DBSCAN
-
D.
All of the above
Solution
DBSCAN can handle non-spherical clusters as it groups points based on density rather than distance.
Correct Answer:
C
— DBSCAN
Learn More →
Q. Which of the following is a limitation of hierarchical clustering?
-
A.
It can only handle small datasets
-
B.
It requires prior knowledge of the number of clusters
-
C.
It is not sensitive to noise
-
D.
It cannot produce a dendrogram
Solution
Hierarchical clustering can be computationally expensive and is generally limited to smaller datasets due to its complexity.
Correct Answer:
A
— It can only handle small datasets
Learn More →
Q. Which of the following metrics is NOT typically used to evaluate clustering performance?
-
A.
Silhouette score
-
B.
Adjusted Rand Index
-
C.
Mean Squared Error
-
D.
Davies-Bouldin Index
Solution
Mean Squared Error is not typically used to evaluate clustering performance; it is more relevant for regression tasks.
Correct Answer:
C
— Mean Squared Error
Learn More →
Showing 1 to 7 of 7 (1 Pages)