Q. In hierarchical clustering, what does the dendrogram represent?
-
A.
The accuracy of the model
-
B.
The hierarchy of clusters
-
C.
The distance between data points
-
D.
The number of features
Solution
A dendrogram visually represents the arrangement of clusters in hierarchical clustering.
Correct Answer:
B
— The hierarchy of clusters
Learn More →
Q. In the context of clustering, what does 'curse of dimensionality' refer to?
-
A.
The increase in computational cost with more dimensions
-
B.
The difficulty in visualizing high-dimensional data
-
C.
The sparsity of data in high dimensions affecting clustering
-
D.
All of the above
Solution
The curse of dimensionality encompasses all these challenges, making clustering in high-dimensional spaces difficult.
Correct Answer:
D
— All of the above
Learn More →
Q. What does the silhouette score measure in clustering?
-
A.
The accuracy of predictions
-
B.
The compactness and separation of clusters
-
C.
The number of clusters
-
D.
The speed of the algorithm
Solution
The silhouette score evaluates how similar an object is to its own cluster compared to other clusters.
Correct Answer:
B
— The compactness and separation of clusters
Learn More →
Q. What is the main advantage of hierarchical clustering?
-
A.
It requires a predefined number of clusters
-
B.
It can produce a dendrogram for visualizing clusters
-
C.
It is faster than K-Means
-
D.
It is less sensitive to noise
Solution
Hierarchical clustering creates a dendrogram, which provides a visual representation of the data's clustering structure.
Correct Answer:
B
— It can produce a dendrogram for visualizing clusters
Learn More →
Q. What is the main advantage of using Gaussian Mixture Models (GMM) over K-Means?
-
A.
GMM can handle non-spherical clusters
-
B.
GMM is faster
-
C.
GMM requires fewer parameters
-
D.
GMM is easier to implement
Solution
GMM can model clusters with different shapes and sizes, unlike K-Means which assumes spherical clusters.
Correct Answer:
A
— GMM can handle non-spherical clusters
Learn More →
Q. What is the main difference between hard and soft clustering?
-
A.
Hard clustering assigns points to one cluster, soft clustering assigns probabilities
-
B.
Soft clustering is faster than hard clustering
-
C.
Hard clustering can handle noise, soft cannot
-
D.
There is no difference
Solution
Hard clustering assigns each data point to a single cluster, while soft clustering assigns probabilities to each cluster.
Correct Answer:
A
— Hard clustering assigns points to one cluster, soft clustering assigns probabilities
Learn More →
Q. What is the purpose of the elbow method in clustering?
-
A.
To determine the optimal number of clusters
-
B.
To visualize cluster separation
-
C.
To evaluate cluster quality
-
D.
To reduce dimensionality
Solution
The elbow method helps identify the optimal number of clusters by plotting the explained variance against the number of clusters.
Correct Answer:
A
— To determine the optimal number of clusters
Learn More →
Q. Which clustering algorithm is based on density?
-
A.
K-Means
-
B.
Hierarchical Clustering
-
C.
DBSCAN
-
D.
Gaussian Mixture Model
Solution
DBSCAN (Density-Based Spatial Clustering of Applications with Noise) identifies clusters based on the density of data points.
Correct Answer:
C
— DBSCAN
Learn More →
Q. Which clustering algorithm is best suited for non-spherical clusters?
-
A.
K-Means
-
B.
DBSCAN
-
C.
Hierarchical Clustering
-
D.
Gaussian Mixture Models
Solution
DBSCAN is effective for identifying clusters of varying shapes and densities, making it suitable for non-spherical clusters.
Correct Answer:
B
— DBSCAN
Learn More →
Q. Which clustering technique can automatically determine the number of clusters?
-
A.
K-Means
-
B.
Agglomerative Clustering
-
C.
DBSCAN
-
D.
Mean Shift
Solution
DBSCAN can automatically determine the number of clusters based on the density of data points.
Correct Answer:
C
— DBSCAN
Learn More →
Q. Which method can be used to determine the optimal number of clusters in K-Means?
-
A.
Elbow Method
-
B.
Cross-Validation
-
C.
Grid Search
-
D.
Random Search
Solution
The Elbow Method helps to determine the optimal number of clusters by plotting the explained variance.
Correct Answer:
A
— Elbow Method
Learn More →
Q. Which of the following is a limitation of K-Means clustering?
-
A.
It can handle large datasets
-
B.
It is sensitive to outliers
-
C.
It can find non-convex clusters
-
D.
It requires no prior knowledge of data
Solution
K-Means is sensitive to outliers, which can skew the results and affect cluster centroids.
Correct Answer:
B
— It is sensitive to outliers
Learn More →
Q. Which of the following is NOT a common application of clustering?
-
A.
Market segmentation
-
B.
Anomaly detection
-
C.
Image classification
-
D.
Document clustering
Solution
Image classification typically involves supervised learning, while clustering is unsupervised.
Correct Answer:
C
— Image classification
Learn More →
Showing 1 to 13 of 13 (1 Pages)