Q. In a classification problem, what does a confusion matrix represent?
A.
The relationship between features
B.
The performance of a classification model
C.
The distribution of data points
D.
The training time of the model
Show solution
Solution
A confusion matrix represents the performance of a classification model by showing the true positive, true negative, false positive, and false negative counts.
Correct Answer:
B
— The performance of a classification model
Learn More →
Q. In regression analysis, what does R-squared indicate?
A.
The strength of the relationship between variables
B.
The proportion of variance explained by the model
C.
The accuracy of predictions
D.
The number of features used in the model
Show solution
Solution
R-squared indicates the proportion of variance in the dependent variable that can be explained by the independent variables in the model.
Correct Answer:
B
— The proportion of variance explained by the model
Learn More →
Q. In supervised learning, what is the role of the training dataset?
A.
To evaluate the model's performance
B.
To tune hyperparameters
C.
To train the model to learn patterns
D.
To visualize data
Show solution
Solution
The training dataset is used to train the model, allowing it to learn patterns and relationships in the data.
Correct Answer:
C
— To train the model to learn patterns
Learn More →
Q. What does the ROC curve represent in classification problems?
A.
The relationship between precision and recall
B.
The trade-off between true positive rate and false positive rate
C.
The accuracy of the model over different thresholds
D.
The distribution of predicted probabilities
Show solution
Solution
The ROC curve illustrates the trade-off between the true positive rate (sensitivity) and the false positive rate at various threshold settings.
Correct Answer:
B
— The trade-off between true positive rate and false positive rate
Learn More →
Q. What is the main advantage of using ensemble methods in supervised learning?
A.
They are simpler to implement
B.
They reduce the risk of overfitting
C.
They combine predictions from multiple models to improve accuracy
D.
They require less data for training
Show solution
Solution
Ensemble methods combine predictions from multiple models, which often leads to improved accuracy and robustness.
Correct Answer:
C
— They combine predictions from multiple models to improve accuracy
Learn More →
Q. What is the main difference between regression and classification?
A.
Regression predicts continuous values, while classification predicts discrete labels
B.
Regression is unsupervised, while classification is supervised
C.
Regression uses more features than classification
D.
There is no difference
Show solution
Solution
The main difference is that regression predicts continuous values, while classification predicts discrete labels.
Correct Answer:
A
— Regression predicts continuous values, while classification predicts discrete labels
Learn More →
Q. What is the purpose of regularization in regression models?
A.
To increase the model complexity
B.
To reduce the training time
C.
To prevent overfitting by penalizing large coefficients
D.
To improve the interpretability of the model
Show solution
Solution
Regularization is used in regression models to prevent overfitting by adding a penalty for large coefficients, thus simplifying the model.
Correct Answer:
C
— To prevent overfitting by penalizing large coefficients
Learn More →
Q. What is the purpose of regularization in supervised learning?
A.
To increase the complexity of the model
B.
To prevent overfitting
C.
To improve training speed
D.
To enhance feature selection
Show solution
Solution
Regularization techniques are used to prevent overfitting by adding a penalty for larger coefficients in the model.
Correct Answer:
B
— To prevent overfitting
Learn More →
Q. Which algorithm is typically used for binary classification tasks?
A.
K-Means Clustering
B.
Linear Regression
C.
Logistic Regression
D.
Principal Component Analysis
Show solution
Solution
Logistic Regression is commonly used for binary classification tasks as it models the probability of a binary outcome.
Correct Answer:
C
— Logistic Regression
Learn More →
Q. Which of the following is a common evaluation metric for regression models?
A.
Accuracy
B.
F1 Score
C.
Mean Absolute Error
D.
Confusion Matrix
Show solution
Solution
Mean Absolute Error (MAE) is a common evaluation metric for regression models, measuring the average magnitude of errors in predictions.
Correct Answer:
C
— Mean Absolute Error
Learn More →
Q. Which of the following techniques can be used to handle imbalanced datasets in classification?
A.
Data augmentation
B.
Feature scaling
C.
Cross-validation
D.
Resampling methods
Show solution
Solution
Resampling methods, such as oversampling the minority class or undersampling the majority class, can help address imbalanced datasets.
Correct Answer:
D
— Resampling methods
Learn More →
Q. Which of the following techniques can help prevent overfitting?
A.
Increasing the number of features
B.
Using a more complex model
C.
Cross-validation
D.
Ignoring validation data
Show solution
Solution
Cross-validation is a technique that helps prevent overfitting by ensuring the model's performance is evaluated on different subsets of the data.
Correct Answer:
C
— Cross-validation
Learn More →
Showing 1 to 12 of 12 (1 Pages)