Feature Engineering and Model Selection - Higher Difficulty Problems

Download Q&A
Q. In the context of model selection, what does cross-validation help to prevent?
  • A. Overfitting
  • B. Underfitting
  • C. Data leakage
  • D. Bias
Q. What is the effect of using polynomial features in a linear regression model?
  • A. It reduces the model complexity
  • B. It can capture non-linear relationships
  • C. It increases the risk of underfitting
  • D. It eliminates multicollinearity
Q. What is the main advantage of using ensemble methods like Random Forest over a single decision tree?
  • A. They are faster to train
  • B. They reduce variance and improve prediction accuracy
  • C. They are easier to interpret
  • D. They require less data
Q. What is the purpose of using regularization techniques in model selection?
  • A. To increase the model's complexity
  • B. To reduce the training time
  • C. To prevent overfitting by penalizing large coefficients
  • D. To improve the interpretability of the model
Q. Which feature transformation technique is used to normalize the range of features?
  • A. One-Hot Encoding
  • B. Min-Max Scaling
  • C. Label Encoding
  • D. Feature Extraction
Q. Which of the following is a common method for handling missing data in a dataset?
  • A. Removing all rows with missing values
  • B. Replacing missing values with the mean or median
  • C. Ignoring the missing values during training
  • D. All of the above
Q. Which of the following is a common method for handling missing data?
  • A. Removing all rows with missing values
  • B. Imputing missing values with the mean or median
  • C. Ignoring missing values during training
  • D. Using a more complex model
Q. Which of the following is a disadvantage of using decision trees for model selection?
  • A. They are easy to interpret
  • B. They can easily overfit the training data
  • C. They handle both numerical and categorical data
  • D. They require less data preprocessing
Q. Which of the following is a disadvantage of using too many features in a model?
  • A. Increased interpretability
  • B. Higher computational cost
  • C. Better model performance
  • D. Reduced risk of overfitting
Q. Which of the following techniques is NOT typically used in feature selection?
  • A. Recursive Feature Elimination
  • B. Principal Component Analysis
  • C. Random Forest Importance
  • D. K-Means Clustering
Showing 1 to 10 of 10 (1 Pages)
Soulshift Feedback ×

On a scale of 0–10, how likely are you to recommend The Soulshift Academy?

Not likely Very likely