A machine learning (ML) specialist must develop a classification model for a financial services company. A domain expert provides the dataset, which is tabular with 10,000 rows and 1,020 features. During exploratory data analysis, the specialist finds no missing values and a small percentage of duplicate rows. For 200 feature pairs, the correlation score is greater than 0.9. The mean value of each feature is similar to its 50th percentile. Which feature engineering strategy should the ML specialist use with Amazon SageMaker?
A) Apply dimensionality reduction by using the principal component analysis (PCA) algorithm.
B) Drop the features with low correlation scores by using a Jupyter notebook.
C) Apply anomaly detection by using the Random Cut Forest (RCF) algorithm.
D) Concatenate the features with high correlation scores by using a Jupyter notebook.
Correct Answer:
Verified
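To make the scenario concrete, here is a minimal sketch of two steps the question refers to: counting feature pairs with a correlation above 0.9, and the dimensionality reduction named in option A. It uses plain NumPy on small synthetic data, not the SageMaker built-in PCA algorithm. The function names `count_correlated_pairs` and `pca_reduce`, the 95% variance target, and the data shape are all assumptions chosen for illustration.

```python
import numpy as np


def count_correlated_pairs(X, threshold=0.9):
    """Count feature pairs whose absolute Pearson correlation exceeds threshold."""
    corr = np.corrcoef(X, rowvar=False)        # features are columns
    upper = np.triu_indices_from(corr, k=1)    # each pair once, no diagonal
    return int(np.sum(np.abs(corr[upper]) > threshold))


def pca_reduce(X, variance_to_keep=0.95):
    """Project X onto the fewest principal components that retain the given variance share."""
    X_std = (X - X.mean(axis=0)) / X.std(axis=0)          # standardize each feature
    _, s, vt = np.linalg.svd(X_std, full_matrices=False)  # rows of vt are components
    explained = s**2 / np.sum(s**2)                       # variance share per component
    k = int(np.searchsorted(np.cumsum(explained), variance_to_keep) + 1)
    return X_std @ vt[:k].T, k


# Synthetic stand-in for the dataset (an assumption, far smaller than 10,000 x 1,020):
# 20 independent features plus 10 noisy copies of the first 10, giving 30 columns.
rng = np.random.default_rng(0)
base = rng.normal(size=(1000, 20))
noisy_copies = base[:, :10] + 0.1 * rng.normal(size=(1000, 10))
X = np.hstack([base, noisy_copies])

n_pairs = count_correlated_pairs(X)
X_reduced, k = pca_reduce(X)
print(f"pairs with |r| > 0.9: {n_pairs}")
print(f"features before: {X.shape[1]}, components kept: {k}")
```

Because each noisy copy carries almost the same information as its original column, the projection needs fewer components than there are input features to retain most of the variance.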