A media company wants to perform machine learning and analytics on the data residing in its Amazon S3 data lake. There are two data transformation requirements that will enable the consumers within the company to create reports: Daily transformations of 300 GB of data with different file formats landing in Amazon S3 at a scheduled time. One-time transformations of terabytes of archived data residing in the S3 data lake. Which combination of solutions cost-effectively meets the company's requirements for transforming the data? (Choose three.)
A) For daily incoming data, use AWS Glue crawlers to scan and identify the schema.
B) For daily incoming data, use Amazon Athena to scan and identify the schema.
C) For daily incoming data, use Amazon Redshift to perform transformations.
D) For daily incoming data, use AWS Glue workflows with AWS Glue jobs to perform transformations.
E) For archived data, use Amazon EMR to perform data transformations.
F) For archived data, use Amazon SageMaker to perform data transformations.
Correct Answer:
Verified
Q20: A company wants to optimize the cost
Q21: A US-based sneaker retail company launched its
Q22: A regional energy company collects voltage data
Q23: A company is migrating its existing on-premises
Q24: A real estate company has a mission-critical
Q26: A company launched a service that produces
Q27: A company is streaming its high-volume billing
Q28: A company's marketing team has asked for
Q29: A company has developed an Apache Hive
Q30: A financial company hosts a data lake
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents