A company is planning to create a data lake in Amazon S3. The company wants to create tiered storage based on access patterns and cost objectives. The solution must include support for JDBC connections from legacy clients, metadata management that allows federation for access control, and batch-based ETL using PySpark and Scala. Operational management should be limited. Which combination of components can meet these requirements? (Choose three.)
A) AWS Glue Data Catalog for metadata management
B) Amazon EMR with Apache Spark for ETL
C) AWS Glue for Scala-based ETL
D) Amazon EMR with Apache Hive for JDBC clients
E) Amazon Athena for querying data in Amazon S3 using JDBC drivers
F) Amazon EMR with Apache Hive, using a metastore backed by a MySQL-compatible Amazon RDS database
Correct Answer:
Verified
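The tiered-storage requirement in the question can be illustrated with a minimal sketch of an S3 lifecycle configuration, written here as a plain Python dictionary. The prefix `raw/`, the rule ID, the day thresholds, and the helper name `build_lifecycle_configuration` are all hypothetical choices for illustration and do not come from the question. The dictionary follows the shape accepted by boto3's `put_bucket_lifecycle_configuration` call, but the sketch does not contact AWS.

```python
def build_lifecycle_configuration(prefix, ia_after_days=30, archive_after_days=90):
    """Build a lifecycle configuration that tiers objects under `prefix`.

    Hypothetical helper: the thresholds are illustrative assumptions,
    not recommendations taken from the question.
    """
    # S3 requires objects to be at least 30 days old before they can
    # transition to STANDARD_IA.
    if ia_after_days < 30:
        raise ValueError("ia_after_days must be at least 30")
    if archive_after_days <= ia_after_days:
        raise ValueError("archive_after_days must be greater than ia_after_days")

    return {
        "Rules": [
            {
                # Rule ID derived from the prefix, e.g. "raw/" -> "tier-raw".
                "ID": f"tier-{prefix.strip('/') or 'all'}",
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [
                    # Infrequently accessed data moves to a cheaper tier first.
                    {"Days": ia_after_days, "StorageClass": "STANDARD_IA"},
                    # Older data moves to archival storage.
                    {"Days": archive_after_days, "StorageClass": "GLACIER"},
                ],
            }
        ]
    }


# Example: tier everything under the assumed "raw/" prefix.
config = build_lifecycle_configuration("raw/")
```

With boto3 installed and credentials configured, a dictionary like `config` could be passed as the `LifecycleConfiguration` argument of `put_bucket_lifecycle_configuration`, alongside a `Bucket` name of your own.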