A company is building a data lake and needs to ingest data from a relational database that has time-series data. The company wants to use managed services to accomplish this. The process needs to be scheduled daily and bring incremental data only from the source into Amazon S3. What is the MOST cost-effective approach to meet these requirements?
A) Use AWS Glue to connect to the data source using JDBC Drivers. Ingest incremental records only using job bookmarks.
B) Use AWS Glue to connect to the data source using JDBC Drivers. Store the last updated key in an Amazon DynamoDB table and ingest the data using the updated key as a filter.
C) Use AWS Glue to connect to the data source using JDBC Drivers and ingest the entire dataset. Use appropriate Apache Spark libraries to compare the dataset, and find the delta.
D) Use AWS Glue to connect to the data source using JDBC Drivers and ingest the full data. Use AWS DataSync to ensure the delta only is written into Amazon S3.
Correct Answer:
Verified
Q124: A power utility company is deploying thousands
Q125: A large telecommunications company is planning to
Q126: An ecommerce company is migrating its business
Q127: A company is sending historical datasets to
Q128: An education provider's learning management system (LMS)
Q130: A data analytics specialist is setting up
Q131: A transport company wants to track vehicular
Q132: A company has a data lake on
Q133: A hospital is building a research data
Q134: A retail company has 15 stores across
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents