A company has developed several AWS Glue jobs to validate and transform its data from Amazon S3 and load it into Amazon RDS for MySQL in batches once every day. The ETL jobs read the S3 data using a DynamicFrame. Currently, the ETL developers are experiencing challenges in processing only the incremental data on every run, as the AWS Glue job processes all the S3 input data on each run. Which approach would allow the developers to solve the issue with minimal coding effort?
A) Have the ETL jobs read the data from Amazon S3 using a DataFrame.
B) Enable job bookmarks on the AWS Glue jobs.
C) Create custom logic on the ETL jobs to track the processed S3 objects.
D) Have the ETL jobs delete the processed objects or data from Amazon S3 after each run.
Correct Answer:
Verified
Q70: Three teams of data analysts use Apache
Q71: A manufacturing company uses Amazon S3 to
Q72: A retail company's data analytics team recently
Q73: A company has a data warehouse in
Q74: A media analytics company consumes a stream
Q76: A company is migrating from an on-premises
Q77: A media company is using Amazon QuickSight
Q78: An operations team notices that a few
Q79: A company has an application that uses
Q80: A central government organization is collecting events
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents