A company wants to provide its data analysts with uninterrupted access to the data in its Amazon Redshift cluster. All data is streamed to an Amazon S3 bucket with Amazon Kinesis Data Firehose. An AWS Glue job that is scheduled to run every 5 minutes issues a COPY command to move the data into Amazon Redshift. The amount of data delivered is uneven throughout the day, and cluster utilization is high during certain periods. The COPY command usually completes within a couple of seconds. However, when load spike occurs, locks can exist and data can be missed. Currently, the AWS Glue job is configured to run without retries, with timeout at 5 minutes and concurrency at 1. How should a data analytics specialist configure the AWS Glue job to optimize fault tolerance and improve data availability in the Amazon Redshift cluster?
A) Increase the number of retries. Decrease the timeout value. Increase the job concurrency.
B) Keep the number of retries at 0. Decrease the timeout value. Increase the job concurrency.
C) Keep the number of retries at 0. Decrease the timeout value. Keep the job concurrency at 1.
D) Keep the number of retries at 0. Increase the timeout value. Keep the job concurrency at 1.
Correct Answer:
Verified
Q115: A market data company aggregates external data
Q116: A company owns facilities with IoT devices
Q117: A bank operates in a regulated environment.
Q118: A company wants to use an automatic
Q119: A company is hosting an enterprise reporting
Q121: A company receives data from its vendor
Q122: A company uses Amazon Redshift for its
Q123: A company analyzes historical data and needs
Q124: A power utility company is deploying thousands
Q125: A large telecommunications company is planning to
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents