A data analyst is using AWS Glue to organize, cleanse, validate, and format a 200 GB dataset. The data analyst triggered the job to run with the Standard worker type. After 3 hours, the AWS Glue job status is still RUNNING. Logs from the job run show no error codes. The data analyst wants to improve the job execution time without overprovisioning. Which actions should the data analyst take?
A) Enable job bookmarks in AWS Glue to estimate the number of data processing units (DPUs). Based on the profiled metrics, increase the value of the executor-cores job parameter.
B) Enable job metrics in AWS Glue to estimate the number of data processing units (DPUs). Based on the profiled metrics, increase the value of the maximum capacity job parameter.
C) Enable job metrics in AWS Glue to estimate the number of data processing units (DPUs). Based on the profiled metrics, increase the value of the spark.yarn.executor.memoryOverhead job parameter.
D) Enable job bookmarks in AWS Glue to estimate the number of data processing units (DPUs). Based on the profiled metrics, increase the value of the num-executors job parameter.
Correct Answer: