An airline has been collecting metrics on flight activities for analytics. A recently completed proof of concept demonstrates how the company provides insights to data analysts to improve on-time departures. The proof of concept used objects in Amazon S3, which contained the metrics in .csv format, and used Amazon Athena for querying the data. As the amount of data increases, the data analyst wants to optimize the storage solution to improve query performance. Which options should the data analyst use to improve performance as the data lake grows? (Choose three.)
A) Add a randomized string to the beginning of the keys in S3 to get more throughput across partitions.
B) Use an S3 bucket in the same account as Athena.
C) Compress the objects to reduce the data transfer I/O.
D) Use an S3 bucket in the same Region as Athena.
E) Preprocess the .csv data to JSON to reduce I/O by fetching only the document keys needed by the query.
F) Preprocess the .csv data to Apache Parquet to reduce I/O by fetching only the data blocks needed for predicates.
Correct Answer:
Verified
Q86: A company wants to provide its data
Q87: A company wants to research user turnover
Q88: A company wants to enrich application logs
Q89: A large retailer has successfully migrated to
Q90: A retail company wants to use Amazon
Q92: An online retail company is migrating its
Q93: A company is building a service to
Q94: A retail company leverages Amazon Athena for
Q95: A company has a marketing department and
Q96: A company has an encrypted Amazon Redshift
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents