A banking company wants to collect large volumes of transactional data using Amazon Kinesis Data Streams for real-time analytics. The company uses PutRecord to send data to Amazon Kinesis, and has observed network outages during certain times of the day. The company wants to obtain exactly once semantics for the entire processing pipeline. What should the company do to obtain these characteristics?
A) Design the application so it can remove duplicates during processing be embedding a unique ID in each record.
B) Rely on the processing semantics of Amazon Kinesis Data Analytics to avoid duplicate processing of events.
C) Design the data producer so events are not ingested into Kinesis Data Streams multiple times.
D) Rely on the exactly one processing semantics of Apache Flink and Apache Spark Streaming included in Amazon EMR.
Correct Answer:
Verified
Q37: An insurance company has raw data in
Q38: A company has a business unit uploading
Q39: A company wants to improve the data
Q40: A data analyst is using AWS Glue
Q41: A company needs to store objects containing
Q43: A retail company is building its data
Q44: A company has 1 million scanned documents
Q45: A media company has been performing analytics
Q46: A data engineering team within a shared
Q47: An online gaming company is using an
Unlock this Answer For Free Now!
View this answer and more for free by performing one of the following actions
Scan the QR code to install the App and get 2 free unlocks
Unlock quizzes for free by uploading documents