You need to create a job that performs frequency analysis on input data. You will do this by writing a Mapper that uses TextInputFormat and splits each value (a line of text from an input file) into individual characters. For each of these characters, you will emit the character as the key and an IntWritable as the value. Because this produces proportionally more intermediate data than input data, which two resources should you expect to be bottlenecks?
A) Processor and network I/O
B) Disk I/O and network I/O
C) Processor and RAM
D) Processor and disk I/O
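
To make the "more intermediate data than input data" premise concrete, here is a minimal sketch in plain Java with no Hadoop dependency. The class name `CharFrequencyMapSketch` and both helper methods are hypothetical stand-ins, not Hadoop API: `map` imitates what the described Mapper would emit for one line, and `intermediateBytes` is a rough size estimate that assumes one UTF-8 encoded key plus a 4-byte integer per pair and ignores any length prefixes or record framing a real job would add.

```java
import java.nio.charset.StandardCharsets;
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class CharFrequencyMapSketch {

    // Hypothetical stand-in for the Mapper's map() call:
    // emits one (character, 1) pair for every character in the line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (int i = 0; i < line.length(); i++) {
            out.add(new SimpleEntry<>(String.valueOf(line.charAt(i)), 1));
        }
        return out;
    }

    // Rough estimate (assumption): UTF-8 bytes of the key plus 4 bytes
    // for the integer value, with no framing overhead counted.
    static long intermediateBytes(List<Map.Entry<String, Integer>> pairs) {
        long total = 0;
        for (Map.Entry<String, Integer> pair : pairs) {
            total += pair.getKey().getBytes(StandardCharsets.UTF_8).length + 4;
        }
        return total;
    }

    public static void main(String[] args) {
        String line = "hello world";
        List<Map.Entry<String, Integer>> pairs = map(line);
        long inputBytes = line.getBytes(StandardCharsets.UTF_8).length;
        long outputBytes = intermediateBytes(pairs);
        System.out.println("input bytes:        " + inputBytes);
        System.out.println("intermediate pairs: " + pairs.size());
        System.out.println("intermediate bytes: " + outputBytes);
    }
}
```

Under these assumptions every input character becomes at least five bytes of map output, so the intermediate data is several times the size of the input. In a MapReduce job, map output is written to local disk and then copied across the network to the reducers during the shuffle, which is where that extra volume is felt.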