Cloudera Certified Administrator for Apache Hadoop CDH4 Upgrade Exam (CCAH) Sample Questions:
1. You are running a Hadoop cluster with NameNode on host mynamenode, a secondary NameNode on host mysecondary and DataNodes.
Which best describes how you determine when the last checkpoint happened?
A) Execute hdfs dfsadmin -report on the command line in and look at the Last Checkpoint information.
B) Connect to the web UI of the Secondary NameNode (http://mysecondarynamenode:50090) and look at the "Last Checkpoint" information
C) Execute hdfs dfsadmin -saveNameSpace on the command line which returns to you the last checkpoint value in fstime file.
D) Connect to the web UI of the NameNode (http://mynamenode:50070/) and look at the "Last Checkpoint" information
2. What metadata is stored on a DataNode when a block is written to it?
A) Node location of each block belonging to the same namespace.
B) Information on the file's location in HDFS.
C) Checksums for the data in the block, as a separate file.
D) None. Only the block itself is written.
3. Identify the function performed by the Secondary NameNode daemon on a cluster configured to run with a single NameNode.
A) In this configuration, the Secondary NameNode performs deal-time backups of the NameNode.
B) In this configuration, the Secondary NameNode performs a checkpoint operation on the files by the NameNode.
C) In this configuration, the Secondary NameNode servers as alternate data channel for clients to reach HDFS, should the NameNode become too busy.
D) In this configuration, the Secondary NameNode is standby NameNode, ready to failover and provide high availability.
4. Your cluster has nodes in seven racks, and you have provided a rack topology script. What is Hadoop's block placement policy, assuming a block replication factor of three?
A) Because there are seven racks the block is written to a node on each rack
B) One copy of the block is written to a node in each of three racks
C) All three of the block are written to nodes on the same rack
D) One copy of the block is written to a node in one rack; two copies are written to two nodes in a different rack
5. Compare the hardware requirements of the NameNode with that of the DataNodes in a Hadoop cluster running MapReduce v1 (MRv1):
A) The NameNode requires more memory and requires greater disk capacity than the DataNodes.
B) The NameNode and DataNodes should the same hardware configuration.
C) The NameNode requires more memory and no disk drives.
D) The NameNode requires less memory and less disk capacity than the DataNodes.
E) The NameNode requires more memory but less disk capacity.
Solutions:
Question # 1 Answer: B | Question # 2 Answer: A | Question # 3 Answer: B | Question # 4 Answer: D | Question # 5 Answer: E |