Cloudera Certified Administrator for Apache Hadoop CDH4 Sample Questions:
1. Using cloudera manager on CDH4 cluster running mapreduce V1(MRv1), you delete a tasktracker role instance from a host that also a runs a datanode role instance and a region server role instance.cloudera
Manager make changes to the cluster and prompts you to the accept the changes.
What other configuration option will cloudera manager automatically prompt you to change?
A) The option to failover to the instance by namenode
B) the option to immediately rebalance the cluster
C) The option to change java maximum heap sizes for the other role instances
D) the option to specify an alternate slave host to place the received data node role instance
2. How does the NameNode know DataNodes are available on a cluster running MapReduce v1 (MRv1)
A) DataNodes heartbeat in the master on a regular basis.
B) The NameNode send a broadcast across the network when it first starts, and DataNodes
respond.
C) The NameNode broadcasts a heartbeat on the network on a regular basis, and DataNodes respond.
D) DataNodes listed in the dfs.hosts file. The NameNode uses as the definitive list of available DataNodes.
3. How does HDFS Federation help HDFS Scale horizontally?
A) HDFS Federation improves the resiliency of HDFS in the face of network issues by removing the NameNode as a single-point-of-failure.
B) HDFS Federation reduces the load on any single NameNode by using the multiple, independent NameNode to manage individual pars of the filesystem namespace.
C) HDFS Federation provides cross-data center (non-local) support for HDFS, allowing a cluster administrator to split the Block Storage outside the local cluster.
D) HDFS Federation allows the Standby NameNode to automatically resume the services of an active NameNode.
4. What occurs when you run a hadoop job specifying output directory for job output which already exists in HDFS?
A) an error will occur after the mappers have completed but before any reducers begin to run because the output path must not exist during the shuffle and sort.
B) the job will run successfully. Output from the reducers will override the contents of existing directory
C) An error will occur immediately because the output directory must not already exist.
D) the job will run successfully output from the reducers will be placed in a directory called job output -1
5. You set the value of dfs.block.size to 64MB in hdfs-site.xml on a client machine, but you set the same property to 128MB on your clusters name node. What happens when the client writes a file to HDFS?
A) A block size of 64MB will be used
B) An execution will be thrown when the client attempts to write the file, because the values are different.
C) A block size of 128MB will be used
D) The file will be written successfully with a block size of 64MB but client attempting to read the file will fail because the namenode believes the blocks to be 128MB in size
Solutions:
Question # 1 Answer: D | Question # 2 Answer: A | Question # 3 Answer: B | Question # 4 Answer: C | Question # 5 Answer: A |