Big Data Training ( Hadoop,HDFS, PIG, HIVE): What is Secondary Namenode and what it actually does?

Monday, 17 August 2015

What is Secondary Namenode and what it actually does?

Mainly SecondaryNameNode server is used as a backup of NameNode server in Hadoop Distributed File system to recreate the NameNode server if it goes down. As total HDFS depends on the NameNode we have to recover it.

If you are building a test cluster, you don’t need to set up secondary name node on a different machine, something like pseudo cluster install steps. However if you’re building out a real distributed cluster, you must move secondary node to other machine and it is a great idea. You can have Secondary Namenode on a different machine other than NameNode in case the Namenode goes down.

NameNode actually holds the edit log which contains the metadata information of the NameNode. In every few second the edit log is sent to the Secondary NameNode. Secondary NameNode can not become actual NameNode itself, because after some time gap the edit log is sent to the SecondaryNameNode. If the NameNode goes down the Secondary NameNode lost few data but it has a persistent copy of the Namenode server’s metadata. So we can recreate the NameNode.

Big Data Training ( Hadoop,HDFS, PIG, HIVE)

Monday, 17 August 2015

What is Secondary Namenode and what it actually does?

No comments:

Post a Comment