I have installed Hadoop in pseudo-distributed mode on my laptop, which runs Ubuntu.
I have changed the paths where Hadoop stores its data (by default, Hadoop stores data under /tmp).
My hdfs-site.xml file looks like this:
<property>
<name>dfs.data.dir</name>
<value>/HADOOP_CLUSTER_DATA/data</value>
</property>
Now, whenever I restart the machine and try to start the Hadoop cluster using the start-all.sh script, the DataNode never starts. I confirmed that the DataNode is not running by checking the logs and by using the jps command.
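For reference, the jps check can be scripted roughly as follows. This is only a sketch: has_daemon and report_daemons are hypothetical helper names (not part of Hadoop), and the script assumes jps prints one "<pid> <ClassName>" line per JVM and that the HDFS daemons show up as NameNode, DataNode and SecondaryNameNode.

```shell
# Sketch: report which HDFS daemons appear in `jps` output.
# has_daemon and report_daemons are hypothetical helpers, not Hadoop commands.

# Reads jps-style lines ("<pid> <ClassName>") on stdin and succeeds
# only if the second field of some line equals the given daemon name.
has_daemon() {
  awk -v name="$1" '$2 == name { found = 1 } END { exit !found }'
}

# Takes a jps listing as its first argument and prints one status
# line per HDFS daemon expected in a pseudo-distributed setup.
report_daemons() {
  for daemon in NameNode DataNode SecondaryNameNode; do
    if printf '%s\n' "$1" | has_daemon "$daemon"; then
      echo "$daemon: running"
    else
      echo "$daemon: NOT running"
    fi
  done
}

# Usage (requires a JDK so that jps is on PATH):
#   report_daemons "$(jps)"
```

In my case, a check like this reports the DataNode as not running after a reboot.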
Then I:
- Stopped the cluster using the stop-all.sh script.
- Formatted HDFS using the hadoop namenode -format command.
- Started the cluster using the start-all.sh script.
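The steps above can be sketched as one shell function. The commands themselves are the ones listed above. The run wrapper, the DRY_RUN variable and the function name reformat_and_restart are hypothetical additions for illustration, so the sequence can be previewed without touching the cluster. Note that formatting the NameNode discards the existing HDFS metadata.

```shell
# run: hypothetical helper. Prints the command when DRY_RUN=1,
# otherwise executes it.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# WARNING: `hadoop namenode -format` discards existing HDFS metadata.
# Each step runs only if the previous one succeeded.
reformat_and_restart() {
  run stop-all.sh &&
  run hadoop namenode -format &&
  run start-all.sh
}

# Usage:
#   DRY_RUN=1 reformat_and_restart   # preview the commands only
#   reformat_and_restart             # actually run them
```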
Now everything works fine, even if I stop and start the cluster again. The problem occurs only when I restart the machine and then try to start the cluster.
- Has anyone encountered a similar problem?
- Why is this happening?
- How can we solve this problem?