Tuesday 18 August 2015

What does the term ‘Data Locality’ mean in Hadoop?



One of the most important features of Hadoop is that it processes data locally as far as possible. When we run a MapReduce job, each map task is sent to a DataNode that already holds the data the task will work on, so the TaskTracker on that node runs the map code against the data block(s) stored on that local node instead of pulling them across the network. Moving the computation to the data is usually much cheaper than moving the data to the computation. This is known as data locality.
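
To make the idea concrete, here is a minimal sketch of locality-aware task placement in Python. It is a toy model, not Hadoop's actual scheduler: the function name `assign_map_tasks`, the block ids, and the node names are all hypothetical, and real Hadoop also takes rack locality into account, which this sketch ignores. It only shows the "as far as possible" part: a task goes to a node holding the block if one has a free slot, and otherwise falls back to a remote node.

```python
# Toy model of locality-aware task placement; NOT Hadoop's real scheduler.
# All names here (assign_map_tasks, blk_1, node1, ...) are hypothetical.

def assign_map_tasks(block_locations, free_slots):
    """Assign one map task per block, preferring nodes that hold the block.

    block_locations: dict of block id -> list of nodes holding a replica
    free_slots: dict of node name -> number of free task slots
    Returns a dict of block id -> (node, is_local).
    """
    slots = dict(free_slots)  # copy so the caller's dict is not mutated
    assignments = {}
    for block, replicas in block_locations.items():
        # First choice: a node that stores a replica and has a free slot.
        local = [n for n in replicas if slots.get(n, 0) > 0]
        if local:
            node, is_local = local[0], True
        else:
            # Fallback: any node with a free slot; the block must then
            # be read over the network.
            remote = [n for n, s in slots.items() if s > 0]
            if not remote:
                raise RuntimeError("no free task slots")
            node, is_local = remote[0], False
        slots[node] -= 1
        assignments[block] = (node, is_local)
    return assignments


blocks = {
    "blk_1": ["node1", "node2"],
    "blk_2": ["node1", "node3"],
    "blk_3": ["node1"],
}
free = {"node1": 1, "node2": 1, "node3": 1}

result = assign_map_tasks(blocks, free)
for block, (node, is_local) in result.items():
    print(block, "->", node, "(local)" if is_local else "(remote)")
```

In this example the first two tasks run on nodes that hold their blocks, while the task for `blk_3` has to run remotely because the only node storing that block has no free slot left.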


