Best unofficial Apache Server developers community |
|
Somewhat of an odd question, but does anyone know what kind of sort MapReduce uses in the sort portion of shuffle/sort? I would think merge or insertion (in keeping with the whole MapReduce paradigm), but I'm not sure.
posted via StackOverflow
|
![]()  
|
It's Quicksort, afterwards the sorted intermediate outputs get merged together. Quicksort checks the recursion depth and gives up when it is too deep. If this is the case, Heapsort is used. Have a look at the Quicksort class:
You can change the algorithm used via the map.sort.class value in the hadoop-default.xml. |