HaDoop MapReduce to detect the author of a mysterious doc based on 1.2GB corpus with Cosine Similarity 20 April 2016 #school #project #learning #share