Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search
MARYLAND UNIV COLLEGE PARK DEPT OF COMPUTER SCIENCE
Pagination or Media Count:
This paper describes Ivory, an attempt to build a distributed retrieval system around the open-source Hadoop implementation of MapReduce. We focus on three noteworthy aspects of our work a retrieval architecture built directly on the Hadoop Distributed File System HDFS, a scalable Map-Reduce algorithm for inverted indexing, and webpage classification to enhance retrieval effectiveness.
- Computer Programming and Software
- Computer Systems