Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
ISBN/ASIN: 1608453421
ISBN-13: 9781608453429
Number of pages: 175
Description:
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only aims to help the reader 'think in MapReduce', but also discusses the limitations of the programming model.
Similar books

by C.J. Date, Hugh Darwen - Addison Wesley
This is a book on database management based on an earlier book by the same authors. It can be seen as an abstract blueprint for the design of a DBMS and the language interface to such a DBMS. It serves as a basis for a model of type inheritance.

by S. Yuan, A.Z. Abidin, M. Sloan, J. Wang - arXiv
A comprehensive survey of Internet advertising that discusses the research issues, identifies recent technologies, and suggests future directions. We start with a brief history, introduction, and classification of the industry.

by J. M. Hellerstein, M. Stonebraker - UC Berkeley
These lecture notes provide students and professionals with a grounding in database research and a technical context for understanding recent innovations in the field. The readings included treat the most important issues in the database area.

by Tony Gill, et al. - Getty Publications
This book provides an overview of metadata, its types, roles, and characteristics; a discussion of metadata as it relates to resources on the Web; and a description of the methods, tools, standards, and protocols used to publish digital collections.