Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
ISBN/ASIN: 1608453421
ISBN-13: 9781608453429
Number of pages: 175
Description:
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book intends not only to help the reader 'think in MapReduce', but also to discuss the limitations of the programming model.
Download or read it online for free here:
Download link
(1.7MB, PDF)
Similar books
Concurrency Control and Recovery in Database Systems
by P. A. Bernstein, V. Hadzilacos, N. Goodman - Addison Wesley
This book is about techniques for concurrency control and recovery. It covers techniques for centralized and distributed computer systems, and for single copy, multiversion, and replicated databases. Example applications are included.
(23020 views)
Database Systems and Structures
by Osmar R. Zaiane - Simon Fraser University
An introduction to data models, database systems, the structure and use of relational database systems and relational languages, indexing and storage management, query processing in relational databases, and the theory of relational database design.
(16107 views)
Database Explorations
by C.J. Date, Hugh Darwen
The database field is full of important problems still to be solved and interesting issues still to be examined -- and some of those problems and issues are explored in this book. It reports on some of our most recent investigations in this field.
(6707 views)
Data Mining and Knowledge Discovery in Real Life Applications
by Julio Ponce, Adem Karahoca - InTech
This book presents theoretical and practical advances in data mining and its applications in a range of promising areas. It is intended to serve as a data mining bible that points students, researchers, and practitioners in the right direction.
(16954 views)