Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
Number of pages: 175
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader 'think in MapReduce', but also discusses limitations of the programming model as well.
Home page url
Download or read it online for free here:
by Anand Rajaraman, Jeffrey D. Ullman - Stanford University
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
by Shigeaki Sakurai (ed.) - InTech
Text mining techniques are studied aggressively in order to extract the knowledge from the data. This book introduces advanced text mining techniques. They are various techniques from relation extraction to under or less resourced language.
by Arno Jan Knobbe - IOS Press
This thesis is concerned with Data Mining: extracting useful insights from large collections of data. With the increased possibilities in modern society for companies and institutions to gather data, this subject has become of increasing importance.
- Fujitsu Siemens Computers
This book is an introduction to storage technologies and storage networks. It also provides an overview of the storage product portfolio of Fujitsu Siemens Computers which is the basis for solutions that help you manage the growing flood of data.