Mining of Massive Datasets
by Anand Rajaraman, Jeffrey D. Ullman
Publisher: Stanford University 2010
Number of pages: 340
Description:
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
Download or read it online for free here:
Download link
(2MB, PDF)
Similar books

by P. A. Bernstein, V. Hadzilacos, N. Goodman - Addison Wesley
This book is about techniques for concurrency control and recovery. It covers techniques for centralized and distributed computer systems, and for single copy, multiversion, and replicated databases. Example applications are included.
(24513 views)

by Eugenia G. Giannopoulou - InTech
This book brings together the most recent advances of data mining research in the promising areas of medicine and biology. It consists of seventeen chapters which describe interesting applications, motivating progress and worthwhile results.
(23261 views)

by Jimmy Lin, Chris Dyer - Morgan & Claypool Publishers
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns.
(16707 views)

by Graham Williams - Togaware Pty Ltd
Data mining is about building models from data. We build models to gain insights into the world and how the world works. A data miner, in building models, deploys many different data analysis and model building techniques.
(27915 views)