Mining of Massive Datasets
by Anand Rajaraman, Jeffrey D. Ullman
Publisher: Stanford University 2010
Number of pages: 340
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
Home page url
Download or read it online for free here:
by Jimmy Lin, Chris Dyer - Morgan & Claypool Publishers
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns.
by C.J. Date, Hugh Darwen - Addison Wesley
This is a book on database management based on an earlier book by the same authors. It can be seen as an abstract blueprint for the design of a DBMS and the language interface to such a DBMS. It serves as a basis for a model of type inheritance.
by David Maier - Computer Science Press
The book is intended for a second course in databases and a reference for researchers in the field. The material covered includes relational algebra, functional dependencies, multivalued and join dependencies, normal forms, representation theory...
by Clinton Gormley, Zachary Tong - O'Reilly
Whether you need full-text search or real-time analytics of data, this book introduces you to the fundamental concepts required to start working with Elasticsearch. With these foundations laid, it will move on to more-advanced search techniques.