Mining of Massive Datasets
by Anand Rajaraman, Jeffrey D. Ullman
Publisher: Stanford University 2010
Number of pages: 340
At the highest level of description, this book is about data mining. However, it focuses on data mining of very large amounts of data, that is, data so large it does not fit in main memory. Because of the emphasis on size, many of our examples are about the Web or data derived from the Web.
Home page url
Download or read it online for free here:
- National Academies Press
Using big data analytics to identify complex patterns hidden inside volumes of data that have never been combined could accelerate the rate of scientific discovery and lead to the development of beneficial technologies and products.
by Christian S. Jensen - Aalborg University
Topics covered: the semantics of temporal data, the design of data models and languages for temporal data, the design of databases expressed in terms of temporal data models as well as temporally enhanced design of conventional databases.
by E. F. Codd - Addison-Wesley
Written by the originator of the relational model, the book covers the practical aspects of the design of relational databases. The author defines twelve rules that database management systems need to follow in order to be described as relational.
Data mining comprises techniques and algorithms, for determining interesting patterns from large datasets. There are currently hundreds algorithms that perform tasks such as frequent pattern mining, clustering, and classification, among others.