Data-Intensive Text Processing with MapReduce
by Jimmy Lin, Chris Dyer
Publisher: Morgan & Claypool Publishers 2010
Number of pages: 175
This book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning. We introduce the notion of MapReduce design patterns, which represent general reusable solutions to commonly occurring problems across a variety of problem domains. This book not only intends to help the reader 'think in MapReduce', but also discusses limitations of the programming model as well.
Home page url
Download or read it online for free here:
by Neeraj Sharma, at al. - IBM Corporation
This free e-book teaches you the fundamentals of databases, including relational database theory, logical and physical database design, and the SQL language. Advanced topics include using functions, stored procedures and XML.
- National Academies Press
Using big data analytics to identify complex patterns hidden inside volumes of data that have never been combined could accelerate the rate of scientific discovery and lead to the development of beneficial technologies and products.
by David Maier - Computer Science Press
The book is intended for a second course in databases and a reference for researchers in the field. The material covered includes relational algebra, functional dependencies, multivalued and join dependencies, normal forms, representation theory...
by S. Yuan, A.Z. Abidin, M. Sloan, J. Wang - arXiv
A comprehensive survey on Internet advertising, discussing the research issues, identifying the recent technologies, and suggesting its future directions. We start with a brief history, introduction, and classification of the industry.