Data Wrangling Handbook
by Open Knowledge Foundation
Publisher: School of Data 2012
The Data Wrangling Handbook is a companion text to the School of Data. Its function is something like a traditional textbook -- it will provide the detail and background theory to support the School of Data courses and challenges.
Home page url
Download or read it online for free here:
- IBM Redbooks
The book provides lots of information for developers, including code samples for creating PHP applications with DB2, Informix Dynamic Server, and Cloudscape. We use the latest PHP data access extensions including: PHP Data Objects and ibm_db2.
by Karl Seguin - openmymind.net
MongoDB is a document-oriented database -- it should be viewed as an alternative to relational databases. This book covers a number of topics with a focus on the fundamentals you will need to get comfortably up and running.
by Alan F Gates - O'Reilly Media
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs. The structure of Pig programs is amenable to parallelization, which enables them to handle very large data sets.
by Lars George - O'Reilly Media
If you are looking for a solution to accommodate a virtually endless amount of data, this book will show you how Apache HBase can fulfill your needs. HBase scales to billions of rows and columns, while ensuring that performance remain constant.