Programming Pig
by Alan F Gates
Publisher: O'Reilly Media 2011
Number of pages: 222
Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
Download or read it online for free here:
Download link
(6.4MB, PDF)
Similar books

by Lars George - O'Reilly Media
If you are looking for a solution to accommodate a virtually endless amount of data, this book will show you how Apache HBase can fulfill your needs. HBase scales to billions of rows and columns, while ensuring that performance remains constant.
(9861 views)

by J. C. Anderson, J. Lehnardt, N. Slater - O'Reilly Media
CouchDB's creators show you how to use this document-oriented database as a standalone application framework or with high-volume, distributed applications. CouchDB is ideal for web applications that handle huge amounts of loosely structured data.
(9889 views)

by Mark Pollack, et al. - O'Reilly Media
This book shows you how Spring's data access framework can help you connect to either non-relational or relational databases. You'll learn how Spring Data's model reduces the learning curve for applications with newer data access technologies.
(12937 views)

- IBM Redbooks
The book provides extensive information for developers, including code samples for creating PHP applications with DB2, Informix Dynamic Server, and Cloudscape. We use the latest PHP data access extensions, including PHP Data Objects and ibm_db2.
(13494 views)