Programming Pig
by Alan F Gates
Publisher: O'Reilly Media 2011
Number of pages: 222
Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
Download or read it online for free here:
Download link
(6.4MB, PDF)
Similar books

by Julia Silge, David Robinson - O'Reilly Media
With this practical book, you'll explore text-mining techniques with tidytext, a package that the authors developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext can make text analysis easy and effective.
(4766 views)

- Techotopia
This ebook provides a step-by-step path to proficiency with MySQL. Beginning with relational database basics and the installation of MySQL, the book covers everything needed to successfully build, maintain and secure MySQL-based databases.
(14968 views)

- Wikibooks
MySQL is a free, widely used SQL engine. It can be used as a fast database as well as a rock-solid DBMS using a modular engine architecture. The purpose of this wikibook is to provide practical knowledge of using the database ...
(7521 views)

by Mark Pollack, et al. - O'Reilly Media
This book shows you how Spring's data access framework can help you connect to either non-relational or relational databases. You'll learn how Spring Data's model reduces the learning curve for applications with newer data access technologies.
(13713 views)