Programming Pig
by Alan F Gates
Publisher: O'Reilly Media 2011
Number of pages: 222
Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
Download or read it online for free here:
Download link
(6.4MB, PDF)
Similar books
The Little MongoDB Book
by Karl Seguin - openmymind.net
MongoDB is a document-oriented database -- it should be viewed as an alternative to relational databases. This book covers a number of topics with a focus on the fundamentals you will need to get comfortably up and running.
(11282 views)
Rethinking Enterprise Storage: A Hybrid Cloud Model
by Marc Farley - Microsoft Press
The book describes a storage architecture that some experts are calling a game changer in the infrastructure industry. Called Microsoft hybrid cloud storage, it is a way to integrate cloud storage services with traditional enterprise storage.
(7968 views)
Data Wrangling Handbook
by Open Knowledge Foundation - School of Data
The Data Wrangling Handbook is a companion text to the School of Data. Its function is something like a traditional textbook -- it will provide the detail and background theory to support the School of Data courses and challenges.
(9392 views)
Understanding Big Data
by Chris Eaton, et al. - McGraw-Hill
Big Data represents a new era in data exploration and utilization, and IBM helps clients navigate this transformation. The book reveals how to use Big Data technology to deliver a robust, secure, highly available, enterprise-class Big Data platform.
(12215 views)