Programming Pig
by Alan F Gates
Publisher: O'Reilly Media 2011
Number of pages: 222
Description:
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
Download or read it online for free here:
Download link
(6.4MB, PDF)
Similar books
Understanding Big Data
by Chris Eaton, et al. - McGraw-Hill
Big Data represents a new era in data exploration and utilization, and IBM helps clients navigate this transformation. The book reveals how to use Big Data technology to deliver a robust, secure, highly available, enterprise-class Big Data platform.
(11654 views)
Spring Data: Modern Data Access for Enterprise Java
by Mark Pollack, et al. - O'Reilly Media
This book shows you how Spring's data access framework can help you connect to either non-relational or relational databases. You'll learn how Spring Data's model reduces the learning curve for applications with newer data access technologies.
(14373 views)
A Little Riak Book
by Eric Redmond - GitBook
This is a free little book about Riak, a scalable, high-availability NoSQL datastore. Riak is an open-source, distributed key/value database for high availability and near-linear scalability. Riak has remarkably high uptime and grows with you.
(8489 views)
HBase: The Definitive Guide
by Lars George - O'Reilly Media
If you are looking for a solution to accommodate a virtually endless amount of data, this book will show you how Apache HBase can fulfill your needs. HBase scales to billions of rows and columns, while ensuring that performance remains constant.
(11289 views)