by Alan F Gates
Publisher: O'Reilly Media 2011
Number of pages: 344
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets.
Home page url
Download or read it online for free here:
by Jacek Laskowski - GitBook
This collections of notes (what some may rashly call a 'book') serves as the ultimate place of mine to collect all the nuts and bolts of using Apache Spark. The notes aim to help me designing and developing better products with Apache Spark.
by Mark Pollack, et al. - O'Reilly Media
This book shows you how Spring's data access framework can help you connect to either non-relational or relational databases. You'll learn how Spring Data's model reduces the learning curve for applications with newer data access technologies.
by Lars George - O'Reilly Media
If you are looking for a solution to accommodate a virtually endless amount of data, this book will show you how Apache HBase can fulfill your needs. HBase scales to billions of rows and columns, while ensuring that performance remain constant.
This ebook provides a step by step path to proficiency with MySQL. Beginning with the relational database basics and the installation of MySQL, the book covers everything needed to successfully build, maintain and secure MySQL based databases.