
Mastering Apache Spark 2.0
by Jacek Laskowski
Publisher: GitBook 2016
Number of pages: 1621
Description:
This collections of notes (what some may rashly call a 'book') serves as the ultimate place of mine to collect all the nuts and bolts of using Apache Spark. The notes aim to help me designing and developing better products with Apache Spark.
Download or read it online for free here:
Read online
(online html)
Similar books
MySQL- Wikibooks
MySQL is a free, widely used SQL engine. It can be used as a fast database as well as a rock-solid DBMS using a modular engine architecture. The purpose of this wikibook is to provide a practical knowledge on using the database ...
(10428 views)
Rethinking Enterprise Storage: A Hybrid Cloud Modelby Marc Farley - Microsoft Press
The book describes a storage architecture that some experts are calling a game changer in the infrastructure industry. Called the Microsoft hybrid cloud storage, it is a way to integrate cloud storage services with traditional enterprise storage.
(9518 views)
Programming Pigby Alan F Gates - O'Reilly Media
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs. The structure of Pig programs is amenable to parallelization, which enables them to handle very large data sets.
(13267 views)
Text Mining with R: A Tidy Approachby Julia Silge, David Robinson - O'Reilly Media
With this practical book, you'll explore text-mining techniques with tidytext, a package that authors developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext can make text analysis easy and effective.
(7899 views)