Logo

Mastering Apache Spark 2.0 by Jacek Laskowski

Small book cover: Mastering Apache Spark 2.0

Mastering Apache Spark 2.0
by

Publisher: GitBook
Number of pages: 1621

Description:
This collections of notes (what some may rashly call a 'book') serves as the ultimate place of mine to collect all the nuts and bolts of using Apache Spark. The notes aim to help me designing and developing better products with Apache Spark.

Home page url

Download or read it online for free here:
Read online
(online html)

Similar books

Book cover: MySQLMySQL
- Wikibooks
MySQL is a free, widely used SQL engine. It can be used as a fast database as well as a rock-solid DBMS using a modular engine architecture. The purpose of this wikibook is to provide a practical knowledge on using the database ...
(10428 views)
Book cover: Rethinking Enterprise Storage: A Hybrid Cloud ModelRethinking Enterprise Storage: A Hybrid Cloud Model
by - Microsoft Press
The book describes a storage architecture that some experts are calling a game changer in the infrastructure industry. Called the Microsoft hybrid cloud storage, it is a way to integrate cloud storage services with traditional enterprise storage.
(9518 views)
Book cover: Programming PigProgramming Pig
by - O'Reilly Media
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs. The structure of Pig programs is amenable to parallelization, which enables them to handle very large data sets.
(13267 views)
Book cover: Text Mining with R: A Tidy ApproachText Mining with R: A Tidy Approach
by - O'Reilly Media
With this practical book, you'll explore text-mining techniques with tidytext, a package that authors developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext can make text analysis easy and effective.
(7899 views)