Apache Spark is amazing when everything clicks. But if you haven't seen the performance improvements you expected, or still don't feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources.

Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs and developer hours. Not only will you gain a more comprehensive understanding of Spark, you'll also learn how to make it sing.

With this book, you'll explore:
- How Spark SQL's new interfaces improve performance over Spark's RDD data structure
- The choice between data joins in Core Spark and Spark SQL
- Techniques for getting the most out of standard RDD transformations
- How to work around performance issues in Spark's key/value pair paradigm
- Writing high-performance Spark code without Scala or the JVM
- How to test for functionality and performance when applying suggested improvements
- Using Spark MLlib and Spark ML machine learning libraries
- Spark's Streaming components and external community packages
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £7.46, good condition, Sold by Evergreen Goodwill rated 5.0 out of 5 stars, ships from Seattle, WA, UNITED STATES, published 2017 by O'Reilly Media.
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £9.58, fair condition, Sold by Goodwill Books rated 5.0 out of 5 stars, ships from Hillsboro, OR, UNITED STATES, published 2017 by O'Reilly Media.
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £9.77, good condition, Sold by HPB-Red rated 5.0 out of 5 stars, ships from Dallas, TX, UNITED STATES, published 2017 by O'Reilly Media.
Choose your shipping method in Checkout. Costs may vary based on destination.
Seller's Description:
Good. Connecting readers with great books since 1972! Used textbooks may not include companion materials such as access codes, etc. May have some wear or writing/highlighting. We ship orders daily and Customer Service is our top priority!
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £10.21, good condition, Sold by HPB-Red rated 5.0 out of 5 stars, ships from Dallas, TX, UNITED STATES, published 2017 by O'Reilly Media.
Seller's Description:
Good. Connecting readers with great books since 1972! Used textbooks may not include companion materials such as access codes, etc. May have some wear or writing/highlighting. We ship orders daily and Customer Service is our top priority!
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £10.38, good condition, Sold by HPB-Red rated 5.0 out of 5 stars, ships from Dallas, TX, UNITED STATES, published 2017 by O'Reilly Media.
Seller's Description:
Good. Connecting readers with great books since 1972! Used textbooks may not include companion materials such as access codes, etc. May have some wear or writing/highlighting. We ship orders daily and Customer Service is our top priority!
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £13.64, fair condition, Sold by Goodwill of Silicon Valley rated 5.0 out of 5 stars, ships from San Jose, CA, UNITED STATES, published 2017 by O'Reilly Media.
Seller's Description:
Fair. Supports Goodwill of Silicon Valley job training programs. The cover and pages are in Acceptable condition! Any other included accessories are also in Acceptable condition, showing use. Use can include some highlighting and writing, page and cover creases, and other types of visible wear such as cover tears, discoloration, staining, marks, scuffs, etc. All pages intact.
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £16.27, very good condition, Sold by BV Books LLC rated 5.0 out of 5 stars, ships from Parker, CO, UNITED STATES, published 2017 by O'Reilly Media.
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £17.71, good condition, Sold by SurplusTextSeller rated 5.0 out of 5 stars, ships from Columbia, MO, UNITED STATES, published 2017 by O'Reilly Media.
Seller's Description:
Good. Ships in a BOX from Central Missouri! May not include working access code. Will not include dust jacket. Has used sticker(s) and some writing or highlighting. UPS shipping for most packages (Priority Mail for AK/HI/APO/PO Boxes).
Add this copy of High Performance Spark: Best Practices for Scaling and to cart. £17.90, like new condition, Sold by Bookwitch rated 5.0 out of 5 stars, ships from Concord, CA, UNITED STATES, published 2017 by O'Reilly Media.
Seller's Description:
As New. Book. Small quarto, softcover, 12 n32 in white and red wraps. 342 pp. including index. Apache Spark is amazing when everything clicks. But if you haven't seen the performance improvements you expected, or still don't feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources. Ideal for software engineers, data engineers, developers, and system administrators.