Image for Scaling Machine Learning with Spark

Scaling Machine Learning with Spark : Distributed ML with MLlib, TensorFlow, and PyTorch

See all formats and editions

Learn how to build end-to-end scalable machine learning solutions with Apache Spark.

With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods.

You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals--allowing data and ML practitioners to collaborate and understand each other better. Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch.

If you're a data scientist who works with machine learning, this book shows you when and why to use each technology. You will:Explore machine learning, including distributed computing concepts and terminologyManage the ML lifecycle with MLflowIngest data and perform basic preprocessing with SparkExplore feature engineering, and use Spark to extract featuresTrain a model with MLlib and build a pipeline to reproduce itBuild a data system to combine the power of Spark with deep learningGet a step-by-step example of working with distributed TensorFlowUse PyTorch to scale machine learning and its internal architecture

Read More
Available
£47.99 Save 25.00%
RRP £63.99
Add Line Customisation
2 in stock Need More ?
Add to List
Product Details
O'Reilly Media
1098106822 / 9781098106829
Paperback / softback
006.312
21/03/2023
United States
400 pages
178 x 233 mm