Learn PySpark: Build Python-based Machine Learning and…

$64.99

Price: $64.99
(as of Jul 26, 2024 23:04:57 UTC – Details)



Leverage machine learning and deep learning models to build applications with real-time data using PySpark. This book is perfect for those who want to learn how to use this language to perform exploratory data analysis and solve a variety of business challenges.
You'll start by reviewing PySpark basics, such as Spark's core architecture, and see how to use PySpark for big data processing, such as data ingestion, cleansing, and transformation techniques. This will be followed by workflows for analyzing streaming data using PySpark and a comparison of various streaming platforms.
You will then see how to schedule different Spark jobs using Airflow with PySpark and examine how to tune deep learning and machine learning models to make real-time predictions. This book concludes with a discussion of graph frameworks and performing network analysis using graph algorithms in PySpark. All of the code presented in the book will be available in Python scripts on Github.
What you will learnDevelop pipelines for real-time data processing using PySpark
Build Machine Learning and Deep Learning Models with PySpark's Latest Offerings
Use graph analysis with PySpark
Creating sequence embeddings from text data
Who is this book for?

Data scientists, machine learning engineers, and deep learning engineers who want to learn and use PySpark for real-time analysis of streaming data.

ASIN: B07XMHZ4W7
Publisher ‏ : ‎ Apress; 1st edition (6 September 2019)
Publication date: September 6, 2019
Language ‏ : ‎ English
File size: 26497 KB
Text to speech: enabled
Screen Reader ‏ : ‎ Compatible
Enhanced Typesetting ‏ : ‎ Enabled
X-rays: Not enabled
Word Wise ‏ : ‎ Not enabled
Sticky Notes ‏ : ‎ On Kindle Scribe
Print length: 230 pages
Source of page numbers ISBN ‏ : ‎ 1484249607

Reviews

There are no reviews yet.

Be the first to review “Learn PySpark: Build Python-based Machine Learning and…”

Your email address will not be published. Required fields are marked *