Scalable data analysis in Python with Dask /

Saved in:
Bibliographic Details
Author / Creator:Kashif, Mohammed, speaker.
Imprint:[Place of publication not identified] : Packt, [2019]
Description:1 online resource (1 streaming video file (3 hr., 41 min., 52 sec.))
Language:English
Subject:
Format: E-Resource Video Streaming Video
URL for this record:http://pi.lib.uchicago.edu/1001/cat/bib/13678496
Hidden Bibliographic Details
Digital file characteristics:video file
Notes:Title from title screen (viewed January 31, 2020).
Date of publication from resource description page.
Presenter, Mohammed Kashif.
In English.
Summary:"In this course, you'll learn to scale your data analysis. Firstly, you will execute distributed data science projects right from data ingestion to data manipulation and visualization using Dask. Then, you will explore the Dask framework. After, see how Dask can be used with other common Python tools such as NumPy, pandas, Matplotlib, scikit-learn, and more. You'll be working on large datasets and performing exploratory data analysis to investigate the dataset, then come up with the findings from the dataset. You'll learn by implementing data analysis principles using different statistical techniques in one go across different systems on the same massive datasets. Throughout the course, we'll go over the various techniques, modules, and features that Dask has to offer. Finally, you'll learn to use its unique offering for Machine Learning, using the Dask-ML package. You'll also start using parallel processing in your data tasks on your own system without moving to the distributed environment."--Resource description page