You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Dask is a parallel and distributed computing library that scales the existing Python and PyData ecosystem. Dask can scale up to your full laptop capacity and out to a cloud cluster.
Prepare
1. You should clone this repository
git clone https://github.com/dask/dask-tutorial
and then install necessary packages.
There are three different ways to achieve this, pick the one that best suits you, and only pick one option.
They are, in order of preference:
Note that these options will alter your existing environment, potentially changing the versions of packages you already
have installed.
2c) Use Dockerfile
You can build a docker image from the provided Dockerfile.
$ docker build . # This will build using the same env as in a)
Run a container, replacing the ID with the output of the previous command
$ docker run -it -p 8888:8888 -p 8787:8787 <container_id_or_tag>
The above command will give an URL (Like https://(container_id or 127.0.0.1):8888/?token=<sometoken>) which
can be used to access the notebook from browser. You may need to replace the given hostname with "localhost" or
"127.0.0.1".
You should follow only one of the options above!
Launch Jupyter
From the repo directory
jupyter lab
This was already done for method c) and does not need repeating.
You are welcome to use Jupyter notebook if you prefer, but we'll be using lab in the live tutorial.