py-dask

v 2024.4.1 Updated: 7 months, 1 week ago

Minimal task scheduling abstraction.

Dask provides multi-core execution on larger-than-memory datasets using blocked algorithms and task scheduling. It maps high-level NumPy, Pandas, and list operations on large datasets onto many operations on small in-memory datasets, organized as task graphs. It then executes these graphs in parallel on a single machine. Dask lets us use traditional NumPy, Pandas, and list programming while operating on inconveniently large data in a small amount of space.
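
The blocked-algorithm idea in the description can be illustrated with a minimal sketch that uses only the Python standard library. It is not Dask's implementation and does not use Dask's API: the names `chunked` and `blocked_sum` are hypothetical, chosen for illustration. The sketch splits one large reduction into many small per-block tasks, runs them on a worker pool, and combines the partial results.

```python
from concurrent.futures import ThreadPoolExecutor


def chunked(seq, size):
    """Yield consecutive blocks of at most `size` items from `seq`."""
    for start in range(0, len(seq), size):
        yield seq[start:start + size]


def blocked_sum(seq, chunk_size=1000, workers=4):
    """Sum `seq` by summing each block as its own task, then combining.

    Hypothetical helper: each block is small enough to handle on its own,
    and only the per-block partial sums are kept for the final step.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(sum, chunked(seq, chunk_size)))
    return sum(partials)


# A lazy range stands in for a dataset we would rather not materialize.
data = range(1_000_000)
total = blocked_sum(data, chunk_size=50_000)
```

With Dask itself installed, a comparable computation would look roughly like `dask.array.arange(1_000_000, chunks=50_000).sum().compute()`, where the `chunks` argument plays the role of `chunk_size` here and the scheduler decides how the per-block tasks are run.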

https://github.com/dask/dask/

Installations 1
Requested Installations 1