You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Tensorpack DataFlow is an efficient and flexible data
loading pipeline for deep learning, written in pure Python.
Its main features are:
Highly-optimized for speed.
Parallelization in Python is hard and most libraries do it wrong.
DataFlow implements highly-optimized
parallel building blocks which gives you an easy interface to parallelize your workload.
Written in pure Python.
This allows it to be used together with any other Python-based library.
DataFlow is originally part of the tensorpack library
and has been through many years of polishing.
Given its independence of the rest of the tensorpack library,
it is now a separate library whose source code is synced with tensorpack.
Please use tensorpack issues for support.
Why would you want to use DataFlow instead of a platform-specific data loading solutions?
We recommend you to read
Why DataFlow?.
Install:
pip install --upgrade git+https://github.com/tensorpack/dataflow.git
# or add `--user` to install to user's local directories
You may also need to install opencv, which is used by many builtin DataFlows.