You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repo contains several examples to run BytePS, including popular CV/NLP models implemented in TensorFlow/PyTorch/MXNet. You can use them to reproduce the end-to-end evaluation results in the OSDI'20 paper of BytePS.
A run.sh is provided for each example to demonstrate the basic usage. It does not set up the necessary enviroment variables needed by BytePS. Therefore, before you run these examples, we recommend you read this tutorial for detailed guidelines on launching BytePS. Usually it will be easier to get started on the simple example in that tutorial, and then try more complicated examples in this repo.
Read the best-practice doc for improving the training performance after you have launched BytePS successfully.
We also provide options to switch to Horovod for performance comparison. To do so, you should change bpslaunch to MPI-based command (see Horovod tutorial), and then change the value of related environment variable (usually DISTRIBUTED_FRAMEWORK) or python trailing argument as noted in each run.sh.
Acknowledgement
Some of our examples rely on public repos, including: