Replies: 1 comment
-
its not even remotely clear whats actually being asked, its next to impossible to give a reasonable answer to such a seemingly complex topic without some more context for comprehension |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Dear people from Pytest,
We are currently investigating testing our Distributed Data Parallel implementation in Pytorch Lightning (possibility with Pytest).
DPP launches one process per gpu and run deep learning training on multi-node multi-gpus.
Here is the line where we launch process using
subprocess.Popen(..)
https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/accelerators/ddp_accelerator.py#L127.
We wondered if Pytest Team would have knowledge about the best way to approach this problem.
Best regards,
Thomas Chaton
Research Engineer at Pytorch Lightning.
Beta Was this translation helpful? Give feedback.
All reactions