Testing Distributed Data Parallel with Pytest #7995

tchaton · 2020-11-04T08:57:49Z

Dear people from Pytest,

We are currently investigating how to test our Distributed Data Parallel (DDP) implementation in PyTorch Lightning, possibly with pytest.

DDP launches one process per GPU and runs deep learning training across multiple nodes and multiple GPUs.
Here is the line where we launch the processes using subprocess.Popen(..):
https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/accelerators/ddp_accelerator.py#L127
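To make the question more concrete, here is a minimal sketch of the kind of test we have in mind. It is not our actual accelerator code: the helper `launch_workers`, the inline worker body, and the use of the `LOCAL_RANK` and `WORLD_SIZE` environment variables are assumptions made for illustration, and the worker only checks its rank instead of running a real training step.

```python
import os
import subprocess
import sys

# Hypothetical worker body: a real test would run a short DDP training step here.
WORKER_CODE = (
    "import os; "
    "rank = int(os.environ['LOCAL_RANK']); "
    "world = int(os.environ['WORLD_SIZE']); "
    "assert 0 <= rank < world; "
    "print(f'rank {rank} of {world} ok')"
)


def launch_workers(num_procs, worker_code=WORKER_CODE, timeout=60):
    """Start one child process per simulated GPU; return (returncode, output) pairs."""
    procs = []
    for rank in range(num_procs):
        # Each child gets its own copy of the environment with its rank set.
        env = os.environ.copy()
        env["LOCAL_RANK"] = str(rank)
        env["WORLD_SIZE"] = str(num_procs)
        procs.append(
            subprocess.Popen(
                [sys.executable, "-c", worker_code],
                env=env,
                stdout=subprocess.PIPE,
                stderr=subprocess.STDOUT,
                text=True,
            )
        )

    results = []
    for proc in procs:
        try:
            out, _ = proc.communicate(timeout=timeout)
        except subprocess.TimeoutExpired:
            # Kill a hung worker so the test fails instead of blocking forever.
            proc.kill()
            out, _ = proc.communicate()
        results.append((proc.returncode, out.strip()))
    return results


def test_all_ranks_exit_cleanly():
    results = launch_workers(num_procs=2)
    # Every rank must exit with status 0; the captured output helps debugging.
    assert [code for code, _ in results] == [0, 0], results
```

The test process itself only starts the children and inspects their exit codes and output, so pytest never has to run inside the worker processes. Whether this is a sensible pattern is part of what we are asking.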

We wondered whether the pytest team knows the best way to approach this problem.

Best regards,
Thomas Chaton
Research Engineer at PyTorch Lightning.

RonnyPfannschmidt (Maintainer) · 2020-11-04T10:26:07Z

It's not even remotely clear what's actually being asked. It's next to impossible to give a reasonable answer on such a seemingly complex topic without more context.
