Is your feature request related to a problem? Please describe.
The PyTorch DataLoader has a prefetch_factor option: https://pytorch.org/docs/stable/data.html
It is set separately from num_workers and controls how many batches each worker loads in advance. The default is 2 when num_workers > 0.
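For a rough sense of scale: per the PyTorch documentation, the total number of batches prefetched across all workers is prefetch_factor * num_workers. A minimal sketch of that arithmetic (the helper name is hypothetical and not part of PyTorch or NeMo):

```python
def total_prefetched_batches(num_workers: int, prefetch_factor: int = 2) -> int:
    """Batches buffered ahead of the training loop, summed over all workers."""
    # With num_workers == 0 the data is loaded in the main process, so nothing is prefetched.
    if num_workers <= 0:
        return 0
    return num_workers * prefetch_factor


print(total_prefetched_batches(num_workers=4))                     # 8
print(total_prefetched_batches(num_workers=4, prefetch_factor=8))  # 32
```

So with the worker count held fixed, raising prefetch_factor is the only way to deepen the buffer.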
Describe the solution you'd like
It would be nice if we could specify prefetch_factor through the YAML config. This could help when storage latency is high and we want more batches to be prefetched.
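One possible shape for this, as a sketch only: the config keys and the helper below are assumptions for illustration, not NeMo's actual schema or code. The idea is to read an optional prefetch_factor from the data section of the config and forward it to the DataLoader only when worker processes are enabled, since PyTorch rejects prefetch_factor when num_workers is 0.

```python
from typing import Any, Dict


def dataloader_kwargs(data_cfg: Dict[str, Any]) -> Dict[str, Any]:
    """Build DataLoader keyword arguments from a (hypothetical) data config section."""
    num_workers = int(data_cfg.get("num_workers", 0))
    kwargs: Dict[str, Any] = {"num_workers": num_workers}

    # Optional key: when absent, PyTorch's own default applies.
    prefetch_factor = data_cfg.get("prefetch_factor")
    # Forward it only when worker processes are used (num_workers > 0).
    if prefetch_factor is not None and num_workers > 0:
        kwargs["prefetch_factor"] = int(prefetch_factor)
    return kwargs


# Stand-in for the parsed YAML, e.g.
#   num_workers: 4
#   prefetch_factor: 8
cfg = {"num_workers": 4, "prefetch_factor": 8}
print(dataloader_kwargs(cfg))  # {'num_workers': 4, 'prefetch_factor': 8}
```

The resulting dict could then be unpacked into torch.utils.data.DataLoader(dataset, **kwargs). Leaving the key optional keeps existing configs working unchanged.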
Describe alternatives you've considered
We tried increasing num_workers, which helps reduce data loading latency.
However, increasing num_workers also increases CPU memory usage.
Additional context
NeMo 24.07