PyTorch versions #338
terryfrankcombe
started this conversation in
General
Replies: 1 comment 5 replies
-
Hi Terry, This is a consequence of some very strange upstream bugs in PyTorch we've been affected by: #311. The linked issue has the details; basically training/inference will get progressively slower and slower over time. With 1.12 I had also noticed some other irregularities due to the new nvFuser JIT backend. I would be careful with 1.12, but if it is working for you that is good to know. If you have (or have not) experienced the slowdowns in the linked issue, would you mind posting the details in that thread? We're still trying to understand when and why that issue occurs so we can hopefully work around it. Thanks! |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I just noticed that the installation instructions state that we need pytorch <= 1.11.*.
I've been running apparently happily with 1.12.0. Are there known problems with this?
(For additional context, I'm trying to get things running on A100, having used V100 up to now. But my pytorch doesn't like sm_80. Installing 1.12.0+cu113 gives a tool that consistently hangs after the second batch of epoch 0. Should I be looking for something in 1.11? Conversely, might pytorch 2.0 work?)
Ciao
Terry
Beta Was this translation helpful? Give feedback.
All reactions