-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CUDA out of memory] training in 1024 × 576 resolution in the A100 80G #39
Comments
@CallMeFrozenBanana I meet the same problem, have you solved it |
@pixeli99 1GPU utilizing all 40GB VRAM, eval-off, 512x256 small-res unet training test was successful however same error here, High resolution appears to be an issue. For Default frame resolution for |
Ive had better luck after disabling validation via
The validation logic seems to blow up the VRAM requirements heavily. After disabling validation, I can run I just realized this is eerily similar to the training script at MooreThreads/Moore-AnimateAnyone/ |
any update? |
Having the same problem. Is it just too much to try train on A100 80GB with the full 1024x576 resolution? I set the gradient checkpointing on, and enable_xformers_memory_efficient_attention also. |
Hi, Thanks for any suggestions.
The largest resolution that could be used for training is 512 × 512 with ~76G memory cost.
I set the enable_xformers_memory_efficient_attention to True but nothing changed at all.
The text was updated successfully, but these errors were encountered: