You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am profiling a ML workload using torch profiler. The code appears as:
with profile(activities=[
ProfilerActivity.CPU,
ProfilerActivity.CUDA],
record_shapes=True
) as prof:
main_args = parse_main_args()
main(main_args, DETECTED_SYSTEM)
prof.export_chrome_trace("torch_trace.json")
# print(prof.key_averages().table(sort_by="self_cpu_time_total", row_limit=20))
# print(prof.key_averages().table(sort_by="self_cuda_time_total", row_limit=20))
The code runs fine without the profiler. The code also runs fine to finish with the torch profiler. However when the profiler reaches the export statement, I get the following error:
I am profiling a ML workload using torch profiler. The code appears as:
The code runs fine without the profiler. The code also runs fine to finish with the torch profiler. However when the profiler reaches the export statement, I get the following error:
How to resolve this error? The machine is DGX H200x8, Ubuntu 22.04.4 LTS (Jammy Jellyfish)
The text was updated successfully, but these errors were encountered: