
accuracy drops a lot in fp16 mode #879

Open
itachi1232gg opened this issue Aug 22, 2023 · 4 comments


itachi1232gg commented Aug 22, 2023

My model's accuracy drops a lot when I convert it to fp16 mode; even a pretrained ResNet-34 shows an accuracy drop in fp16 mode.

import os
os.environ['CUDA_MODULE_LOADING'] = 'LAZY'

import torch
from torch2trt import torch2trt
from torchvision.models.resnet import resnet34, resnet18

data = torch.ones((1, 3, 224, 224)).cuda()
model = resnet18(pretrained=True)
model.eval()
model.cuda()
model_trt = torch2trt(model, [data], fp16_mode=True)
with torch.no_grad():
    print((model(data) - model_trt(data)).abs().sum())

tensor(5.3151, device='cuda:0')

If I set fp16_mode=False, then the output is

tensor(0.0017, device='cuda:0')


Lu-tju commented Oct 8, 2023

Hi, with your code my machine gives 0.6877 when fp16_mode=False (and 5.3 when fp16_mode=True). Is this amount of error normal?

@itachi1232gg (Author)

> Hi, with your code my machine gives 0.6877 when fp16_mode=False (and 5.3 when fp16_mode=True). Is this amount of error normal?

0.6877 is large for fp32 mode and is likely to give you wrong outputs.
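As an aside, the absolute sum of differences doesn't by itself show whether any individual element moved much, or whether the predicted class changes. A minimal sketch of that distinction, simulating fp16 round-off on CPU with numpy rather than a real TensorRT engine (the logits below are random placeholders, not outputs from this model):

```python
import numpy as np

rng = np.random.default_rng(0)
# Placeholder fp32 "logits" for a 1000-class classifier (ImageNet-sized)
logits_fp32 = rng.standard_normal((1, 1000)).astype(np.float32)
# Simulate fp16 precision loss with a round-trip through float16
logits_fp16 = logits_fp32.astype(np.float16).astype(np.float32)

abs_diff = np.abs(logits_fp32 - logits_fp16)
print(abs_diff.sum())  # summed over 1000 elements, this can look alarming
print(abs_diff.max())  # per-element error is tiny
# For classification accuracy, what matters is whether top-1 changes
print(np.array_equal(logits_fp32.argmax(1), logits_fp16.argmax(1)))
```

With a real fp16 engine the per-layer errors compound, so the most reliable accuracy check is top-1/top-5 on a validation set rather than a raw sum of output differences.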


JWLee89 commented Nov 15, 2023

Instead of testing the absolute sum of differences between the two models' outputs, I believe an element-wise check that no difference exceeds a certain threshold might be a more accurate measure.

For example (I did not test the code), we can check whether all elements of the source and target tensors are within a certain absolute tolerance:

import numpy as np
import torch

with torch.no_grad():
    # Move to CPU and convert to numpy, since np.allclose
    # cannot operate directly on CUDA tensors
    output_pt = model(data).cpu().numpy()
    output_trt = model_trt(data).cpu().numpy()

# Set this value to something that seems appropriate to you;
# 1e-5 is generally reasonable for fp32 comparisons
absolute_tolerance = 1e-5
print(np.allclose(output_pt, output_trt, atol=absolute_tolerance))

@itachi1232gg (Author)

> we can check whether all elements inside of source and target tensor are within a certain absolute threshold.

`np.allclose(output_pt, output_trt, atol=absolute_tolerance)` returns False
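That result is expected regardless of whether the engine has a bug: atol=1e-5 is an fp32-level bar, while fp16 carries only 10 mantissa bits (roughly 3 decimal digits), so values of order 1 already have round-off near 5e-4. A rough sketch of the difference, again simulating the fp16 round-trip with numpy rather than a TensorRT engine:

```python
import numpy as np

rng = np.random.default_rng(1)
ref = rng.standard_normal((1, 1000)).astype(np.float32)
half = ref.astype(np.float16).astype(np.float32)  # simulated fp16 round-trip

# fp32-level tolerance: fails even for pure fp16 round-off
print(np.allclose(ref, half, atol=1e-5))             # False
# fp16-scale tolerance: passes for pure round-off error
print(np.allclose(ref, half, rtol=1e-2, atol=1e-3))  # True
```

So a False at atol=1e-5 alone doesn't distinguish normal fp16 round-off from a real problem; a large maximum relative error (say, more than a few percent) on real engine outputs is a better signal that some layers overflow in fp16 and may need to be kept in fp32.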
