Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

In jetson nano, inference time of fp16 and fp_32 is almost same? #470

Open
xxy90 opened this issue Dec 14, 2020 · 4 comments
Open

In jetson nano, inference time of fp16 and fp_32 is almost same? #470

xxy90 opened this issue Dec 14, 2020 · 4 comments

Comments

@xxy90
Copy link

xxy90 commented Dec 14, 2020

No description provided.

@jaybdub
Copy link
Contributor

jaybdub commented Dec 16, 2020

Hi xxy90,

Thanks for reaching out!

Do you mind sharing which model architecture you're referring to? The relative performance of FP32 vs. FP16 may depend on model architecture. I think the scaling also might not be linear with bit depth, because of various overhead when using reduced precision.

Best,
John

@zhangchenwei115
Copy link

Hi xxy90,

Thanks for reaching out!

Do you mind sharing which model architecture you're referring to? The relative performance of FP32 vs. FP16 may depend on model architecture. I think the scaling also might not be linear with bit depth, because of various overhead when using reduced precision.

Best,
John

Hi John,
I had the same problem which on Jetson Nano, weather in converting,fp16=True or False. the speed is the same. I used the model of lightweight openpose,here is the link, https://github.com/Daniil-Osokin/lightweight-human-pose-estimation.pytorch

@xxy90
Copy link
Author

xxy90 commented Jan 25, 2021

The model architecture is new anchor-free object-detection Nanodet, whose backbone is shuffleNetv2

@shadowuyl
Copy link

shadowuyl commented Aug 13, 2021

I also meet the same problem while using YOLOX-Nano(ref https://github.com/Megvii-BaseDetection/YOLOX) on jetson nano.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants