
converting .pth to .trt engine, do inference in C++, input and output names not matched #405

Open
cam401 opened this issue Sep 7, 2020 · 5 comments


cam401 commented Sep 7, 2020

Firstly, thanks for this high-quality project.

I ran inference through the Python API after converting the .pth model to .trt. The speed-up is very impressive.

However, I also need to do the same inference through the C++ API, because the pre-processing and post-processing in Python are not ideal.

I converted the .pth file to a .trt engine file, which was loaded (parsed) by the C++ API successfully (I suppose).

However, when doing inference, the code gives an error of "can not find binding of given name" for the names that were defined as input and output.

I suppose the input and output names have to be specified according to the computational graph as well (in Python, this is not necessary).

Now, I wonder how I can find out the names of the input and output nodes (for other model formats, Netron can be used to visualise and check the input and output names).

Thanks, and I look forward to your support.


jaybdub commented Sep 8, 2020

Hi cam401,

Thanks for reaching out!

By default, the input/output names are given as input_0, input_1 and output_0, output_1, output_2, etc. So for a single-input, single-output model these would be input_0 and output_0.

Just to check, you can enumerate the bindings with the TensorRT Python API:

# model_trt from converting model with torch2trt
engine = model_trt.engine

for idx in range(engine.num_bindings):
    is_input = engine.binding_is_input(idx)
    name = engine.get_binding_name(idx)
    print(idx, is_input, name)
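
For the C++ API, the same information can be read from the deserialized engine. Here is a minimal sketch, assuming engine is an nvinfer1::ICudaEngine* obtained from IRuntime::deserializeCudaEngine (the function and variable names are just illustrative):

#include <NvInfer.h>
#include <iostream>

// Print index, direction and name of every binding on a deserialized engine.
void printBindings(const nvinfer1::ICudaEngine* engine)
{
    for (int idx = 0; idx < engine->getNbBindings(); ++idx)
    {
        bool isInput = engine->bindingIsInput(idx);
        const char* name = engine->getBindingName(idx);
        std::cout << idx << " " << isInput << " " << name << std::endl;
    }

    // A binding can also be looked up directly by name; -1 means it does not exist.
    int inputIndex = engine->getBindingIndex("input_0");
    int outputIndex = engine->getBindingIndex("output_0");
    std::cout << "input_0 -> " << inputIndex << ", output_0 -> " << outputIndex << std::endl;
}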

Please let me know if this helps or you run into issues.

Best,
John


cam401 commented Sep 9, 2020

Hi John,

Thanks for your prompt response.

That helped me solve the problem, and I can now do inference in C++ from the .trt engine file.

> However, I got the following warnings:
>
> [TRT] Parameter check failed at: engine.cpp::executeV2::811, condition: !mEngine.hasImplicitBatchDimension()
>
> and also all inference results are zeros, which are obviously not correct. I will look into this further. At least the code can run through.

This has been addressed.
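
For reference, that executeV2 check fails when the engine was built with an implicit batch dimension, which torch2trt engines typically have by default; in that case the implicit-batch execution call has to be used instead. A minimal sketch, with illustrative names:

#include <NvInfer.h>

// Choose the execution call that matches how the engine was built.
// executeV2() is only valid for explicit-batch engines; implicit-batch
// engines take the batch size as an explicit argument instead.
bool runInference(nvinfer1::ICudaEngine* engine,
                  nvinfer1::IExecutionContext* context,
                  void** bindings, int batchSize)
{
    if (engine->hasImplicitBatchDimension())
    {
        return context->execute(batchSize, bindings);
    }
    return context->executeV2(bindings);
}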

However, the inference speed via the C++ API seems to be much slower than via the Python API (~5 times slower, batch size = 1).

Best,

c

@maiminh1996

@cam401 Can you share your C++ inference code?

@Zhangppppp

@maiminh1996 Can you share your C++ inference code? Thanks.

@sunshinesjw1

@cam401 Hello, regarding this question: "the inference speed via the C++ API seems to be much slower than via the Python API (~5 times slower, batch size = 1)", have you solved it? Can you give me some advice on this situation? Thanks, I look forward to your reply.
