
add transformer support? (matmul, layernorm) #216

Open
bigprince97 opened this issue Dec 26, 2019 · 8 comments

Comments

@bigprince97

No description provided.

@bigprince97
Author

Hello, this is a very convenient repo for converting PyTorch models to TensorRT, but it supports CV models well and NLP models poorly. I tried to convert a GPT-2 model to TensorRT, but two operations that transformers (the very popular NLP block underlying GPT-2 and BERT) need are not supported: matmul and layernorm. The C++ API supports them; would you consider supporting them in the Python API? I would very much like to deploy GPT-2 with TensorRT.
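For context, here is what the two missing ops compute, in plain NumPy (a sketch of the math only, not the TensorRT implementation; the function names below are illustrative):

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    # LayerNorm as used in GPT-2/BERT blocks: normalize over the last
    # (hidden) axis, then apply a learned scale (gamma) and shift (beta).
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return gamma * (x - mean) / np.sqrt(var + eps) + beta

def attention_scores(q, k):
    # The batched matmul transformers need: Q @ K^T inside self-attention.
    return q @ np.swapaxes(k, -1, -2)

x = np.random.randn(2, 4, 8)                  # (batch, seq, hidden)
y = layer_norm(x, np.ones(8), np.zeros(8))    # normalized: mean ~0, std ~1
scores = attention_scores(x, x)               # shape (2, 4, 4)
```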

@bigprince97 changed the title to add transformer support? (matmul, layernorm) Dec 26, 2019
@bigprince97
Author

I converted the GPT-2 model to TensorRT successfully using this repo plus some operations I wrote myself, and the speed increased three times. Thanks for this good repo.

@mowayao

mowayao commented Jan 19, 2020

> I converted the GPT-2 model to TensorRT successfully using this repo plus some operations I wrote myself, and the speed increased three times. Thanks for this good repo.

Can you share your implementation of matmul, please?

@czs1886

czs1886 commented Jun 10, 2020

> I converted the GPT-2 model to TensorRT successfully using this repo plus some operations I wrote myself, and the speed increased three times. Thanks for this good repo.
>
> Can you share your implementation of matmul, please?

Just use addMatrixMultiply(), which is defined here.
You need to create a new converter in the converters directory: copy mul.py and change its addElementwise to addMatrixMultiply.

@czs1886 czs1886 mentioned this issue Jun 10, 2020
@q248953144

> I converted the GPT-2 model to TensorRT successfully using this repo plus some operations I wrote myself, and the speed increased three times. Thanks for this good repo.

Can you share your approach?

@Fan9

Fan9 commented Apr 7, 2021

@bigprince97 can you share your code? Thanks!

@bigprince97
Author

Sorry, I'm only seeing the replies now. I wrote that code last year and it no longer runs because of updates to the repo. FasterTransformer supports GPT-2 acceleration and has a PyTorch interface; you can try that.

@francescotaioli

For anyone looking for the matmul converter, here's the code:

import tensorrt as trt
# These helpers ship with torch2trt itself; converters in the repo import
# them from torch2trt.torch2trt.
from torch2trt.torch2trt import (tensorrt_converter, add_missing_trt_tensors,
                                 broadcast_trt_tensors)

@tensorrt_converter('torch.Tensor.__matmul__')
def convert_matmul(ctx):
    input_a = ctx.method_args[0]
    input_b = ctx.method_args[1]
    output = ctx.method_return
    input_a_trt, input_b_trt = add_missing_trt_tensors(ctx.network, [input_a, input_b])
    input_a_trt, input_b_trt = broadcast_trt_tensors(ctx.network, [input_a_trt, input_b_trt], len(output.shape) - 1)

    # A matrix product rather than the elementwise PROD used in mul.py:
    layer = ctx.network.add_matrix_multiply(input_a_trt, trt.MatrixOperation.NONE,
                                            input_b_trt, trt.MatrixOperation.NONE)
    output._trt = layer.get_output(0)

Documentation for the Python API: here and here


6 participants