I was working on optimizing a T5 model. I separated the model into an encoder and a decoder and converted them to ONNX using the NVIDIA TensorRT repo https://github.com/NVIDIA/TensorRT/tree/main/demo/HuggingFace, but I am unable to run inference. The model I used is a QA model based on T5, and its predictions are produced with the `generate` method. Is there any way to generate output from T5 without using the `generate` method?
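For reference, this is roughly what I am trying to achieve: a minimal greedy decoding loop over the two exported sessions, shown here with ONNX Runtime. The file paths and the input/output names (`input_ids`, `encoder_hidden_states`, `logits`, etc.) are placeholders and may not match the names produced by the export, so treat this as a sketch rather than working code.

```python
# Sketch: greedy decoding with a separately exported T5 encoder and decoder.
# Paths and tensor names below are assumptions -- adjust to your actual export.
import numpy as np
import onnxruntime as ort
from transformers import T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-base")      # assumption: base checkpoint
encoder = ort.InferenceSession("t5_encoder.onnx")       # assumption: encoder path
decoder = ort.InferenceSession("t5_decoder.onnx")       # assumption: decoder path

text = "question: What is ONNX? context: ONNX is an open format for ML models."
enc = tokenizer(text, return_tensors="np")
input_ids = enc["input_ids"].astype(np.int64)
attention_mask = enc["attention_mask"].astype(np.int64)

# Run the encoder once; its hidden states are reused at every decoding step.
encoder_hidden_states = encoder.run(
    None, {"input_ids": input_ids, "attention_mask": attention_mask}
)[0]

# T5 starts decoding from the pad token; greedily append the argmax token
# until EOS or a length limit is reached.
decoder_input_ids = np.array([[tokenizer.pad_token_id]], dtype=np.int64)
for _ in range(64):
    logits = decoder.run(
        None,
        {
            "input_ids": decoder_input_ids,
            "encoder_hidden_states": encoder_hidden_states,
            "encoder_attention_mask": attention_mask,
        },
    )[0]
    next_token = int(np.argmax(logits[0, -1]))
    decoder_input_ids = np.concatenate(
        [decoder_input_ids, np.array([[next_token]], dtype=np.int64)], axis=1
    )
    if next_token == tokenizer.eos_token_id:
        break

print(tokenizer.decode(decoder_input_ids[0], skip_special_tokens=True))
```

Is something along these lines the right approach, or is there a supported way to drive the split encoder/decoder without reimplementing `generate`?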