Hi,
I would like some clarification on how/when/why tensors should be allocated when implementing plugin kernels.
The kernel C API provides:

- `TF_NewTensor`
- `TF_AllocateTensor`
- `TF_AllocateOutput`
- `TF_SetOutput`
- `TF_ForwardInputOrAllocateOutput`
It seems that `TF_ForwardInputOrAllocateOutput` is meant to cover all of these needs; however, I've seen it consistently allocate instead of forward in cases that were trivially forwardable (input → Reshape → Dense: the Reshape could forward its input instead of allocating).
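For reference, this is roughly how I'm calling it in my Reshape-like kernel. The `TF_*` calls are from `tensorflow/c/kernels.h` / `tf_tensor.h`; the function name, the flatten-to-1-D shape, and the copy comment are just placeholders for my actual code:

```c
#include "tensorflow/c/kernels.h"
#include "tensorflow/c/tf_status.h"
#include "tensorflow/c/tf_tensor.h"

/* Sketch of my Compute function (MyReshapeCompute is my name, not API). */
static void MyReshapeCompute(void* kernel, TF_OpKernelContext* ctx) {
  TF_Status* status = TF_NewStatus();

  TF_Tensor* input = NULL;
  TF_GetInput(ctx, 0, &input, status);

  /* Desired output shape; here just flattening to 1-D as an example. */
  int64_t out_dims[1] = {TF_TensorElementCount(input)};

  /* Ask TF to reuse input 0's buffer for output 0 if it can,
     otherwise allocate a fresh tensor. */
  int candidate_inputs[1] = {0};
  int forwarded = -1; /* set to the forwarded input's index, or -1 */
  TF_Tensor* output = TF_ForwardInputOrAllocateOutput(
      ctx, candidate_inputs, /*num_candidate_input_indices=*/1,
      /*output_index=*/0, out_dims, /*output_num_dims=*/1,
      &forwarded, status);

  if (forwarded == -1) {
    /* Forwarding failed: 'output' is freshly allocated, so the data
       has to be copied over from 'input' here. This is the branch I
       always end up in, even in the Reshape case above. */
  }

  TF_DeleteTensor(input);
  TF_DeleteTensor(output);
  TF_DeleteStatus(status);
}
```

With this, `forwarded` comes back as `-1` every time in the example above.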
What are the general guidelines for kernel implementations? What is possible and what isn't?
Should every kernel allocate a new output tensor? Is a kernel allowed to reuse its input tensor by simply calling `TF_SetOutput` on it? How does `TF_ForwardInputOrAllocateOutput` decide when to forward and when to allocate? And why don't I see it forwarding the input in a simple example like the one above?