IIUC, this means, that only models, which have a unique layer in the architecture can be efficiently run with VTA? This unique layer should also be one of the last layers, to offload as much as possible on the FPGA.
So if I have a model with the structure¹
nn.conv2d nn.leaky_relu nn.max_pool2d ... nn.conv2d nn.leaky_relu nn.max_pool2d nn.conv2d
I won’t be able to use
graph_pack on this model, since I can’t specify a (good)
stop_name for this network. Which then means, that I can’t offload this model to VTA.
If that’s the case this restricts the usability of VTA to only pretty specific models. Is there already a plan how to address this challenge?
cc @thierry, since you are the author of this code.
¹ This is the (simplified) structure of TinyYOLO from Tensornets