TFLite exports weights in OHWI format which is its default layout. With its current implementation, the TFLite parser transposes the kernel so the layout becomes OIHW. This is presumably because TVM doesn’t fully support OHWI.
This becomes an issue for Arm Compute Library (and presumably other external codegen libraries) since it supports OHWI weights by default. One way to overcome this is to transpose the kernel back to OHWI format before passing them to ACL. This results in 2 different conversions: OHWI->OIHW->OHWI. However, now that external codegen has landed, I don’t believe it should be upto the parser to convert to a supported format specifically for TVM. Instead it should be taken care of lower down the stack, after the graph has been annotated and partitioned. Are there any thoughts on this?