I’d like to ask if there is any plan or any simple way to enable block (NCHWc) data format for whole model instead of plain (NCHW)?
After some playing with TVM it seems that the preferred data type is plain. Significant exceptions are for Intel CPU and Intel Graphics scheduler which are capable of converting from plain to block and back, but it’s not optimal as overall approach.
It appears that Intel HW prefers block data format and it would be profitable for performance to remove those conversions all together. The idea was to add on the model reading stage decision if model should be converted to block or plain, but it requires a lot of other changes down the compilation pipeline.
Could anyone comment here what’s the plan or solution?