Matrix size less than VTA block size

WLeeeee · August 31, 2019, 7:54pm

I was wondering how VTA solves the problem when matrix size is less than VTA tiling size.

By default, the input tiling size is 1 x 16. However, if I want to do a computation with

input matrix dimension = 5 x 5, how does VTA lower such computation in python.

By using padding?

I tried it in vta_get_started.py but I got the following error:

scope local.acc_buffer need to have block=16, shape=[5, 5]

Thanks

thierry · September 1, 2019, 1:07am

Yes, right now that’s a restriction of VTA. That being said, we may be able to add a relay pass that reshapes operators to be of shapes that are multiples of 16, and would automatically zero-pad. It might not be the most efficient thing to do for small shapes, but would ensure that we provide adequate support for all shapes.

WLeeeee · September 1, 2019, 11:21am

Thanks for your answer

zhanghaohit · August 27, 2020, 5:59am

Hi @thierry, is the feature supported already?