I’m trying to enable AutoTVM for OpenCL (the intel_graphics target). So far I have had some success, but the results are several times worse than with the generic scheduler.
To begin with, I am focusing only on the conv2d operation (since it is also the only one currently present in the intel_graphics TOPI). I used the conv2d_direct.py file from CUDA as a dummy test file (this scheduler seemed to be the easiest) to get some idea of what is required to write my own. There are a few things I don’t understand, and I’d appreciate guidance on how such a scheduler should be written and what values I should provide. The two most pressing questions for now are:
- Where do all of the splitting and other numerical values in schedule_direct_cuda come from?
- How did you decide on the tvm.thread_axis threads?