Is operator fusing supported when using AutoTVM with GPU?
-
From the paper it seems that operator fusion is happening before AutoTVM (operator fusion is described in section 3, while AutoTVM in section 5).
-
There is answer from @eqy on a question regarding auto-tuning, from which I understand that for CUDA and OpenCL it is very difficult task. Hence I assume such fusing is not supported.
-
In AutoTVM source (e.g. https://github.com/dmlc/tvm/blob/master/topi/python/topi/cuda/conv2d.py#L119) decorators appear to be only for non-fused operations.
So how this actually is with this operator fusing?