Is TVM applicable for training?

Hi, guys!

I have a newbie question about the title after reading the TVM paper.

In the paper, I see that TVM does both graph-level and operator-level optimizations. These methods are applicable to training. But some other important topics in training are missing, such as operator placement, leveraging heterogeneous devices, etc. I also searched for applications of TVM, and I found that most of them are for inference. Also, the experiment section of the TVM paper concentrates on inference performance on server-side and embedded devices.

So, I want to know whether TVM is inference-only by design, or whether it also targets training but is not widely used for it for some reason.

Thanks !!


I remember reading a number of discussions on this topic (e.g. "Dose TVM support training models?"). My general understanding is that TVM is only for inference.
There are some posts here and on GitHub (I cannot find them now) that suggest training is possible, but it is extremely difficult because some nodes are missing backprop operations, among other gaps.
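To illustrate why a single missing backprop rule blocks training: frameworks keep a per-operator gradient registry, and the backward pass fails as soon as it reaches an op with no registered gradient. This is a minimal sketch in plain Python, not TVM code; all names here (`GRADIENTS`, `register_gradient`, `backprop`) are hypothetical, for illustration only.

```python
# Hypothetical sketch (not TVM code): per-operator gradient registry.
# Training requires a gradient rule for EVERY op in the graph; one
# unregistered op makes the whole model untrainable.

GRADIENTS = {}  # op name -> function(inputs, out_grad) -> input grads

def register_gradient(name):
    def wrap(fn):
        GRADIENTS[name] = fn
        return fn
    return wrap

@register_gradient("multiply")
def multiply_grad(inputs, out_grad):
    # d(x*y)/dx = y, d(x*y)/dy = x
    x, y = inputs
    return [out_grad * y, out_grad * x]

@register_gradient("add")
def add_grad(inputs, out_grad):
    # addition passes the upstream gradient through unchanged
    return [out_grad, out_grad]

def backprop(op_name, inputs, out_grad):
    if op_name not in GRADIENTS:
        # The situation described above: an op with no backprop rule.
        raise NotImplementedError(f"no gradient registered for {op_name!r}")
    return GRADIENTS[op_name](inputs, out_grad)

print(backprop("multiply", [3.0, 4.0], 1.0))  # -> [4.0, 3.0]
```

Inference only ever needs the forward definitions, which is why an op set that fully supports inference can still be incomplete for training.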

TVM is only available for inference for the time being.


The Facebook team is already working on supporting TVM for PyTorch training acceleration; refer to:

As far as I know, MXNet also supports TVM as a training backend.

It seems the example https://github.com/pytorch/tvm/blob/master/test/benchmarks.py provided in the pytorch/tvm project is just for inference, not training.

Is there a list of things that are needed before training can be well supported by TVM? If Facebook is working on it, is there a design doc somewhere? I spent some time looking for one but haven't found it so far.

Hi, I'm very curious about this repo, but it seems it was abandoned. Could I ask what happened? Thank you!