I tuned the GPU TVM model of resnet-100 for target Cuda 1080TI(Pascal architecture). I got enough speedup! Good works guys.
But I have a question about deploying and running TVM modules on different NVIDIA GPU architectures. Because when I tried to run this module on Tesla K80(Kepler architecture). I got the following error: " CUDAError: Check failed: ret == 0 (-1 vs. 0) : cuModuleLoadData(&(module_[device_id]), data_.c_str()) failed with error: CUDA_ERROR_INVALID_PTX"
How I understand I need to build a new TVM module for other architecture(Tesla K80). Is it the right way? Or maybe there is a way for building a TVM module for different GPU architectures in one time.
Thank you in advance.