I am working on a Nvidia Drive PX2 with CUDA 9.0 and Tensorflow 1.9.0.
To compile Tensorflow models I execute the from_tensorflow.py script. (see https://docs.tvm.ai/tutorials/nnvm/from_tensorflow.html)
Target settings:
target = ‘cuda’
target_host = ‘llvm -target=aarch64-linux-gnu’
layout = “NCHW”
ctx = tvm.gpu(0)
I get a few warnings like this:
WARNING:autotvm:Cannot find config for target=cuda, workload=(‘conv2d’, (1, 448, 10, 10, ‘float32’), (384, 448, 3, 3, ‘float32’), (1, 1), (0, 0), (1, 1), ‘NCHW’, ‘float32’). A fallback configuration is used, which may bring great performance regression.
I am not sure how they affect the performance.
The top 5 results for TVM and Tensorflow are output. The results look good but the running time is slow.
As I want to do some benchmarking I measured the time for each TVM and Tensorflow when classifying an image.
Compared to Tensorflow the running time for TVM is about four times higher.
How can that be? Is it because of the warnings I get?