Hi,
I am trying to use quantization to accelerate inference of a TensorFlow graph.
Using AutoTVM without quantization works fine, but when relay.quantize.quantize
is used, a segmentation fault occurs.
I also tried extracting sub-graphs of the TensorFlow graph and quantizing them; this sometimes works and sometimes fails. There is one particular node: any sub-graph that ends before it quantizes fine, but as soon as that node is included in the sub-graph, quantization fails.
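In case it helps, here is a minimal sketch of how the offending node can be narrowed down by bisection. It assumes the nodes are in topological order and that every prefix containing the bad node fails. Because a segmentation fault kills the Python process (so try/except cannot catch it), each attempt would have to run in a child process. The script name quantize_prefix.py is hypothetical: it stands for a script that builds the sub-graph from the first k nodes and calls relay.quantize.quantize on it.

```python
import signal
import subprocess
import sys


def run_isolated(args):
    """Run a Python script in a child process and return its exit code.

    On POSIX, a negative code -N means the child was killed by signal N,
    so a segmentation fault shows up as -signal.SIGSEGV.
    """
    return subprocess.run([sys.executable, *args]).returncode


def prefix_quantizes_ok(k):
    # Hypothetical helper script (not part of TVM): quantizes the
    # sub-graph made of the first k nodes and exits with code 0 on success.
    code = run_isolated(["quantize_prefix.py", str(k)])
    if code == -signal.SIGSEGV:
        print(f"prefix of {k} nodes: segmentation fault")
    return code == 0


def first_failing_prefix(n_nodes, prefix_ok):
    """Return the smallest prefix length for which prefix_ok is False.

    Assumes the empty prefix passes, the full graph fails, and that
    failures are monotonic (once a prefix fails, longer ones fail too).
    """
    lo, hi = 0, n_nodes  # invariant: prefix lo passes, prefix hi fails
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if prefix_ok(mid):
            lo = mid
        else:
            hi = mid
    return hi
```

With a real graph this would be called as first_failing_prefix(len(nodes), prefix_quantizes_ok), and the suspect node is the last one in the returned prefix, i.e. nodes[result - 1].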
So, are there any suggestions for finding out what is wrong and making it work?