Tensorflow hub package for TFLite 2.1 quantized models

anijain2305 · June 23, 2020, 5:16pm

I am adding support for TFLite 2.1 quantized models. Unfortunately, there are no hosted models that we can download.

We can use the TFLite API to quantize the model, but it needs tensorflow hub package to pull the original FP32 models. The code is present in this PR - https://github.com/apache/incubator-tvm/pull/5848

I was wondering if it is ok to add that dependency. If thats not the right way, I can put the quantized models in the TVM data repo. Please let me know.

cc @ramana-arm

anijain2305 · June 24, 2020, 9:12am

Just a gentle ping to get a decision on this @tqchen

anijain2305 · June 27, 2020, 12:56am

Gentle ping … @tqchen

tqchen · June 27, 2020, 1:53am

let us put the model into data folder. to reduce the complexity, we can also consider to trim the model a bit

anijain2305 · June 27, 2020, 3:03am

Just to confirm. By data folder, do you mean this repo - https://github.com/dmlc/web-data/tree/master/tensorflow?

Yes, I will only submit the tflite files. For now, I test 5 models. We can cut to 3 for decent coverage - resnet, inception_v1 and mobilenet_v2. Following are the sizes

26M Jun 21 18:49 /tmp/resnet_50_full_integer_except_io.tflite
4.4M Jun 22 02:38 /tmp/mobilenet_v1_full_integer_except_io.tflite
3.9M Jun 22 03:13 /tmp/mobilenet_v2_full_integer_except_io.tflite
6.6M Jun 22 03:33 /tmp/inception_v1_full_integer_except_io.tflite
24M Jun 22 07:03 /tmp/inception_v3_full_integer_except_io.tflite

tqchen · June 27, 2020, 5:31am

Yes that works, as long as we have a good persistent location to download it is fine

anijain2305 · June 28, 2020, 1:12am

@tqchen Added the PR - https://github.com/dmlc/web-data/pull/257