Input data for TVM C++ API

baileyqbb · May 10, 2018, 4:03pm

Hi, there. I am trying to implement the cpp example in the docs/how_to/deploy.md, by running the compiled modules using TVM c++ API, and facing the error regarding the input data.

In the example, it loads the binary input data by

DLTensor* x;
int in_ndim = 4;
int64_t in_shape[4] = {1, 3, 224, 224};
TVMArrayAlloc(in_shape, in_ndim, dtype_code, dtype_bits, dtype_lanes, device_type, device_id, &x);
// load image data saved in binary
std::ifstream data_fin("cat.bin", std::ios::binary);
data_fin.read(static_cast<char*>(x->data), 3 * 224 * 224 * 4);

In my case, I won’t use binary data, but the Mat data from opencv instead (I don’t find the “cat.bin” from the project either). Here is the way I do:

cv::Mat img = cv::imread("test.jpg");
cv::Mat resized_img;
cv::resize(img, resized_img, cv::Size(224,224));
memcpy(static_cast<unsigned char*>(x->data), resized_img.data, 3*224*224);

Since the resized_img.data is in uchar* format, x->data was converted to uchar* as well.

But it gives ‘Segmentation fault (core dumped)’ error while running the memcpy command.

Here are my two questions:

For the cat.bin data, why the data length is 3*224*224*4? More specific, what does the fourth dimension ‘4’ represent here? (generally, 3 (channels) * 224 (width) * 224 (height) * 1 (sizeof(uchar/char)), right?)
What’s the correct way to set the x->data if using the image data from Mat format?

Thanks!

alex-weaver · May 12, 2018, 3:15pm

Regarding (1) It looks like the * 4 is because dtype_code is kDLFloat, so each element of the tensor has size 4, and the use of uchar* is just so that sizes can be calculated in units of bytes.

As for setting the x->data element correctly, take a look at the DLTensor struct in DLPack:

github.com

dmlc/dlpack/blob/master/include/dlpack/dlpack.h#L94


  * \brief Number of bits, common choices are 8, 16, 32.
  */
 uint8_t bits;
 /*! \brief Number of lanes in the type, used for vector types. */
 uint16_t lanes;
} DLDataType;


/*!
* \brief Plain C Tensor object, does not manage memory.
*/
typedef struct {
 /*!
  * \brief The opaque data pointer points to the allocated data.
  *  This will be CUDA device pointer or cl_mem handle in OpenCL.
  *  This pointer is always aligns to 256 bytes as in CUDA.
  */
 void* data;
 /*! \brief The device context of the tensor */
 DLContext ctx;
 /*! \brief Number of dimensions */
 int ndim;

The key thing here is that the shape and strides arrays match the data you have. You will need to find some way of unpacking the Mat format into memory (a cursory google suggests a few c++ libs are readily available), and then either construct a DLTensor to correctly map to your in-memory data, or copy your in-memory data into a compact buffer and build a DLTensor for that.

Side note - the header docs for DLTensor say that if strides is NULL, this indicates the tensor is compact. I think this means that the last dimension is innermost, but I can’t actually see this documented anywhere in DLPack. It might be good to figure out what ordering ‘compact’ means and get it added to the docs.

Hope that helps

baileyqbb · May 15, 2018, 1:17am

Really thanks for your reply! I will try it and update the post later.

emelife · November 6, 2018, 4:31am

how do you solve this problem?

wk738126046 · December 21, 2018, 6:06am

Hi, did you solve it ???
I got a error result.
Output was :
The maximum position in output vector is: o

Aeroxander · December 10, 2019, 11:51pm

Would also love to know how you loaded the image data

7oud · December 21, 2019, 2:38pm

how to releae the created graph… is there api ?