Hi @srkreddy1238 @tqchen @merrymercy @ZihengJiang, I am deploying a TVM-compiled model for CUDA and OpenCL. Which flags do I need to set in C++ for each target? For example:
int dtype_code = kDLFloat; // what do I need to set for CUDA?
int dtype_bits = 32; // what do I need to set for CUDA?
int dtype_lanes = 1; // what do I need to set for CUDA?
int device_type = kDLGPU; // for CUDA
int device_id = 0; // what do I need to set for CUDA?
int in_ndim = 4; // input shape is (1, 64, 64, 3)
int dtype_code = kDLFloat; // what do I need to set for OpenCL?
int dtype_bits = 32; // what do I need to set for OpenCL?
int dtype_lanes = 1; // what do I need to set for OpenCL?
int device_type = kDLOpenCL; // for OpenCL
int device_id = 0; // what do I need to set for OpenCL?
int in_ndim = 4; // input shape is (1, 64, 64, 3)
With the llvm target, the DLContext settings below work fine:
int dtype_code = kDLFloat;
int dtype_bits = 32;
int dtype_lanes = 1;
int device_type = kDLCPU;
int device_id = 0;
int in_ndim = 4;
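To make the question concrete, here is a minimal self-contained sketch of how I am grouping these values so that only the device changes between the three targets. The enums are stand-ins that mirror the values in dlpack.h so the snippet builds without TVM, and `AllocFlags`, `MakeFlags` and `NumElements` are hypothetical names of mine, not TVM APIs. It assumes the dtype fields describe only the element type (float32, one lane) and therefore stay the same on every device, and that device_id is just the index of the device to use. That assumption is what I would like confirmed.

```cpp
#include <cassert>
#include <cstdint>

// Stand-ins mirroring the DLPack enum values so this builds without TVM.
// In real code, include <dlpack/dlpack.h> instead of redefining them.
enum StandInDeviceType { kDLCPU = 1, kDLGPU = 2, kDLOpenCL = 4 };
enum StandInTypeCode { kDLInt = 0, kDLUInt = 1, kDLFloat = 2 };

// Hypothetical helper: the values handed over when allocating the input tensor.
struct AllocFlags {
  int dtype_code;
  int dtype_bits;
  int dtype_lanes;
  int device_type;
  int device_id;
  int in_ndim;
  int64_t in_shape[4];
};

// Assumption: dtype_code / dtype_bits / dtype_lanes depend only on the model's
// input type (float32 here), so only device_type and device_id vary per target.
AllocFlags MakeFlags(int device_type, int device_id = 0) {
  AllocFlags f = {kDLFloat, 32, 1, device_type, device_id, 4, {1, 64, 64, 3}};
  return f;
}

// Number of elements in the input tensor, computed from the shape.
int64_t NumElements(const AllocFlags& f) {
  assert(f.in_ndim >= 0 && f.in_ndim <= 4);
  int64_t n = 1;
  for (int i = 0; i < f.in_ndim; ++i) n *= f.in_shape[i];
  return n;
}
```

With this, `MakeFlags(kDLCPU)`, `MakeFlags(kDLGPU)` and `MakeFlags(kDLOpenCL)` differ only in `device_type`, and the fields would then be passed on to the allocation call in the same order as the variables above.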