Output node CUDA Kernel repeatedly excuted

“tvm/src/runtime/cuda/cuda_module.cc” class CUDAWrappedFunc{ }, I found my output wrapped func was invoked twice in one excution, what could be the reason please?