[codegen_cuda] How to write a unittest for my PR?

pzq · August 21, 2019, 4:36pm

I have fixed an error that occurs when compiling a float16 model for CUDA.
A related question on TVM Discuss can be found here.
Could someone review my pull request?
#3811

Here are the error details:

/tmp/tmpz_0pydlm/my_kernel.cu(9890): error: more than one instance of overloaded function "max" matches the argument list:
            function "max(int, int)"
            function "max(unsigned int, unsigned int)"
            function "max(int, unsigned int)"
            function "max(unsigned int, int)"
            function "max(long, long)"
            function "max(unsigned long, unsigned long)"
            function "max(long, unsigned long)"
            function "max(unsigned long, long)"
            function "max(long long, long long)"
            function "max(unsigned long long, unsigned long long)"
            function "max(long long, unsigned long long)"
            function "max(unsigned long long, long long)"
            function "max(float, float)"
            argument types are: (half, __half)
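
For readers wondering why the compiler lists every overload as a match, here is a minimal sketch in plain C++ that mimics the situation. It assumes the half type converts implicitly to several arithmetic types. `MockHalf`, `max_of`, and `max_via_float` are hypothetical names made up for this illustration and are not part of CUDA or TVM.

```cpp
// Hypothetical stand-in for CUDA's half type; NOT the real __half.
// It converts implicitly to both int and float.
struct MockHalf {
    float value;
    constexpr explicit MockHalf(float v) : value(v) {}
    constexpr operator float() const { return value; }
    constexpr operator int() const { return static_cast<int>(value); }
};

// Two overloads standing in for the long candidate list in the error above.
constexpr int max_of(int a, int b) { return a > b ? a : b; }
constexpr float max_of(float a, float b) { return a > b ? a : b; }

// Calling max_of with two MockHalf arguments does not compile: each overload
// is reachable through a different user-defined conversion, and neither
// conversion is a better match than the other.
//   auto bad = max_of(MockHalf(1.5f), MockHalf(2.5f));  // error: ambiguous

// Workaround: convert explicitly so that only the float overload matches exactly.
constexpr float max_via_float(MockHalf a, MockHalf b) {
    return max_of(static_cast<float>(a), static_cast<float>(b));
}
```

With the explicit casts, `max_via_float(MockHalf(1.5f), MockHalf(2.5f))` resolves to the float overload. Uncommenting the `bad` line reproduces the same kind of "more than one instance of overloaded function" diagnostic.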

pzq · August 21, 2019, 4:40pm

I have submitted pull request #3811.
@cchung100m told me I need to add a unit test in tests/python/unittest/test_codegen_cuda.py.
Do I need to implement the unit test myself?

nicklhy · August 22, 2019, 1:36am

Great work, man!
But I have another question. In CUDA, the max/min functions are defined for almost every other type (e.g. int, long, float, …). Why are they not defined for the half type?

pzq · August 22, 2019, 1:47am

Maybe because half is not a basic dtype on the CPU?
Actually, they have defined many other functions for half, such as gt, lt, etc.
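
Since comparison functions do exist for half, one plausible way to get max/min is to build them on top of those comparisons. The sketch below shows the idea in plain C++. `MockHalf`, `half_gt`, `half_lt`, `max_of`, and `min_of` are all hypothetical names used only for illustration, and this is not necessarily what the PR does.

```cpp
// Hypothetical stand-in for CUDA's half type; NOT the real __half.
struct MockHalf {
    float value;
    constexpr explicit MockHalf(float v) : value(v) {}
    constexpr operator float() const { return value; }
    constexpr operator int() const { return static_cast<int>(value); }
};

// Hypothetical helpers playing the role of the gt/lt functions for half.
constexpr bool half_gt(MockHalf a, MockHalf b) { return a.value > b.value; }
constexpr bool half_lt(MockHalf a, MockHalf b) { return a.value < b.value; }

// Existing overloads for built-in types.
constexpr int max_of(int a, int b) { return a > b ? a : b; }
constexpr float max_of(float a, float b) { return a > b ? a : b; }

// New overloads taking MockHalf directly. They are an exact match, so the
// compiler prefers them over any overload that needs a conversion, and the
// call is no longer ambiguous.
constexpr MockHalf max_of(MockHalf a, MockHalf b) { return half_gt(a, b) ? a : b; }
constexpr MockHalf min_of(MockHalf a, MockHalf b) { return half_lt(a, b) ? a : b; }
```

Here `max_of(MockHalf(1.5f), MockHalf(2.5f))` picks the `MockHalf` overload without any cast, while `max_of(3, 7)` still resolves to the int overload as before.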

pzq · August 22, 2019, 2:59am

I have added a unit test for my PR here: #3811
Could anybody take a look?
The unit test passes on all of my machines.
If the PR's changes are not applied to your TVM, running this unit test will produce an error; otherwise it will succeed.