I am working on enabling auto-tuning for softmax based on the discussion here, but am finding it difficult to know exactly what code to modify. From some digging, it seems that I have to modify OP2TOPI
in extract_from_program
in relay_integration.py
, and a handful of places in topi_integration.py
.
Further, I know that I have to modify the decorators on top of the existing softmax compute and schedules, as well as define a “workload”, but am still looking for exactly where that is.
Is there a tutorial available for this? It seems like a common enough scenario to warrant one. Or, would someone be able to point me in the right direction for implementing this? After successfully going through the process, I would be happy to write the tutorial.