Hello, I’m trying to use TVM and Relay to explore graph-level optimizations: given a compute graph as input, search the graph, manipulate nodes (split, fuse, etc.) according to rules or a cost model, and output a new graph that might perform better.
Should I start by writing a new pass for Relay, or is there existing work that I can leverage? I see code such as combine_parallel_conv2d and fuse_ops under src/relay/pass. Can anyone give me a pointer here?
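
To make the kind of rewriting I have in mind concrete, here is a toy sketch in plain Python. It does not use any TVM or Relay API: the `Node` class, the `fuse_add_relu` rule, and the `cost` function are all hypothetical stand-ins for illustration, and the cost model simply counts operators.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Node:
    # Hypothetical graph node: an operator name plus its input nodes.
    op: str
    inputs: Tuple["Node", ...] = ()


def cost(node: Node) -> int:
    # Toy cost model (an assumption): one unit per operator, variables are free.
    own = 0 if node.op == "var" else 1
    return own + sum(cost(i) for i in node.inputs)


def fuse_add_relu(node: Node) -> Node:
    # Hypothetical rule: relu(add(a, b)) -> add_relu(a, b).
    if node.op == "relu" and len(node.inputs) == 1 and node.inputs[0].op == "add":
        return Node("add_relu", node.inputs[0].inputs)
    return node


def rewrite(node: Node, rules) -> Node:
    # Rewrite the inputs first (post-order), then try each rule on this node
    # and keep a candidate only if the cost model says it is cheaper.
    node = Node(node.op, tuple(rewrite(i, rules) for i in node.inputs))
    for rule in rules:
        candidate = rule(node)
        if cost(candidate) < cost(node):
            node = candidate
    return node


x, y = Node("var"), Node("var")
graph = Node("relu", (Node("add", (x, y)),))
optimized = rewrite(graph, [fuse_add_relu])
print(optimized.op, cost(graph), cost(optimized))  # add_relu 2 1
```

What I'd like to know is how best to express this kind of traversal, rule application, and cost-guided selection on top of Relay's own IR.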