How does TVM implement X86 SIMD instructions


TVM as a compiler optimized, if the target is X86, on X86 for accelerated optimization, is it necessary to use SIMD instruction? If so, how does it work? If not, how does it work。


SIMD instructions such as as AVX-2/AVX-512 extensions can be targeted using schedule primitives such as vectorize. The GEMM CPU tutorial is a good example:


ok,thank you very much;
I want to find out how is vectorize implemented,
When I look for the code definition of vectorize,found it using the founction of _api_internal.StageVectorize ,but When I open file, there are only comments in it


it’s registered in cpp side
Vectorization is implemented in