Inferring bounds for env_threads

Currently, when using the .env_threads() scheduling primitive to make a scan op persistent, it seems that one needs to provide bounds for the thread axis passed as arguments. This does not seem necessary to me, at least for the case of GPUs. I was wondering if this restriction can be lifted, and if not, when can it not be?