https://www.nvidia.com/en-us/on-demand/session/gtcspring21-s31429/
Would you like to make your CUDA code run faster on any GPU? Are you tired of having to manually search for the best performing sizes for your thread block
tuning made easycode optimizationcudaautogtc