https://hgpu.org/?p=17392
Automatically Selecting Profitable Thread Block Sizes Using Machine Learning