https://hgpu.org/?p=18781
Improving GPU Performance through Instruction Redistribution and Diversification