https://hgpu.org/?p=18623
A Fast and Simple Approach to Merge and Merge Sort using Wide Vector Instructions