https://hgpu.org/?p=895
Importance of Explicit Vectorization for CPU and GPU Software Performance