https://hgpu.org/?p=12709
An Investigation of Unified Memory Access Performance in CUDA