https://hgpu.org/?p=3534
A Parallel Algorithm for Dot Product over Word-Size Finite Field Using Floating-Point Arithmetic