https://hgpu.org/?p=25626
An Experimental Study of SYCL Task Graph Parallelism for Large-Scale Machine Learning Workloads