As the trends of process scaling make memory system even more crucial bottleneck, the importance of latency hiding techniques such as prefetching grows further. However, naively using prefetching can harm performance and energy efficiency and hence, several factors and parameters need to be taken into account to fully realize its potential. In this paper, we survey several recent techniques that aim to improve implementation and effectiveness of prefetching. We characterize the techniques on several parameters to highlight their similarities and differences. The aim of this survey is to provide insights to researchers into working of prefetching techniques and spark interesting future work for improving the performance advantages of prefetching even further.
March 29, 2016 by hgpu
March 29, 2016 by hgpu
March 29, 2016 by hgpu
March 29, 2016 by hgpu
March 25, 2016 by hgpu
March 25, 2016 by hgpu
March 25, 2016 by hgpu
An Efficient Implementation of the Longest Common Subsequence Algorithm with Bit-Parallelism on GPUs
March 25, 2016 by hgpu
March 25, 2016 by hgpu
March 22, 2016 by hgpu