13857
Yutong Qin, Jianbiao Lin, Xiang Huang
Ray tracing is a technique for generating an image by tracing the path of light through pixels in an image plane and simulating the effects of high-quality global illumination at a heavy computational cost. Because of the high computation complexity, it can’t reach the requirement of real-time rendering. The emergence of many-core architectures, makes it […]
View View   Download Download (PDF)   
Joshua Penton
Deployment of parallel architectures in computing systems is increasing. In this paper we study the performance effects of a variety of programming techniques and technologies that utilize these parallel architectures as applied to example algorithms. We demonstrate that algorithms, which are highly parallel in nature, gain significant performance increases through proper application of both parallel […]
Sachitsing Dwarkan
Medical image registration is a computational task involving the spatial realignment of multiple sets of images of the same or different modalities. A novel method of using the Open Computing Language (OpenCL) framework to accelerate affine image registration across multiple processing architectures is presented. The use of this method on graphics processors results in a […]
View View   Download Download (PDF)   
Jonathan Thompson, Kristofer Schlachter
This paper presents an overview of the OpenCL 1.1 standard [Khronos 2012]. We first motivate the need for GPGPU computing and then discuss the various concepts and technological background necessary to understand the programming model. We use concurrent matrix multiplication as a framework for explaining various performance characteristics of compiling and running OpenCL code, and […]
View View   Download Download (PDF)   
Derek K. Gerstmann, Toby Potter, Michael Houston, Paul Bourke, Kwan-Liu Ma, Andreas Wicenec
Simulating the expansion of a Type II supernova using an adaptive computational fluid dynamics (CFD) engine yields a complex mixture of turbulent flow with dozens of physical properties. The dataset shown in this sketch was initially simulated on iVEC’s EPIC supercomputer (a 9600 core Linux cluster) using FLASH [Fryxell et al. 2000] to model the […]
View View   Download Download (PDF)   
James Sweet
Due to the high demand for secure Internet usage, an improvement of the SSL performance is needed. This paper describes a technique to improve the performance of SSL by creating a CPU/GPU hybrid proxy to sit in front of a web server to only handle the SSL overheads. This will allow the utilization of high […]
View View   Download Download (PDF)   

* * *

* * *

Follow us on Twitter

HGPU group

1753 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

372 people like HGPU on Facebook

HGPU group © 2010-2016 hgpu.org

All rights belong to the respective authors

Contact us: