Holger Pirk
In this thesis, we study the management of relational data in modern, i.e., asymmetric computer systems. We explore different strategies to identify asymmetries in persistent data, map them to asymmetries in the memory landscape and, eventually, exploit them to increase query processing performance. To this end, we study memory conscious decomposition and storage of data […]
View View   Download Download (PDF)   
Karl Rupp, Philippe Tillet, Florian Rudolf, Josef Weinbub, Tibor Grasser, Ansgar Jungel
The performance portability of OpenCL kernel implementations for common memory bandwidth limited linear algebra operations across different hardware generations of the same vendor as well as across vendors is studied. Certain combinations of kernel implementations and work sizes are found to exhibit good performance across compute kernels, hardware generations, and, to a lesser degree, vendors. […]
View View   Download Download (PDF)   
D.William Albert, Dr.K.Fayaz, D.Veerabhadra Babu
Frequent pattern mining is one of the widely used data mining techniques for discovering trends or patterns from databases. As data is growing in exponential pace, data mining activities need more powerful computing. Fortunately modern GPUs (Graphics Processing Units) have specialized electronic circuits and support parallel processing. GPUs are capable of processing huge amount of […]
View View   Download Download (PDF)   
Philippe Helluy, Jonathan Jung
In this work we propose an efficient finite volume approximation of two-fluid flows. Our scheme is based on three ingredients. We first construct a conservative scheme that removes the pressure oscillations phenomenon at the interface. The construction relies on a random sampling at the interface [6, 5]. Secondly, we replace the exact Riemann solver by […]
View View   Download Download (PDF)   
Slo-Li Chu, Chih-Chieh Hsiao
Graphics processing units (GPUs) in a computer system are increasingly adopted to create a realistic environment in 3D applications. Despite the extremely high parallelism of these devices with a tremendous amount of processing elements, GPUs are seldom used in scientific applications owing to their difficulty in programming. Additionally, vendor-provided APIs are too specific to cross […]
View View   Download Download (PDF)   
Philippe Helluy, Jonathan Jung
In this paper, we propose a new very simple numerical method for solving liquid-gas compressible flows. Such flows are difficult to simulate because classic conservative finite volume schemes generate pressure oscillations at the liquid-gas interface. We extend to several dimensions the random choice scheme that we have constructed in [2]. The extension is performed through […]
View View   Download Download (PDF)   
Slo-Li Chu, Chih-Chieh Hsiao
Heterogeneous multicore architectures with CPU and add-on GPUs or streaming processors are now widely used in computer systems. These GPUs provide substantially more computation capability and memory bandwidth compared to traditional multi-cores. Also, because they are highly programmable, they provide the computational performance needed for realistic graphics rendering. Applications with general computations can also be […]
View View   Download Download (PDF)   
Rahul Garg, Laurie Hendren
High-performance computing systems today include a variety of compute devices such as multi-core CPUs, GPUs and many-core accelerators. OpenCL allows programming different types of compute devices using a single API and kernel language. However, there is no standard matrix operations library in OpenCL for operations such as matrix multiplication that works well on a variety […]
View View   Download Download (PDF)   
Arthur Densmore
Reflector antennas play a key role in the communication industry, and enhancing the speed of the analysis of reflector antenna systems can provide better responsiveness to the needs of industry as well as promote better understanding of software modeling through faster visualization. A reflector antenna system typically consists of a feed assembly, with a feedhorn […]
View View   Download Download (PDF)   
Cole Kendrick
The main goal of this project is to develop a computer program to model the creation of structure in Saturn’s ring system. The computer program will be used to answer these questions: (1) How are gaps in Saturn’s Rings formed; (2) how accurately can I model gap formation with a 3D N-Body simulation; and (3) […]
Hakan Johansson
Volume rendering is quite an old concept of representing images, dating back to the 1980’s. It is very useful in the medical field for visualizing the results of a computer tomography (CT) and magnet resonance tomography (MRT) in 3D. Apart from these two major applications for volume rendering, there aren’t many other fields of usage […]
View View   Download Download (PDF)   
Anais Crestetto, Philippe Helluy, Jonathan Jung
We present several numerical simulations of conservation laws on recent multicore processors, such as GPU’s, using the OpenCL programming framework. Depending on the chosen numerical method, different implementation strategies have to be considered, for achieving the best performance. We explain how to program efficiently three methods: a finite volume approach on a structured grid, a […]
View View   Download Download (PDF)   
Page 1 of 3123

* * *

* * *

Follow us on Twitter

HGPU group

1658 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

335 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: