12553
Brian Hamilton, Stefan Bilbao, Craig J. Webb
Viscothermal effects in air lead to a damping of high frequencies over time. Such effects cannot be neglected in large-scale room acoustics simulations for the full audible bandwidth. In this study, full-bandwidth room acoustics is modelled using a variant of the three-dimensional wave equation including viscothermal losses in air following from a simplification of the […]
View View   Download Download (PDF)   
Jie Liu, Chunye Gong, Weimin Bao, Guojian Tang, Yuewen Jiang
We present a parallel GPU solution of the Caputo fractional reaction-diffusion equation in one spatial dimension with explicit finite difference approximation. The parallel solution, which is implemented with CUDA programming model, consists of three procedures: preprocessing, parallel solver, and postprocessing. The parallel solver involves the parallel tridiagonal matrix vector multiplication, vector-vector addition, and constant vector […]
View View   Download Download (PDF)   
Qi Xu, Gang-Lin Yu, Kan Wang, Jia-Long Sun
In this paper, the adaptability of the neutron diffusion numerical algorithm on GPUs was studied, and a GPUaccelerated multi-group 3D neutron diffusion code based on finite difference method was developed. The IAEA 3D PWR benchmark problem was calculated in the numerical test. The results demonstrate both high efficiency and adequate accuracy of the GPU implementation […]
View View   Download Download (PDF)   
Claas Abert, Gregor Wautischer, Florian Bruckner, Armin Satz, Dieter Suess
We implement an efficient energy-minimization algorithm for finite-difference micromagnetics that proofs especially usefull for the computation of hysteresis loops. Compared to results obtained by time integration of the Landau-Lifshitz-Gilbert equation, a speedup of up to two orders of magnitude is gained. The method is implemented in a finite-difference code running on CPUs as well as […]
Brian Vyhnalek
Ultra-wideband (UWB) wireless systems have recently gained considerable attention as effective communications platforms with the properties of low power and high data rates. Applications of UWB such as wireless USB put size constraints on the antenna, however, which can be very difficult to meet using typical narrow band antenna designs. The aim of this thesis […]
View View   Download Download (PDF)   
Jacob Lundgren, Nils Olofsson
The inherent issues of properly deploying finite difference calculations onto GPUs are described and solutions are suggested. A speedup of 60x is achieved over the CPU version. Four visualization methods were implemented using OpenGL and compared in terms of the clarity of their visual result. A combination of hedgehogs and slices was deemed to give […]
View View   Download Download (PDF)   
David S Medina, Amik St-Cyr, T. Warburton
The inability to predict lasting languages and architectures led us to develop OCCA, a C++ library focused on host-device interaction. Using run-time compilation and macro expansions, the result is a novel single kernel language that expands to multiple threading languages. Currently, OCCA supports device kernel expansions for the OpenMP, OpenCL, and CUDA platforms. Computational results […]
View View   Download Download (PDF)   
Alexander Grebhahn, Norbert Siegmund, Sven Apel, Sebastian Kuckuk, Christian Schmitt, Harald Kostler
A standard technique to numerically solve elliptic partial differential equations on structured grids is to discretize them via finite differences and then to apply an efficient geometric multi-grid solver. Unfortunately, finding the optimal choice of multi-grid components and parameters is challenging and platform dependent, especially, in cases where domain knowledge is incomplete. Auto-tuning is a […]
Dmitri Priimak
We present finite differences numerical algorithm for solving 2D spatially homogeneous Boltzmann transport equation for semiconductor superlattices (SL) subject to time dependant electric field along SL axis and constant perpendicular magnetic field. Algorithm is implemented in C language targeted to CPU and in CUDA C language targeted to commodity NVidia GPUs. We compare performance and […]
View View   Download Download (PDF)   
Robert Georg Ilgner
The parallel FDTD method as used in computational electromagnetics is implemented on a variety of different high performance computing platforms. These parallel FDTD implementations have regularly been compared in terms of performance or purchase cost, but very little systematic consideration has been given to how much effort has been used to create the parallel FDTD […]
View View   Download Download (PDF)   
Freddie D Witherden, Antony M Farrington, Peter E Vincent
High-order numerical methods for unstructured grids combine the superior accuracy of high-order spectral or finite difference methods with the geometric flexibility of low-order finite volume or finite element schemes. The Flux Reconstruction (FR) approach unifies various high-order schemes for unstructured grids within a single framework. Additionally, the FR approach exhibits a significant degree of element […]
View View   Download Download (PDF)   
Nicholas J. Stewart, David W. Holmes, Wenxian Lin, Steven W. Armfield, Michael P. Kirkpatrick
Sheared convective boundary layers (SCBL) are a frequently observed boundary layer in nature and industry. This paper presents work conducted to validate a numerical fluid model of sheared convective boundary layers implemented in Nvidia’s CUDA programming language for graphical processing units. The code is based on finite difference implementation of the SIMPLE algorithm using the […]
View View   Download Download (PDF)   
Page 1 of 812345...Last »

* * *

* * *

Like us on Facebook

HGPU group

136 people like HGPU on Facebook

Follow us on Twitter

HGPU group

1204 peoples are following HGPU @twitter

Featured events

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: AMD APP SDK 2.9
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 6.0.1, AMD APP SDK 2.9

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors

Contact us: