12718
Ziming Zhong
Over the past decade, the design of microprocessors has been shifting to a new model where the microprocessor has multiple homogeneous processing units, aka cores, as a result of heat dissipation and energy consumption issues. Meanwhile, the demand for heterogeneity increases in computing systems due to the need for high performance computing in recent years. […]
View View   Download Download (PDF)   
Volodymyr Kindratenko
This book brings together research on numerical methods adapted for Graphics Processing Units (GPUs). It explains recent efforts to adapt classic numerical methods, including solution of linear equations and FFT, for massively parallel GPU architectures. This volume consolidates recent research and adaptations, covering widely used methods that are at the core of many scientific and […]
View View   Download Download (PDF)   
A. Gorobets, F.X. Trias, R. Borrell, G. Oyarzun, A. Oliva
The purpose of the work is twofold. Firstly, it is devoted to the development of efficient parallel algorithms for large-scale simulations of turbulent flows on different supercomputer architectures. It reports experience with massively-parallel accelerators including graphics processing units of AMD and NVIDIA and Intel Xeon Phi coprocessors. Secondly, it introduces new series of direct numerical […]
View View   Download Download (PDF)   
Adam McLaughlin, David A. Bader
Graphs that model social networks, numerical simulations, and the structure of the Internet are enormous and cannot be manually inspected. A popular metric used to analyze these networks is betweenness centrality, which has applications in community detection, power grid contingency analysis, and the study of the human brain. However, these analyses come with a high […]
Alessandro Dal Pal'u, Agostino Dovier, Andrea Formisano, Enrico Pontelli
The parallel computing power offered by Graphical Processing Units (GPUs) has been recently exploited to support general purpose applications-by exploiting the availability of general API and the SIMT-style parallelism present in several classes of problems (e.g., numerical simulations, matrix manipulations) – where relatively simple computations need to be applied to all items in large sets […]
Matthias Bartelt, Michael Gross
This paper deals with a Galerkin-based multi-scale time integration of a viscoelastic rope model. Using Hamilton’s dynamical formulation, Newton’s equation of motion as a second-order partial differential equation is transformed into two coupled first order partial differential equations in time. The considered finite viscoelastic deformations are described by means of a deformation-like internal variable determined […]
View View   Download Download (PDF)   
Emmanuel Agullo, Berenger Bramas, Olivier Coulaud, Eric Darve, Matthias Messner, Toru Takahashi
High performance FMM is crucial for the numerical simulation of many physical problems. In a previous study, we have shown that task-based FMM provides the flexibility required to process a wide spectrum of particle distributions efficiently on multicore architectures. In this paper, we now show how such an approach can be extended to fully exploit […]
View View   Download Download (PDF)   
Xian Wang, Yanqin Shangguan, Hiromichi KOBAYASHI, Takayuki AOKI, Naoyuki Onodera
Direct numerical simulation (DNS) and large eddy simulation (LES) were performed on the wall-bounded flow at Re_tau = 180 using lattice Boltzmann method (LBM) and multiple Graphic Processing Units (GPUs). In the DNS, 8 K20M GPUs were adopted. The maximum number of meshes is 6.7×10^7, which results in the non-dimensional mesh size of Delta+=1.41 for […]
View View   Download Download (PDF)   
Luis Miguel de la Cruz, Daniel Monsivais
A two-phase (water and oil) flow model in a homogeneous porous media is studied, considering immiscible and incompressible displacement. This model is numerically solved using the Finite Volume Method (FVM) and we compare four numerical schemes for the approximation of fluxes on the faces of the discrete volumes. We describe briefly how to obtain the […]
View View   Download Download (PDF)   
Philippe Helluy, Thomas Strub
In this paper we present three-dimensional numerical simulations of electromagnetic waves. The Maxwell equations are solved by the Discontinuous Galerkin (DG) method. For achieving high performance, we exploit two levels of parallelism. The coarse grain parallelism is managed through MPI and a classical domain decomposition. The fine grain parallelism is managed with OpenCL in order […]
View View   Download Download (PDF)   
Michel Massaro, Philippe Helluy, Vincent Loechner
In this work we compute the MHD equations with divergence cleaning on GPU. The method is based on the finite volume approach and Strang dimensional splitting. The simplicity of the approach makes it a good candidate for a GPU implementation with OpenCL. With adequate memory optimization access, we achieve very high speedups, compared to a […]
View View   Download Download (PDF)   
Amani AlOnazi
The progress of high performance computing platforms is dramatic, and most of the simulations carried out on these platforms, result in improvements on one level, yet exposes shortcomings of the current CFD packages capabilities. Therefore, hardware-aware design and optimizations are crucial towards exploiting the modern computing resources. This thesis proposes optimizations aimed at acceleration numerical […]
View View   Download Download (PDF)   
Page 1 of 712345...Last »

* * *

* * *

Like us on Facebook

HGPU group

149 people like HGPU on Facebook

Follow us on Twitter

HGPU group

1239 peoples are following HGPU @twitter

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: AMD APP SDK 2.9
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 6.0.1, AMD APP SDK 2.9

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors

Contact us: