12876
Andrea Arteaga, Daniel Ruprecht, Rolf Krause
In view of the rapid rise of the number of cores in modern supercomputers, time-parallel methods that introduce concurrency along the temporal axis are becoming increasingly popular. For the solution of time-dependent partial differential equations, these methods can add another direction for concurrency on top of spatial parallelization. The paper presents an implementation of the […]
View View   Download Download (PDF)   
Havard Heitlo Holm
As parallel and heterogeneous computing becomes more and more a necessity for implementing high performance simulators, it becomes increasingly harder for scientists and engineers without experience in high performance computing to achieve good performance. Even for those who knows how to write efficient code the process for doing so is time consuming and error prone, […]
Alexander Ayriyan, Jan Busa Jr., Eugeny E. Donets, Hovik Grigorian, Jan Pribis
A model of a multilayer device with non-trivial geometrical and material structure and its working process is suggested. The thermal behavior of the device as one principle characteristic is simulated. The algorithm for solving the non-stationary heat conduction problem with a time-dependent periodical heating source is suggested. The algorithm is based on finite difference explicit–implicit […]
View View   Download Download (PDF)   
Moritz Schmid, Oliver Reiche, Christian Schmitt, Frank Hannig, Jurgen Teich
Multiresolution Analysis (MRA) is a mathematical method that is based on working on a problem at different scales. One of its applications is medical imaging where processing at multiple scales, based on the concept of Gaussian and Laplacian image pyramids, is a well-known technique. It is often applied to reduce noise while preserving image detail […]
View View   Download Download (PDF)   
Istvan Zoltan Reguly
The last decade saw the long tradition of frequency scaling of processing units grind to a halt, and efforts were re-focused on maintaining computational growth by other means; such as increased parallelism, deep memory hierarchies and complex execution logic. After a long period of "boring productivity", a host of new architectures, accelerators, programming languages and […]
View View   Download Download (PDF)   
Fabio Luporini, Ana Lucia Varbanescu, Florian Rathgeber, Gheorghe-Teodor Bercea, J. Ramanujam, David A. Ham, Paul H.J. Kelly
The numerical solution of partial differential equations using the finite element method is one of the key applications of high performance computing. Local assembly is its characteristic operation. This entails the execution of a problem-specific kernel to numerically evaluate an integral for each element in the discretized problem domain. Since the domain size can be […]
View View   Download Download (PDF)   
Harald Koestler, Christian Schmitt, Sebastian Kuckuk, Frank Hannig, Juergen Teich, Ulrich Ruede
Many problems in computational science and engineering involve partial differential equations and thus require the numerical solution of large, sparse (non)linear systems of equations. Multigrid is known to be one of the most efficient methods for this purpose. However, the concrete multigrid algorithm and its implementation highly depend on the underlying problem and hardware. Therefore, […]
View View   Download Download (PDF)   
P. Yang
This tutorial is written for beginners as an introduction to basic wave propagation using nite di erence method, from acoustic and elastic wave modeling, to reverse time migration and full waveform inversion. Most of the theoretical delineations summarized in this tutorial have been implemented in Madagascar with Matlab, C and CUDA programming, which will bene t readers’ […]
View View   Download Download (PDF)   
Steffen Muthing, Dirk Ribbrock, Dominik Goddeke
A major challenge in PDE software is the balance between user-level flexibility and performance on heterogeneous hardware. We discuss our ideas on how this challenge can be tackled, exemplarily for the DUNE framework and in particular its linear algebra and solver components. We demonstrate how the former MPI-only implementation is modified to support MPI+[CPU/GPU] threading […]
View View   Download Download (PDF)   
Matthias Bartelt, Michael Gross
This paper deals with a Galerkin-based multi-scale time integration of a viscoelastic rope model. Using Hamilton’s dynamical formulation, Newton’s equation of motion as a second-order partial differential equation is transformed into two coupled first order partial differential equations in time. The considered finite viscoelastic deformations are described by means of a deformation-like internal variable determined […]
View View   Download Download (PDF)   
Zheng Xia
Scientific computation is the field of study that uses computers to implement mathematical models of physical phenomena such as FEM in deformation measurement in virtual reality. Scientific and engineering problems that would be almost impossible to solve by hand whereas on a computer, it can be handled properly. A numerical algorithm calculating for different fields […]
View View   Download Download (PDF)   
R. Pethiyagoda, S. W. McCue, T. J. Moroney, J. M. Back
The nonlinear problem of steady free-surface flow past a submerged source is considered as a case study for three-dimensional ship wave problems. Of particular interest is the distinctive wedge-shaped wave pattern that forms on the surface of the fluid. By reformulating the governing equations with a standard boundary-integral method, we derive a system of nonlinear […]
View View   Download Download (PDF)   
Page 1 of 1012345...10...Last »

* * *

* * *

Like us on Facebook

HGPU group

172 people like HGPU on Facebook

Follow us on Twitter

HGPU group

1283 peoples are following HGPU @twitter

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: AMD APP SDK 2.9
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 6.0.1, AMD APP SDK 2.9

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors

Contact us: