Moritz Kreutzer, Georg Hager, Gerhard Wellein, Andreas Pieper, Andreas Alvermann, Holger Fehske
The Kernel Polynomial Method (KPM) is a well-established scheme in quantum physics and quantum chemistry to determine the eigenvalue density and spectral properties of large sparse matrices. In this work we demonstrate the high optimization potential and feasibility of peta-scale heterogeneous CPU-GPU implementations of the KPM. At the node level we show that it is […]
Blesson Varghese, Andrew Rau-Chaplin
Aggregate Risk Analysis is both a computationally intensive and a data-intensive problem, thereby making the application of high-performance computing techniques interesting. In this paper, the design and implementation of a parallel Aggregate Risk Analysis algorithm on multi-core CPU and many-core GPU platforms are explored. The efficient computation of key risk measures, including Probable Maximum Loss […]
Aman Bahl, Oliver Baltzer, Andrew Rau-Chaplin, Blesson Varghese
At the heart of the analytical pipeline of a modern quantitative insurance/reinsurance company is a stochastic simulation technique for portfolio risk analysis and pricing process referred to as Aggregate Analysis. Support for the computation of risk measures including Probable Maximum Loss (PML) and the Tail Value at Risk (TVAR) for a variety of types of […]
George Zaki
A variety of multiprocessor architectures has proliferated even for off-the-shelf computing platforms. To make use of these platforms, traditional implementation frameworks focus on implementing Digital Signal Processing (DSP) applications using special platform features to achieve high performance. However, due to the fast evolution of the underlying architectures, solution redevelopment is error prone and re-usability of […]
Jamroz Michal, Kolinski Andrzej
BACKGROUND: The development, optimization and validation of protein modeling methods require efficient tools for structural comparison. Frequently, a large number of models need to be compared with the target native structure. The main reason for the development of Clusco software was to create a high-throughput tool for all-versus-all comparison, because calculating similarity matrix is the […]
Teruyoshi Washizawa, Yasuhiro Nakahara
We investigate the applicability of GPUs to DEM. NVIDIA’s code outperformed the CPU in computation time. However, the contact-force model in NVIDIA’s code is too simple for practical use. We modify this model by replacing it with the practical model. The simulation shows that the practical model obtains the computing speed 6 times […]
Jose M. Dominguez, Alejandro J.C. Crespo, Moncho Gomez-Gesteira
Much of the current focus in high performance computing (HPC) for computational fluid dynamics (CFD) deals with grid based methods. However, parallel implementations for new meshfree particle methods such as Smoothed Particle Hydrodynamics (SPH) are less studied. In this work, we present optimizations for both central processing unit (CPU) and graphics processing unit (GPU) of […]
J.C. Chedjou, K. Kyamakya, U.A. Khan, M.A. Latif
One of the most common approaches to avoid complexity while numerically solving stiff ordinary differential equations (ODEs) is approximating them by ignoring the nonlinear terms. While facing stiff partial differential equations (PDEs) the same is done by avoiding/suppressing the nonlinear terms from the Taylor’s series expansion. By so doing, the traditional methods for solving stiff […]
Jean-Charles Tournier, Vaibhav Donde, Zhao Li
This paper investigates the potential of General Purpose Graphic Processing Unit (GPGPU) for the server and HMI parts of Energy Management System (EMS). The HMI investigation focuses on the applicability and performance improvement of GPGPU for scattered data interpolation algorithms typically used to visually represent the overall state of a power network. The server side investigation […]
K. Morimoto, M. Inui
Large molds with very deep shapes are widely used in producing bumpers and inner panels of automobiles. In order to realize the precise and stable machining of such deep molds, 3-axis milling with inclined cutters is often applied. In this paper, we propose a new algorithm for determining the optimal cutting direction in such inclined […]
M. Kinsner, D. Capson, A. Spence
Imaging systems for computer vision play an important role in today’s world. Typical computer vision systems operate on large scale scenes, where objects are relatively far from the camera and the depth of field in which objects appear focussed is large. Close-range camera systems, on the other hand, typically have a narrow depth of field. […]
T. Gotoh, J. Amagai, T. Hobiger, M. Fujieda, M. Aida
We have developed a new two-way time transfer modem, which adopts a dual pseudo-random noise (DPN) code. The usage of DPN signal dramatically improves time transfer precision without increasing connection fees of commercial communication satellites. In addition, the new modem is based on software-defined radio technology. As the only drawback of such a software implementation […]

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL applications at hgpu.org. We provide 1 minute of compute time per run on two nodes equipped with AMD and nVidia graphics processing units. There are no restrictions on the number of runs.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: AMD APP SDK 2.9
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 6.0.1, AMD APP SDK 2.9

A completed OpenCL project should be uploaded via the User dashboard (see instructions and example there); compilation and execution terminal output logs will be provided to the user.

The information sent to hgpu.org will be treated according to our Privacy Policy.

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors
