8886

Posts

Jan, 17

The 20th IEEE International Conference on High Performance Computing, HiPC 2013

Authors are invited to submit original unpublished research manuscripts that demonstrate current research in all areas of high performance computing including design and analysis of parallel and distributed systems, embedded systems, and their applications in scientific, engineering, and commercial areas. Topics of interest include but are not limited to: * High-Performance Computing * Parallel and […]
Jan, 17

2013 IEEE High Performance Extreme Computing Conference, HPEC ’13

The HPEC charter is to be the premier conference in the world on the confluence of HPC and Embedded Computing. The High Performance Extreme Computing (HPEC) technical committee announces the Seventeenth Annual HPEC conference to be held 24–26 September 2013 at the Westin Hotel in Waltham, Massachusetts. The technical committee seeks new presentations that clearly […]
Jan, 17

Performance Analysis and Tuning For: General-Purpose Graphics Processing Units (GPGPU)

General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a […]
Jan, 17

Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs

GPUs employ thousands of threads per core to achieve high throughput. These threads exhibit localities in control-flow, instruction and data addresses and values. In this study we investigate inter-warp instruction temporal locality and show that during short intervals a significant share of fetched instructions are fetched unnecessarily. This observation provides several opportunities to enhance GPUs. […]
Jan, 17

A GPU Based Implementation of Side Effect Analysis

In this paper, we discuss a novel approach of improving the performance and accuracy of parallelization compilers by utilizing massively parallel processing power of GPGPU (General Purpose Graphical Processing Units). With the advent of multi core processors, it has become necessary to use parallel programming methodologies. However, parallel programmers need tools that can provide support […]
Jan, 17

Application of GPU Smooth Particle Hydrodynamics: Wave Runup and Overtopping on Composite Slopes

Smooth Particle Hydrodynamics is a Lagrangian meshless numerical method with substantially improved capabilities in simulation of both fluid dynamics and solid mechanics due to its meshless nature. GPUSPH is an implementation of Smoothed Particle Hydrodynamics (SPH) on Nvidia CUDA-enabled (graphics) cards. In this paper the GPUSPH is applied to runup and overtopping applications and compared […]
Jan, 17

Bouncing Behavior of Microscopic Dust Aggregates

CONTEXT: Bouncing collisions of dust aggregates within the protoplanetary may have a significant impact on the growth process of planetesimals. Yet, the conditions that result in bouncing are not very well understood. Existing simulations studying the bouncing behavior used aggregates with an artificial, very regular internal structure. Aims: Here, we study the bouncing behavior of […]
Jan, 16

Burrows-Wheeler Aligner: A Parallel Approach

The advent of mainframe computing brought about a fundamentally different way of approaching problems for many branches of science. But none has transformed quite like the science of biology. With genome sequencing now commonplace, an organism may be completely represented as a sequence of numbers. Harnessing the power of computers, sequences of genomes (and thus […]
Jan, 16

A parallel implementation of a derivative pricing model incorporating SABR calibration and probability lookup tables

We describe a high performance parallel implementation of a derivative pricing model, within which we introduce a new parallel method for the calibration of the industry standard SABR (stochastic-alpha beta rho) stochastic volatility model using three strike inputs. SABR calibration involves a non-linear three dimensional minimisation and parallelisation is achieved by incorporating several assumptions unique […]
Jan, 16

Efficient implementation of multiuser precoding algorithms on GPU for MIMO-OFDM systems

In this paper, we focus on the signal precoding stage in multiuser multicarrier systems, which can be often a computationally expensive task. In order to reduce their computational time, the implementation of some of the most employed multiuser precoding algorithms on a general purpose Graphic Processing Unit (GPU) is presented. These devices allow for a […]
Jan, 16

On the Use of Graphic Processing Units for the Efficient Implementation of MIMO Detectors

The use of Graphic Processing Units (GPU) for the efficient implementation of signal processing algorithms for MIMO communication systems is receiving incremental attention recently. This is mainly due to their high capability of parallel processing together with their reasonable cost. In this work, the interest of GPU for the rapid prototyping of MIMO receivers is […]
Jan, 16

Analytic Visibility on the GPU

This paper presents a parallel, implementation-friendly analytic visibility method for triangular meshes. Together with an analytic filter convolution, it allows for a fully analytic solution to anti-aliased 3D mesh rendering on parallel hardware. Building on recent works in computational geometry, we present a new edge-triangle intersection algorithm and a novel method to complete the boundaries […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: