Posts
Nov, 7
Parallel hyperbolic PDE simulation on clusters: Cell versus GPU
Increasingly, high-performance computing is looking towards data-parallel computational devices to enhance computational performance. Two technologies that have received significant attention are IBM’s Cell Processor and NVIDIA’s CUDA programming model for graphics processing unit (GPU) computing. In this paper we investigate the acceleration of parallel hyperbolic partial differential equation simulation on structured grids with explicit time […]
Nov, 7
PIConGPU: A Fully Relativistic Particle-in-Cell Code for a GPU Cluster
The particle-in-cell (PIC) algorithm is one of the most widely used algorithms in computational plasma physics. With the advent of graphical processing units (GPUs), large-scale plasma simulations on inexpensive GPU clusters are in reach. We present an implementation of a fully relativistic plasma PIC algorithm for GPUs based on the NVIDIA CUDA library. It supports […]
Nov, 7
Parallel implementation of a spatio-temporal visual saliency model
The human vision has been studied deeply in the past years, and several different models have been proposed to simulate it on computer. Some of these models concerns visual saliency which is potentially very interesting in a lot of applications like robotics, image analysis, compression, video indexing. Unfortunately they are compute intensive with tight real-time […]
Nov, 7
Real-time multi-agent path planning on arbitrary surfaces
Path planning is an active topic in the literature, and efficient navigation over non-planar surfaces is an open research question. In this work we present a novel technique for navigation of multiple agents over arbitrary triangular domains. The proposed solution uses a fast hierarchical computation of geodesic distances over triangular meshes to allow interactive frame […]
Nov, 7
Real-time path-based surface detail
We present a GPU algorithm to render path-based 3D surface detail in real-time. Our method models these features using a vector representation that is efficiently stored in two textures. First texture is used to specify the position of the features, while the second texture contains their paths, profiles and material information. A fragment shader is […]
Nov, 7
Performance Analysis of General-Purpose Computation on Commodity Graphics Hardware: A Case Study Using Bioinformatics
Using modern graphics processing units for no-graphics high performance computing is motivated by their enhanced programmability, attractive cost/performance ratio and incredible growth in speed. Although the pipeline of a modern graphics processing unit (GPU) permits high throughput and more concurrency, they bring more complexities in analyzing the performance of GPU-based applications. In this paper, we […]
Nov, 7
Implementation of a High Throughput Soft MIMO Detector on GPU
Multiple-input multiple-output (MIMO) significantly increases the throughput of a communication system by employing multiple antennas at the transmitter and the receiver. To extract maximum performance from a MIMO system, a computationally intensive search based detector is needed. To meet the challenge of MIMO detection, typical suboptimal MIMO detectors are ASIC or FPGA designs. We aim […]
Nov, 7
GPU Acceleration for General Conservation Equations and its Application to several Engineering Problems
Presented is a general method for conservation equations called SHLL (split HLL) applied using Graphics Processing Unit (GPU) acceleration. The SHLL method is a purely vector-split approximation of the classical HLL method [Harten, Lax and van Leer, 1983] which assumes the presence of local wave propagation in the algabraic derivation of fluxes across cell surfaces. […]
Nov, 7
GPU-based physical cut in interactive haptic simulations
PURPOSE: Interactive, physics based, simulations of deformable bodies are a growing research area with possible applications to computer-aided surgery. Their aim is to create virtual environments where surgeons are free to practice. To ensure the needed realism, the simulations must be performed with deformable bodies. The goal of this paper is to describe the approach […]
Nov, 7
GPU-based Acceleration of System-level Design Tasks
Many system-level design tasks (e.g., high-level timing analysis, hardware/software partitioning and design space exploration) involve computational kernels that are intractable (usually NP-hard). As a result, they involve high running times even for mid-sized problems. In this paper we explore the possibility of using commodity graphics processing units (GPUs) to accelerate such tasks that commonly arise […]
Nov, 7
Evolution of thread-level parallelism in desktop applications
As the effective limits of frequency and instruction level parallelism have been reached, the strategy of microprocessor vendors has changed to increase the number of processing cores on a single chip each generation. The implicit expectation is that software developers will write their applications with concurrency in mind to take advantage of this sudden change […]
Nov, 7
GPU-accelerated molecular dynamics simulation for study of liquid crystalline flows
We have developed a GPU-based molecular dynamics simulation for the study of flows of fluids with anisotropic molecules such as liquid crystals. An application of the simulation to the study of macroscopic flow (backflow) generation by molecular reorientation in a nematic liquid crystal under the application of an electric field is presented. The computations of […]