7409

Posts

Mar, 21

Parallel Two-Stage Least Squares algorithms for Simultaneous Equations Models on GPU

Today it is usual to have computational systems formed by a multicore together with one or more GPUs. These systems are heterogeneous, due to the di erent types of memory in the GPUs and to the di erent speeds of computation of the cores in the CPU and the GPU. To accelerate the solution of […]
Mar, 21

Fast Antenna Characterization Using the Sources Reconstruction Method on Graphics Processors

The Sources Reconstruction Method (SRM) is a non-invasive technique for, among other applications, antenna characterization. The SRM is based on obtaining a distribution of equivalent currents that radiate the same field as the antenna under test. The computation of these currents requires solving a linear system, usually ill-posed, that may be very computationally demanding for […]
Mar, 21

CPU-GPU Hybrid Parallel Binomial American Option Pricing

We present in this paper a novel parallel binomial algorithm that computes the price of an American option. The algorithm partitions a binomial tree constructed for the pricing into blocks of multiple levels of nodes, and assigns each such block to multiple processors. Each of the processors then computes the option’s values at its assigned […]
Mar, 21

Shallow water simulations on multiple GPUs

We present a state-of-the-art shallow water simulator running on multiple GPUs. Our implementation is based on an explicit high-resolution finite volume scheme suitable for modeling dam breaks and flooding. We use row domain decomposition to enable multi-GPU computations, and perform traditional CUDA block decomposition within each GPU for further parallelism. Our implementation shows near perfect […]
Mar, 20

Multicore and GPU Programming Models, Languages and Compilers Workshop, PLC 2012

Co-located with 26th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2012). This workshop provides a forum for the presentation of research on all aspects of GPU and multicore processors programming models, compiler optimizations, language extensions, and software tools for GPU and Multicore processor platforms. Areas of interest include but are not limited to the […]
Mar, 20

G-Node Workshop on Neuronal GPU Computing

Graphics processing units (GPUs) offer a low-cost approach to parallel high-performance computing. Neuronal simulations can be parallelized efficiently and are particularly well suited for implementation on GPUs. There is also great potential for GPU-based high-throughput analysis of neuronal data. The field is progressing at rapid pace, and has reached a point where it may strongly […]
Mar, 19

Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

The CUDA model for graphics processing units (GPUs) presents the programmer with a plethora of different programming options. These includes different memory types, different memory access methods and different data types. Identifying which options to use and when is a non-trivial exercise. This paper explores the effect of these different options on the performance of […]
Mar, 19

Spatial Join with R-Tree on Graphics Processing Units

Spatial operations such as spatial join combine two objects on spatial predicates. It is different from relational join because objects have multi dimensions and spatial join consumes large execution time. Recently, many researches tried to find methods to improve the execution time. Parallel spatial join is one method to improve the execution time. Comparison between […]
Mar, 19

Analysis of the Performance of the Fish School Search Algorithm Running in Graphic Processing Units

Fish School Search (FSS) is a computational intelligence technique invented by Bastos-Filho and Lima-Neto in 2007 and first presented in Bastos-Filho et al. (2008). FSS was conceived to solve search problems and it is based on the social behavior of schools of fish. In the FSS algorithm, the search space is bounded and each possible […]
Mar, 19

GPU Enhanced Simulation of Angiogenesis

In the paper we present the use of graphic processor units to accelerate the most time-consuming stages of a simulation of angiogenesis and tumor growth. By the use of advanced CUDA mechanisms such as shared memory, textures and atomic operations, we managed to speed up the CUDA kernels by a factor of 57x. However, in […]
Mar, 19

Parallelization of Particle Filter Algorithms

This paper presents the parallelization of the particle filter algorithm in a single target video tracking application. In this document we demonstrate the process by which we parallelized the particle filter algorithm, beginning with a MATLAB implementation. The final CUDA program provided approximately 71x speedup over the initial MATLAB implementation.
Mar, 19

Computational Intelligence on Consumer Games and Graphics Hardware CIGPU-2012

The fifth International workshop and tutorial on Computational Intelligence on Consumer Games and Graphics Hardware (CIGPU 2012) will be held as a hybrid special session of the IEEE WCCI 2012 conference in Brisbane, 10-15 June 2012. WCCI 2012, the IEEE world congress on computational intelligence, joins together three international conferences: IJCNN 2012, FUZZ-IEEE 2012 and […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: