Posts
Dec, 4
FusionSim: Characterizing the Performance Benefits of Fused CPU/GPU Systems
We present FusionSim, a modeling framework capable of cycle-accurate simulation of a complete x86-based computer system with (a) a CPU and a GPU on the same die, and (b) a CPU and a GPU connected as separate components. We use FusionSim to characterize the performance of the Rodinia benchmarks on fused and discrete systems. We […]
Dec, 4
A MPI back-end for the OpenACC accULL. Exploiting OpenACC semantics in Message Passing Clusters
The arrival of hardware accelerators on the HPC scene has made unprecedented performance available to developers. However, even expert developers may not be ready to exploit the complex hierarchies of these new heterogeneous systems. We need to find a way to leverage the programming effort in these architectures at the programming language level; otherwise, developers will […]
Dec, 4
Molecular dynamics for long-range interacting systems on Graphic Processing Units
We present implementations of a fourth-order symplectic integrator on graphics processing units for three $N$-body models with long-range interactions of general interest: the Hamiltonian Mean Field, Ring and two-dimensional self-gravitating models. We discuss the algorithms, speedups and errors using one and two GPUs. Speedups can be as high as 140 compared to a serial […]
Dec, 3
GPU-Based Implementation of JPEG2000 Encoder
JPEG2000 has become one of the most rewarding image coding standards. It provides a practical set of features which weren’t necessarily available in the previous standards. The features were realized as a result of two new techniques, namely the Discrete Wavelet Transform (DWT), and Embedded Block Coding with Optimized Truncation (EBCOT). The complexity of EBCOT […]
Dec, 3
Hybrid Sample-based Surface Rendering
The performance of rasterization-based rendering on current GPUs strongly depends on the abilities to avoid overdraw and to prevent rendering triangles smaller than the pixel size. Otherwise, the rates at which high-resolution polygon models can be displayed are affected significantly. Instead of trying to build these abilities into the rasterization-based rendering pipeline, we propose an […]
Dec, 3
Simulations of Complex and Microscopic Models of Cardiac Electrophysiology Powered by Multi-GPU Platforms
Key aspects of cardiac electrophysiology, such as slow conduction, conduction block, and saltatory effects have been the research topic of many studies since they are strongly related to cardiac arrhythmia, reentry, fibrillation, or defibrillation. However, to reproduce these phenomena the numerical models need to use subcellular discretization for the solution of the PDEs and nonuniform, […]
Dec, 3
GPU-based Space Situational Awareness Simulation utilising parallelism for enhanced multi-sensor management
As a result of continual space activity since the 1950s, there are now a large number of man-made Resident Space Objects (RSOs) orbiting the Earth. Because of the large number of items and their relative speeds, the possibility of destructive collisions involving important space assets is now of significant concern to users and operators of […]
Dec, 3
Non-steady relaxation and critical exponents at the depinning transition
We study the non-steady relaxation of a driven one-dimensional elastic interface at the depinning transition by extensive numerical simulations concurrently implemented on graphics processing units (GPUs). We compute the time-dependent velocity and roughness as the interface relaxes from a flat initial configuration at the thermodynamic random-manifold critical force. Above a first, non-universal microscopic time-regime, we […]
Dec, 2
Parallelism in Database Operations
Developments in memory and hard disk bandwidth and latency have made databases CPU-bound. Recent studies have shown that this bottleneck can be eased with parallelism. We give a survey of the methodologies that can be used to implement this kind of parallelism. Mainly, there are two kinds of parallel disciplines that have been […]
Dec, 2
Efficient Hybrid Execution of C++ Applications using Intel(R) Xeon Phi(TM) Coprocessor
The introduction of Intel(R) Xeon Phi(TM) coprocessors opened up new possibilities in the development of highly parallel applications. The familiarity and flexibility of the architecture, together with compiler support integrated into the Intel C++ Composer XE, allow developers to use familiar programming paradigms and techniques, which are usually not suitable for other accelerated systems. It […]
Dec, 2
FIELA: A Fast Image Encryption with Lorenz Attractor using Hybrid Computing
In the past few years, the transmission of digital images across the world has increased. Images that are not intended for everyone, such as military images and personal photos, are among those transmitted. It is essential to secure these images from unauthorized access and modifications. Chaos theory is a scientific discipline that deals with non-linear dynamical systems that […]
Dec, 2
Numerical resolution of conservation laws with OpenCL
We present several numerical simulations of conservation laws on recent multicore processors, such as GPUs, using the OpenCL programming framework. Depending on the chosen numerical method, different implementation strategies have to be considered to achieve the best performance. We explain how to program three methods efficiently: a finite volume approach on a structured grid, a […]