Posts
Aug, 7
CuBA – a CUDA implementation of BAMPS
Using CUDA as programming language, we create a code named CuBA which is based on the CPU code "Boltzmann Approach for Many Parton Scattering (BAMPS)" developed in Frankfurt in order to study a system of many colliding particles resulting from heavy ion collisions. Furthermore, we benchmark our code with the Riemann Problem and compare the […]
Aug, 7
Swarm-NG: a CUDA Library for Parallel n-body Integrations with focus on Simulations of Planetary Systems
We present Swarm-NG, a C++ library for the efficient direct integration of many n-body systems using highly-parallel Graphics Processing Unit (GPU), such as NVIDIA’s Tesla T10 and M2070 GPUs. While previous studies have demonstrated the benefit of GPUs for n-body simulations with thousands to millions of bodies, Swarm-NG focuses on many few-body systems, e.g., thousands […]
Aug, 6
Accelerating Cryptographic Primitives with GPUs
In this paper, we review the current state-of-the-art in accelerating cryptographic and other computer-security-related primitives using graphics processing units and provide a critical analysis of the appropriateness of graphics accelerators to this task. Generalpurpose programming of graphics processing units (GPGPUs) has garnered much attention recently in the high-performance computing community, as it offers orders-of-magnitude performance […]
Aug, 6
Coordinated system level resource management for heterogeneous many-core platforms
A challenge posed by future computer architectures is the efficient exploitation of their many and sometimes heterogeneous computational cores. This challenge is exacerbated by the multiple facilities for data movement and sharing across cores resident on such platforms. To answer the question of how systems software should treat heterogeneous resources, this dissertation describes an approach […]
Aug, 6
Comparison of OpenMP & OpenCL Parallel Processing Technologies
This paper presents a comparison of OpenMP and OpenCL based on the parallel implementation of algorithms from various fields of computer applications. The focus of our study is on the performance of benchmark comparing OpenMP and OpenCL. We observed that OpenCL programming model is a good option for mapping threads on different processing cores. Balancing […]
Aug, 6
Visualization of Astronomical Nebulae via Distributed Multi-GPU Compressed Sensing Tomography
The 3D visualization of astronomical nebulae is a challenging problem since only a single 2D projection is observable from our fixed vantage point on Earth. We attempt to generate plausible and realistic looking volumetric visualizations via a tomographic approach that exploits the spherical or axial symmetry prevalent in some relevant types of nebulae. Different types […]
Aug, 6
Improving GPU Sparse Matrix-Vector Multiplication for Probabilistic Model Checking
We present several methods to improve the run times of probabilistic model checking on general-purpose graphics processing units (GPUs). The methods enhance sparse matrix-vector multiplications, which are in the core of the probabilistic model checking algorithms. The improvement is based on the analysis of the transition matrix structures corresponding to state spaces of a selection […]
Aug, 6
On Parallel Software Verification using Boolean Equation Systems
Multi- and many-core hardware platforms are today widely accessible and used to significantly accelerate many computationally demanding tasks. In this paper we describe a parallel approach to solve Boolean Equation Systems (BESs) in the context of model checking. We focus on the applicability of state-of-the-art, shared-memory parallel hardware – multi-core CPUs and many-core GPUs – […]
Aug, 5
Algorithms for the mapping of genome sequences in GPGPU
This project focuses on using GPGPUs for solving the inexact alignment of short-reads with respect to a reference indexed using the Burrows-Wheeler Transform. To be more specific we dealt with a solution of an alignment that allows up to one error.
Aug, 5
GPU and CPU Cooperative Accelaration for Face Detection on Modern Processors
Along with the inclusion of GPU cores within the same CPU die, the performance of Intel’s processor-graphics has been significantly improved over earlier generation of integrated graphics. The need to efficiently harness the computational power of the GPU in the same CPU die is more than ever. This paper presents a highly optimized Haar-based face […]
Aug, 5
Clustering Based Search Algorithm For Motion Estimation
Motion estimation is the key part of video compression since it removes the temporal redundancy within frames and significantly affects the encoding quality and efficiency. In this paper, a novel fast motion estimation algorithm named Clustering Based Search algorithm is proposed, which is the first to define the clustering feature of motion vectors in a […]
Aug, 5
Modeling of Heat Diffusion Through Isotropic Media Using Graphical Processing Units
With accurate simulation of ever-more complex phenomena desired, numerical models are getting increasingly sophisticated and thus take considerable amount of time to run on normal computers. Parallel computing has emerged as an important paradigm in response, allowing engineers to run programs faster. In recent years graphics processing units (GPU) are among the massively parallel devices […]

