Posts
Oct, 14
Real-Time Use of GPUs in NA62 Experiment
We describe a pilot project for the use of GPUs in a real-time triggering application in the early trigger stages at the CERN NA62 experiment, and the results of the first field tests together with a prototype data acquisition (DAQ) system. This pilot project within NA62 aims at integrating GPUs into the central L0 trigger […]
Oct, 14
Exploiting Data Parallelism in GPUs
Mainstream microprocessor design no longer delivers performance boosts by increasing the processor clock frequency due to power and thermal constraints. Nonetheless, advances in semiconductor fabrication still allow the transistor density to increase at the rate of Moore’s law. This has resulted in the proliferation of many-core parallel architectures and accelerators, among which GPUs (graphics processing […]
Oct, 14
Lattice Group Models: GPU Acceleration and Numerics
Lattice group models (LGpM) are kinetic models on integer lattices derived from the automorphism group of the lattice. In the last decades it was too expensive to simulate large systems (100 – 1000 velocities in a 2D or 3D model), with complex physical two or three dimensional domains, on normal computers or clusters within an […]
Oct, 14
Optimization Solutions for Improving the Performance of the Parallel Reduction Algorithm Using Graphics Processing Units
In this paper, we research, analyze and develop optimization solutions for the parallel reduction function using graphics processing units (GPUs) that implement the Compute Unified Device Architecture (CUDA), a modern and novel approach for improving the software performance of data processing applications and algorithms. Many of these applications and algorithms make use of the reduction […]
Oct, 13
Scalable GPU Acceleration of B-Spline Signal Processing Operations
B-Splines are a useful tool in signal processing, and are widely used in the analysis of two and three-dimensional images. B-Splines provide a continuous representation of the signal, image, or volume, which is useful for interpolation, resampling, noise removal, and differentiation – all important steps in many signal processing algorithms. These splines are defined entirely […]
Oct, 13
Mean shift for graph bundling
We present a fast and simple adaption of the well-known mean shift technique for image segmentation to compute bundled layouts of general graphs. For this, we first transform a given graph drawing into a density map using kernel density estimation. Next, we apply the equivalent of mean shift segmentation on this image, i.e. sharpen the […]
Oct, 13
An implicit Tensor-Mass solver on the GPU for soft bodies simulation
The realistic and interactive simulation of deformable objects has become a challenge in Computer Graphics. In this paper, we propose a GPU implementation of the resolution of the mechanical equations, using a semi-implicit as well as an implicit integration scheme. At the contrary of the classical FEM approach, forces are directly computed at each node […]
Oct, 13
AeminiumGPU: An Intelligent Framework for GPU Programming
As a consequence of the immense computational power avail-able in GPUs, the usage of these platforms for running data-intensive general purpose programs has been increasing. Since memory and pro-cessor architectures of CPUs and GPUs are substantially different, pro-grams designed for each platform are also very different and often resort to a very distinct set of […]
Oct, 13
Fast Parallel Implementation of Fractional Packing and Covering Linear Programs
We present a parallel implementation of the randomized (1 + e)-approximation algorithm for packing and covering linear programs presented by Koufogiannakis and Young [4]. In order to make the algorithm more parallelizable we also implemented a deterministic version of the algorithm, i.e. instead of updating a single random entry at each iteration we updated deterministically […]
Oct, 13
.NET High Performance Computing
Graphics Processing Units (GPUs) have been extensively applied in the High Performance Computing (HPC) community. HPC applications require additional special programming environments to improve the utilization of GPUs, for example, NVIDIA’s CUDA and Khronos group’s OpenCL. This thesis will introduce a preprocessor framework called HPC.NET, which is deployed on the Microsoft .NET platform to meet […]
Oct, 13
FPGA-GPU-CPU Heterogenous Architecture for Real-time Cardiac Physiological Optical Mapping
Real-time optical mapping technology is a technique that can be used in cardiac disease study and treatment technology development to obtain accurate and comprehensive electrical activity over the entire heart. It provides a dense spatial electrophysiology. Each pixel essentially plays the role of a probe on that location of the heart. However, the high throughput […]
Oct, 13
Parallel H-Tree Based Data Cubing on Graphics Processors
Graphics processing units (GPUs) have an SIMD architecture and have been widely used recently as powerful general-purpose co-processors for the CPU. In this paper, we investigate efficient GPU-based data cubing because the most frequent operation in data cube computation is aggregation, which is an expensive operation well suited for SIMD parallel processors. H-tree is a […]