Posts
Dec, 16
Finding the Force – Consistent Particle Seeding for Satellite Aerodynamics
When calculating satellite trajectories in low Earth orbit, engineers need to adequately estimate aerodynamic forces. But to this day, obtaining the drag acting on the complicated shapes of modern spacecraft suffers from many sources of error. While part of the problem is the uncertain density in the upper atmosphere, this work focuses on improving the modeling […]
Dec, 15
A GPU Implementation of Dynamic Programming for the Optimal Polygon Triangulation
This paper presents a GPU (Graphics Processing Unit) implementation of dynamic programming for the optimal polygon triangulation. GPUs can now be used for general-purpose parallel computation. Users can develop parallel programs running on GPUs using a programming architecture called CUDA (Compute Unified Device Architecture) provided by NVIDIA. The optimal polygon triangulation problem for a convex […]
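As a point of reference for the dynamic programming mentioned above, here is a minimal sequential sketch in Python. It is not the paper's GPU implementation. It assumes a common formulation in which every chord of a convex n-gon carries a weight and the goal is to minimise the total weight of the chords used; the truncated abstract does not state the cost model. The function name `min_triangulation_weight` and the `weight` callback are hypothetical.

```python
def min_triangulation_weight(n, weight):
    """Minimum total chord weight over all triangulations of a convex n-gon.

    Vertices are numbered 0..n-1 around the polygon. weight(i, j) is a
    caller-supplied (hypothetical) function giving the weight of chord (i, j)
    for i < j. Polygon sides cost nothing.
    """
    if n < 3:
        raise ValueError("a polygon needs at least 3 vertices")

    def chord(i, j):
        # Adjacent vertices form a polygon side, not a chord.
        return 0 if j - i == 1 else weight(i, j)

    # best[i][j]: cheapest set of chords triangulating the sub-polygon
    # i, i+1, ..., j, not counting the segment (i, j) itself.
    best = [[0] * n for _ in range(n)]

    # Fill the table by increasing sub-polygon size; entries with the same
    # span do not depend on each other.
    for span in range(2, n):
        for i in range(n - span):
            j = i + span
            best[i][j] = min(
                best[i][k] + best[k][j] + chord(i, k) + chord(k, j)
                for k in range(i + 1, j)
            )
    return best[0][n - 1]
```

For a square whose two diagonals weigh 5 and 3, the function returns 3, the weight of the cheaper diagonal. Entries with the same span are independent of one another, which is the kind of structure a parallel implementation can exploit.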
Dec, 15
A Distributed Approximation Algorithm for Mixed Packing-Covering Linear Programs
Mixed packing-covering linear programs capture a simple but expressive subclass of linear programs. They commonly arise as linear programming relaxations of a number of important combinatorial problems, including various network design and generalized matching problems. In this paper, we propose an efficient distributed approximation algorithm for solving mixed packing-covering problems which requires a poly-logarithmic number of […]
Dec, 15
A Parallel Method for Impulsive Image Noise Removal on Hybrid CPU/GPU Systems
A parallel algorithm for image noise removal is proposed. The algorithm is based on the peer group concept and uses a fuzzy metric. An optimization study on the use of the CUDA platform to remove impulsive noise using this algorithm is presented. Moreover, an implementation of the algorithm on multi-core platforms using OpenMP is presented. Performance […]
Dec, 15
Real time Multi-GPU-based Event Detection in High Definition Videos
Video processing algorithms are a very important tool for many applications in the computer vision domain, such as motion tracking, video indexing, robot navigation and event detection. However, with the new video standards, especially high definition, current implementations, even running on modern hardware, no longer meet the needs of real-time processing. In […]
Dec, 15
OpenCL-Accelerated Computation of a 3D SPECT Projection Operator for the Content Adaptive Mesh Model
In this manuscript, we present a preliminary evaluation of a fully 3D projection operator calculation aimed at emission tomography on a non-circular orbit. The proposed methodology uses the content-adaptive mesh model (CAMM) for volumetric data representation. The CAMM is an efficient data representation based on adaptive non-uniform sampling and linear interpolation. The presented projection operator […]
Dec, 13
Data Transfer Matters for GPU Computing
Graphics processing units (GPUs) are many-core compute devices to which massively parallel compute threads are offloaded from CPUs. This heterogeneous nature of GPU computing raises non-trivial data transfer problems, especially for latency-critical real-time systems. However, even the basic characteristics of data transfers associated with GPU computing are not well studied in the literature. In this paper, […]
Dec, 13
GPU hardware acceleration for industrial applications: using computation to push beyond physical limitations
This thesis explores the possibility of utilizing Graphics Processing Units (GPUs) to address the computational demand of algorithms used to mitigate the inherent physical limitations in devices such as microscopes and 3D-scanners. We investigate the outcome and test our methodology for the following case studies: – the narrow field of view found in microscopes. – […]
Dec, 13
All-pairs Shortest Path Algorithm based on MPI+CUDA Distributed Parallel Programming Model
Computing shortest paths in a graph is a complex and time-consuming process, and traditional algorithms that rely solely on the CPU as the computing unit cannot meet the demands of real-time processing. In this paper, we present an all-pairs shortest paths algorithm using the MPI+CUDA hybrid programming model, which can […]
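To make the all-pairs shortest paths problem concrete, here is a minimal sequential sketch in Python using the Floyd-Warshall algorithm. This is a standard textbook method chosen for illustration. The truncated abstract does not say which algorithm the authors parallelise, and nothing here reflects their MPI+CUDA design. The function name `floyd_warshall` and the adjacency-matrix input format are assumptions.

```python
import math


def floyd_warshall(dist):
    """Return the matrix of shortest path lengths between all vertex pairs.

    dist is an n x n adjacency matrix (assumed input format): dist[i][j] is
    the weight of edge i -> j, math.inf if there is no edge, and 0 on the
    diagonal. Negative cycles are assumed absent. The input is not modified.
    """
    n = len(dist)
    d = [row[:] for row in dist]  # work on a copy

    # After iteration k, d[i][j] is the shortest i -> j path that uses only
    # intermediate vertices from {0, ..., k}.
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via_k = d[i][k] + d[k][j]
                if via_k < d[i][j]:
                    d[i][j] = via_k
    return d
```

For fixed k, the updates over all (i, j) pairs are independent of one another, which is why this algorithm is often used as a starting point for parallel versions.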
Dec, 13
TuCCompi: A Multi-Layer Programming Model for Heterogeneous Systems with Auto-Tuning Capabilities
During the last decade, parallel processor architectures have become a powerful tool to deal with massively-parallel problems that require High Performance Computing (HPC). The latest trend in HPC is the use of heterogeneous environments that combine different computational power units, such as CPU-cores and GPUs. Performance maximization of any GPU parallel implementation of an algorithm […]
Dec, 13
Augur: a Modeling Language for Data-Parallel Probabilistic Inference
It is time-consuming and error-prone to implement inference procedures for each new probabilistic model. Probabilistic programming addresses this problem by allowing a user to specify the model and having a compiler automatically generate an inference procedure for it. For this approach to be practical, it is important to generate inference code that has reasonable performance. […]
Dec, 12
GPU Based Dose Calculation
The goal of this dissertation was to parallelize a dose calculation code for radiotherapy cancer treatment and explore the suitability of the new Intel Xeon Phi technology for such a task. The source code proved to have many bugs and as such it took a long time to be able to produce consistent results. Thus, the […]