Posts
Oct, 15
Massively Parallel Lossless Compression of Medical Images Using Least-Squares Prediction and Arithmetic Coding
Medical imaging in hospitals requires fast and efficient image compression to support the clinical work flow and to save costs. Leastsquares autoregressive pixel prediction methods combined with arithmetic coding constitutes the state of the art in lossless image compression. However, a high computational complexity of both prevents the application of respective CPU implementations in practice. […]
Oct, 15
High-Performance GPGPU Programming with OCaml
We present an OCaml GPGPU library with a DSL embedded into OCaml to express GPGPU kernels. The level of performance achieved is measured through different examples. We also discuss the use of GPGPU programming to increase the performance of multicore-CPUs software, written in OCaml.
Oct, 15
Uses of GPU Powered Interval Optimization for Parameter Identification in the Context of SO Fuel Cells
In this paper, we discuss parameter identification for models based on ordinary differential equations in the context of solid oxide fuel cells. In this case, verified methods (e.g. interval analysis), which provide a guarantee of correctness for the computed result, can be of great help for dealing with the appearing uncertainty and for devising accurate […]
Oct, 13
GPU-Specfic Kalman Filtering and Retrodiction for Large-Scale Target Tracking
In the field of Tracking and Data Fusion most, if not all, computations executed by a computer are carried out serially. The sole part of the process that is not entirely serial is the collection of data from multiple sensors, which can be executed in parallel. However, once the data is to be filtered the […]
Oct, 13
Power Control for GPU Clusters in processing large-scale streams
Many emerging online data analysis applications require Large-scale streams data processing. GPU cluster is becoming a significantly parallel computing scheme to handling large-scale streams data tasks. However power optimization is a challenging issue. In this paper, we present a novel power consumption control model to shift power budge among nodes in the cluster based on […]
Oct, 13
Contributions to parallel stochastic simulation: Application of good software engineering practices to the distribution of pseudorandom streams in hybrid Monte-Carlo simulations
The race to computing power increases every day in the simulation community. A few years ago, scientists have started to harness the computing power of Graphics Processing Units (GPUs) to parallelize their simulations. As with any parallel architecture, not only the simulation model implementation has to be ported to the new parallel platform, but all […]
Oct, 13
Simulating Active Membrane Systems Using GPUs
Software development for cellular computing is growing up yielding new applications. In this paper, we describe a simulator for the class of recognizer P systems with active membranes, which exploits the massively parallel nature of P systems computations by using GPUs (Graphics Processing Units). The newest generation of GPUs provide a massively parallel framework to […]
Oct, 13
Characterizing the Challenges and Evaluating the Efficacy of a CUDA-to-OpenCL Translator
The proliferation of heterogeneous computing systems has led to increased interest in parallel architectures and their associated programming models. One of the most promising models for heterogeneous computing is the accelerator model, and one of the most cost-effective, high-performance accelerators currently available is the general-purpose, graphics processing unit (GPU). Two similar programming environments have been […]
Oct, 12
High performance sequence mining using pairwise statistical significance
With the amount of sequence data deluge as a result of next generation sequencing, there comes a need to leverage the large-scale biological sequence data. Therefore, the role of high performance computational methods to mining interesting information solely from these sequence data becomes increasingly important. Almost everything in bioinformatics counts on the inter-relationship between sequences, […]
Oct, 12
Regional Heritability Advanced Complex Trait Analysis for GPU and Traditional Parallel Architectures
MOTIVATION: Quantification of the contribution of genetic variation to phenotypic variation for complex traits becomes increasingly computationally demanding with increasing numbers of SNPs and individuals. To meet the challenges in making feasible large scale studies, we present the REACTA software. Adapted from ACTA (and, in turn, GCTA), it is tailored to exploit the parallelism present […]
Oct, 12
Coupling a Generalized DEM and an SPH Models Under a Heterogeneous Massively Parallel Framework
The interaction of flows and solid objects is a recurring problem in several engineering disciplines. The objective of this work is to present a fully coupled model, based on the fundamental conservation laws of hydrodynamics, namely the continuity and Navier-Stokes equations, and the equation of conservation of momentum of solid bodies. The coupled numerical solution, […]
Oct, 12
Automatic run-time mapping of polyhedral computations to heterogeneous devices with memory-size restrictions
Tools that aim to automatically map parallel computations to heterogeneous and hierarchical systems try to divide the whole computation in parts with computational loads adjusted to the capabilities of the target devices. Some parts are executed in node cores, while others are executed in accelerator devices. Each part requires one or more data-structure pieces that […]