Posts
Jan, 13
GPU Implementation of a Multiobjective Search Algorithm
In this paper we examine whether a search algorithm for multiobjective optimization problems can be accelerated with the help of a graphics processing unit. Besides an implementation, we present test results and the conclusions that can be drawn from them.
Jan, 13
A Real-Time Capable Software-Defined Receiver Using GPU for Adaptive Anti-Jam GPS Sensors
Due to their weak received signal power, Global Positioning System (GPS) signals are vulnerable to radio frequency interference. Adaptive beam and null steering of the gain pattern of a GPS antenna array can significantly increase the resistance of GPS sensors to signal interference and jamming. Since adaptive array processing requires intensive computational power, beamsteering GPS […]
Jan, 13
Parallel mutual information estimation for inferring gene regulatory networks on GPUs
BACKGROUND: Mutual information is a measure of similarity between two variables. It has been widely used in various application domains including computational biology, machine learning, statistics, image processing, and financial computing. Previously used simple histogram-based mutual information estimators lack the precision of kernel-based methods. The recently introduced B-spline function based […]
Jan, 13
Parallel unmixing of remotely sensed hyperspectral images on commodity graphics processing units
Hyperspectral imaging instruments are capable of collecting hundreds of images, corresponding to different wavelength channels, for the same area on the surface of the Earth. One of the main problems in the analysis of hyperspectral data cubes is the presence of mixed pixels, which arise when the spatial resolution of the sensor is not enough […]
Jan, 13
Scientific Computing with Python on GPUs
I wrote this short documentation for the High Performance Computing Seminar of Prof. G. Haase at the University of Graz, Austria, in the winter semester 2010/11. In order to learn Sphinx and to help others get a quick start with Python in mathematics and scientific computing, I started to write […]
Jan, 13
Implementation of Virtual Embryology using the Thrust library for CUDA
The model to be described is based on the findings of evolutionary developmental biology (evo devo) and is intended to emphasize the fundamental importance of the development process, in addition to genetic material, in forming virtual embryos and, furthermore, Artificial Neural Networks (ANN). It was originally developed by Ronald Thenius in 2008 and extended by Michael Bodi […]
Jan, 13
An FPGA-based processing pipeline for high definition stereo video
This paper presents a real-time processing platform for high definition stereo video. The system is capable of processing stereo video streams at resolutions up to 1920×1080 at 30 frames per second (1080p30). In the hybrid FPGA-GPU-CPU system, a high-density FPGA is used to perform not only the low-level image processing tasks such as color interpolation and […]
Jan, 13
Mesh deformations in X3D via CUDA with freeform deformation lattices
In this paper we present a GPU-accelerated implementation of the well-known freeform deformation algorithm to allow for deformable objects within fully interactive virtual environments. We furthermore outline how our real-time deformation approach can be integrated into the X3D standard to make the proposed methods more accessible. The presented technique can be used to deform complex […]
Jan, 13
Fast Greeks: Case of Credit Valuation Adjustments
(Counterparty) Credit Valuation Adjustment (CVA) has become a prevailing form of pricing default risk on over-the-counter (OTC) contracts. Due to the large size of the portfolios included in the CVA calculation and its computational complexity, large computing grids are needed for the evaluation. The main purpose of this thesis is to investigate an even more computationally […]
Jan, 12
Enhancing Data Parallelism for Ant Colony Optimisation on GPUs
Graphics Processing Units (GPUs) have evolved into highly parallel and fully programmable architectures over the past five years, and the advent of CUDA has facilitated their use in many real-world applications. In this paper, we deal with a GPU implementation of Ant Colony Optimisation (ACO), a population-based optimisation method which comprises two major stages: Tour […]
Jan, 12
GKLEE: Concolic Verification and Test Generation for GPUs
Programs written for GPUs often contain correctness errors such as races and deadlocks, or may compute the wrong result. Existing debugging tools often miss these errors because of their limited input-space and execution-space exploration. Existing tools based on conservative static analysis or conservative modeling of SIMD concurrency generate false alarms, resulting in wasted bug-hunting. They also […]
Jan, 12
Incomplete-LU and Cholesky Preconditioned Iterative Methods Using CUSPARSE and CUBLAS
In this white paper we show how to use the CUSPARSE and CUBLAS libraries to achieve a 2x speedup over CPU in the incomplete-LU and Cholesky preconditioned iterative methods. We focus on the Bi-Conjugate Gradient Stabilized and Conjugate Gradient iterative methods, which can be used to solve large sparse nonsymmetric and symmetric positive definite linear […]