Posts
Aug, 27
The development and expansion of HOOMD-blue through six years of GPU proliferation
HOOMD-blue is the first general purpose MD code built from the ground up for GPU acceleration, and has been actively developed since March 2007. It supports a variety of force fields and integrators targeted at soft-matter simulations. As an open source project, numerous developers have contributed useful feature additions back to the main code. High […]
Aug, 27
Compilation techniques and language support to facilitate dependence-driven computation
As the demand increases for high performance and power efficiency in modern computer runtime systems and architectures, programmers are left with the daunting challenge of fully exploiting these systems for efficiency, high-level expressibility, and portability across different computing architectures. Emerging programming models such as the task-based runtime StarPU and many-core architectures such as GPUs force […]
Aug, 27
Solutions for Optimizing the Monte Carlo Option Pricing Method’s Implementation Using the Compute Unified Device Architecture
Finance-related problems require more and more computations; therefore, the problem of finding efficient implementations for option pricing models on modern architectures has become an important challenge. Although there are numerous implementations of the Monte Carlo method on central processing units, many of them face limitations arising from the necessary increased computational power. In this paper, […]
Aug, 27
Multiple Time Scales Recurrent Neural Network for Complex Action Acquisition
This paper presents novel results of complex action learning experiments based on the use of extended multiple time-scales recurrent neural networks (MTRNN). The experiments were carried out with the iCub humanoid robot, as a model of the developmental learning of motor primitives as the basis of sensorimotor and linguistic compositionality. The model was implemented through […]
Aug, 27
GPU-based simulation of the long-range Potts model via parallel tempering
We discuss the efficiency of parallelization on graphical processing units (GPUs) for the simulation of the one dimensional Potts model with long range interactions via parallel tempering. We investigate the behaviour of some thermodynamic properties, such as equilibrium energy and magnetization, critical temperatures as well as the separation between the first- and second-order regime. By […]
Aug, 26
Aquila 2.0: Software Architecture for Cognitive Robotics
The modelling of the integration of various cognitive skills and modalities requires complex and computationally intensive algorithms running in parallel while controlling high-performance systems. The distribution of processing across many computers has certainly advanced our software ecosystem and opened up research to new possibilities. While this was an essential move, we are aspiring to augment […]
Aug, 26
Fast Object Re-Detection and Localization in Video for Spatio-Temporal Fragment Creation
This paper presents a method for the detection and localization of instances of user-specified objects within a video or a collection of videos. The proposed method is based on the extraction and matching of SURF descriptors in video frames and further incorporates a number of improvements so as to enhance both the detection accuracy and […]
Aug, 26
Estimating the WCET of GPU-Accelerated Applications using Hybrid Analysis
The massive parallelism offered by Graphics Processing Units (GPUs) is now routinely exploited to accelerate computationally intensive tasks in a wide variety of application domains. Efficient GPU programming in languages such as CUDA and OpenCL requires careful application of hand optimisations to exploit parallelism and locality while minimising synchronisation. The effectiveness of such optimisations can […]
Aug, 26
Lattice Boltzmann Simulations of Multiphase Flows
This thesis is a comprehensive account of my experiences implementing the Lattice Boltzmann Method (LBM) for the purpose of simulating multiphase flows relevant to Air Conditioning and Refrigeration Center (ACRC) applications. Other methodologies have been used to simulate multiphase flow including finite volume based Navier-Stokes solvers. These methods have found reasonable success in simulating multiphase […]
Aug, 26
OpenCL programming using Python syntax
We describe ocl, a Python library built on top of pyOpenCL and numpy. It allows programming GPU devices using Python. Python functions which are marked up using the provided decorator, are converted into C99/OpenCL and compiled using the JIT at runtime. This approach lowers the barrier to entry to programming GPU devices since it requires […]
Aug, 26
SystemC simulation on GP-GPUs: CUDA vs. OpenCL
SystemC is a widespread language for developing SoC designs. Unfortunately, most SystemC simulators are based on a strictly sequential scheduler that heavily limits their performance, impacting verification schedules and time-to-market of new designs. Parallelizing SystemC simulation entails a complete re-design of the simulator kernel for the specific target parallel architectures. This paper proposes an automatic […]
Aug, 26
Performance Evaluation of Intel Xeon Phi Coprocessor using XKaapi
This paper presents preliminary performance comparisons of parallel applications developed natively for the Intel Xeon Phi accelerator using three different parallel programming environments and their associated runtime systems. We compare Intel OpenMP, Intel CilkPlus and XKaapi together on the same benchmark suite. Our benchmark suite is composed of two computing kernels: a Fibonacci computation that […]