Posts
Nov, 17
Programming Future Parallel Architectures with Haskell and Intel ArBB
New parallel architectures, such as Cell, Intel MIC, GPUs, and tiled architectures, enable high performance but are often hard to program. What is needed is a bridge between high-level programming models where programmers are most productive and modern parallel architectures. We propose that that bridge is Embedded Domain Specific Languages (EDSLs). One attractive target for […]
Nov, 17
Scientific GPU Programming with Data-Flow Languages
Graphical Processing Units or GPUs are processors used primarily to render images from computer models for domains ranging from gaming to design engineering. As the generation of very accurate images often in real time is extremely computationally intensive, they have developed into extremely powerful processors. To achieve this they have relied on being able to […]
Nov, 17
FPGA and ASIC Convergence
The growing demands on multimedia applications and high-speed high-quality telecommunication systems with real-time constrains oriented to portable, low power consumption, devices, have being driven technologies development, methodologies and design flows of embedded systems during the last years. Through the analysis of design methodologies and strategies facing multi-core, reconfigurability and power consumption challenges, this educational survey […]
Nov, 17
Characterization and Transformation of Unstructured Control Flow in GPU Applications
Hardware and compiler techniques for mapping data-parallel programs with divergent control flow to SIMD architectures have recently enabled the emergence of new GPGPU programming models such as CUDA and OpenCL. Although this technology is widely used, commodity GPUs use different schemes to implement it, and the performance limitations of these different schemes under real workloads […]
Nov, 17
Massive Image Editing on the Cloud
Processing massive imagery in a distributed environment currently requires the effort of a skilled team to efficiently handle communication, synchronization, faults, and data/process distribution. Moreover, these implementations are highly optimized for a specific system or cluster, therefore portability or improved performance due to system improvements is rarely considered. Much like early GPU computing, cluster computing […]
Nov, 17
Adaboost GPU-based Classifier for Direct Volume Rendering
In volume visualization, the voxel visibitity and materials are carried out through an interactive editing of Transfer Function. In this paper, we present a two-level GPU-based labeling method that computes in times of rendering a set of labeled structures using the Adaboost machine learning classifier. In a pre-processing step, Adaboost trains a binary classifier from […]
Nov, 17
The role of GPU computing in medical image analysis and visualization
The role of computers in medical image display and analysis continues to be one of the most computationally demanding tasks facing modern computers. Recent advances in GPU architecture have allowed for a new programming paradigm which utilized the massively parallel computational capacity of GPUs for general purpose computing. These parallel processors provide substantial performance benefits […]
Nov, 17
Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs
The power of GPUs is giving rise to heterogeneous parallel computing, with new demands on programming environments, runtime systems, and tools to deliver high-performing applications. This paper studies the problems associated with performance measurement of heterogeneous machines with GPUs. A heterogeneous computation model and alternative host-GPU measurement approaches are discussed to set the stage for […]
Nov, 16
Simulations of Large Particle Systems in Real Time
Simulation of interacting particle systems has been a well established method for many years now. Such systems can span different scales, including microscopic (where particles represent atoms, as in Molecular Dynamics simulations) as well as macroscopic. In the latter case, growing interest is put into Smoothed Particle Hydrodynamics approach. Traditionally, over many years, simulation of […]
Nov, 16
Object Space Based Collision Detection for Cloth Simulation on the GPU
This paper presents an approach for cloth-body collision detection in computer graphics simulations of clothing. It is an object-space based algorithm implemented in OpenCL on the GPU. The underlying idea behind this work is to speed up the solution of the collision detection problem by utilizing the excessive computational capacity of contemporary GPUs. Results of […]
Nov, 16
Parallel Approach for Longest Common Subsequence problem on GPU
Recent developments in genomic and molecular technologies produced a tremendous amount of information related to molecular biology. The management and analysis of these biological data require intensive computing power. Sequence aligning is one of the algorithmic tools in bioinformatics to look for resemblance among sequences of amino acids. The longest common subsequence (LCS) of biological […]
Nov, 16
Scope for performance enhancement of CMU Sphinx by parallelising with OpenCL
Automatic Speech Recognition (ASR) system that utilises many-core Graphics Processing Unit (GPU) architecture enables myriad of emerging applications like mobile based speech recognition, multimedia content transcription, and voice based language translation. This article discusses the feasibility and challenges in performance enhancement of CMU Sphinx-3.08 by parallelising the data-parallel parts using OpenCL that can utilise the […]