
Posts

Feb, 27

A Domain-Specific Approach To Heterogeneous Parallelism

Exploiting heterogeneous parallel hardware currently requires mapping application code to multiple disparate programming models. Unfortunately, general-purpose programming models available today can yield high performance but are too low-level to be accessible to the average programmer. We propose leveraging domain-specific languages (DSLs) to map high-level application code to heterogeneous devices. To demonstrate the potential of this […]
Feb, 26

StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines

Multicore machines equipped with accelerators are becoming increasingly popular. The TOP500-leading RoadRunner machine is probably the most famous example of a parallel computer mixing IBM Cell Broadband Engines and AMD Opteron processors. Other architectures, featuring GPU accelerators, are expected to appear in the near future. To fully tap into the potential of these hybrid machines, […]
Feb, 26

A Tuning Framework for Software-Managed Memory Hierarchies

New architectures are emerging at a rapid pace: architectures with multiple processing units on a chip and with deep memory hierarchies have become pervasive, while architectures with software-managed memory hierarchies (such as the Sony/Toshiba/IBM Cell processor) have gained popularity. Due to the increased complexity of architectures, re-targeting a legacy application to a new architecture requires […]
Feb, 26

Real-Time Approaches to Computer Vision

Perhaps our extensive reliance on visual sensory input makes the use of artificial visual sensors seem like an intuitive choice. Thus, Machine Vision or Computer Vision has become an exciting field of research, finding its way into many industrial applications. The results from Computer Vision research can be incorporated in autonomous machine navigation, industrial […]
Feb, 26

Believe it or Not! Multi-core CPUs Can Match GPU Performance for FLOP-intensive Application!

In this work, we evaluate the performance of a real-world image processing application that uses a cross-correlation algorithm to compare a given image with a reference one. The algorithm processes individual images represented as 2-dimensional matrices of single-precision floating-point values using O(n^4) operations involving dot-products and additions. We implement this algorithm on an nVidia GTX 285 […]
Feb, 26

GPU-Based Foreground-Background Segmentation Using an Extended Colinearity Criterion

We present a GPU-based foreground-background segmentation that processes image sequences in less than 4 ms per frame. Change detection with respect to the background is based on a color similarity test in a small pixel neighbourhood, and is integrated into a Bayesian estimation framework. An iterative MRF-based model is applied, exploiting parallelism on modern graphics hardware. Resulting segmentation […]
Feb, 26

Concurrent GPU Programming

Monte Carlo algorithms use repeated random sampling to find solutions to problems. One common example uses points randomly selected from the unit box to approximate the value of pi. Another example is a simulation called a virtual spectrophotometer which measures the reflectance of a modeled material [1]. The repetitive nature of Monte Carlo algorithms usually […]
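
The pi example mentioned in this abstract can be sketched in a few lines. This is a minimal sequential illustration, not the GPU implementation the post describes: it assumes points drawn uniformly from the unit square and counts those falling inside the quarter circle of radius 1. The function name estimate_pi and its parameters are hypothetical, chosen only for this sketch.

```python
import random


def estimate_pi(num_samples, seed=None):
    """Approximate pi by sampling points uniformly from the unit square.

    The fraction of points with x^2 + y^2 <= 1 approximates the area of a
    quarter circle of radius 1, which is pi / 4.
    (Hypothetical helper for illustration; not taken from the post.)
    """
    if num_samples <= 0:
        raise ValueError("num_samples must be positive")

    rng = random.Random(seed)  # private generator so results are repeatable
    inside = 0
    for _ in range(num_samples):
        x = rng.random()
        y = rng.random()
        if x * x + y * y <= 1.0:
            inside += 1

    return 4.0 * inside / num_samples


if __name__ == "__main__":
    # More samples give a tighter estimate; the error shrinks roughly
    # as 1 / sqrt(num_samples).
    print(estimate_pi(1_000_000, seed=0))
```

Each sample is independent of the others, which is why this kind of loop is a natural candidate for parallel execution.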
Feb, 26

Efficient GPU-Accelerated Elastic Image Registration

Elastic intra-patient registration can be used to correct for local motion within biomedical images. The application of elastic registration during interventional treatment is seriously hampered by its considerable computation time. Graphics processing units (GPUs) can be used to accelerate the calculation of such elastic registrations, without changing the basic registration algorithm. This article discusses […]
Feb, 26

A Watermarking Co-Processor for New Generation Graphics Processing Units

Recent growth of high-speed Internet and high-resolution imaging has enabled electronic storage and transfer of digital multimedia content without loss of quality. In order to prevent the illegal reproduction of digital multimedia elements, many researchers have suggested digital watermarking as a feasible solution. Like other signal and image processing […]
Feb, 26

Handbook of open source tools

Handbook of Open Source Tools introduces a comprehensive collection of advanced open source tools useful in developing software applications. The book contains information on more than 200 open-source tools which include software construction utilities for compilers, virtual-machines, database, graphics, high-performance computing, OpenGL, geometry, algebra, graph theory, GUIs and more. Special highlights for software construction utilities […]
Feb, 26

Real-time blood flow visualization using the graphics processing unit

Laser speckle imaging (LSI) is a technique in which coherent light incident on a surface produces a reflected speckle pattern that is related to the underlying movement of optical scatterers, such as red blood cells, indicating blood flow. Image-processing algorithms can be applied to produce speckle flow index (SFI) maps of relative blood flow. We […]
Feb, 25

Porous Rock Simulations and Lattice Boltzmann on GPUs

Investigating how fluids flow inside the complicated geometries of porous rocks is an important problem in the petroleum industry. The lattice Boltzmann method (LBM) can be used to calculate porous rocks' permeability. In this paper, we show how to implement this method efficiently on modern GPUs. Both a sequential CPU implementation and a parallelized GPU […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
