7001

Posts

Jan, 15

Enhancing Performance for Solving Finite Element Mesh using Heterogeneous Platforms

Finite element methods (FEM) are most widely used for simulation of structural dynamics problems. Due to their highly compute intensive nature, these methods are used with domain decomposition where the problem is divided into subdomains which are individually solved and coupled together to obtain the final solution. One of the latest and most efficient approach […]
Jan, 15

Declarative Parallel Programming for GPUs

The recent rise in the popularity of Graphics Processing Units (GPUs) has been fueled by software frameworks, such as NVIDIA’s Compute Unified Device Architecture (CUDA) and Khronos Group’s OpenCL that make GPUs available for general purpose computing. However, CUDA and OpenCL are still lowlevel approaches that require users to handle details about data layout and […]
Jan, 15

A visibility-based approach for occupancy grid computation in disparity space

Occupancy grids are a very convenient tool for environment representation in robotics. This paper will detail a novel approach to compute occupancy grids from stereo-vision, and shows its application for the field of intelligent vehicles. In the proposed approach, occupancy is initially computed directly in the stereoscopic sensor’s disparity space. The calculation formally accounts for […]
Jan, 14

Linear Algebra Algorithms for Hybrid Architectures with XKaapi

The emergence and continuing use of multicore architectures with GPU accelerators require changes in the current software to address the gap between the accelerators’ computer vs the CPU-GPU communication speed. We describe how to develop linear algebra algorithms for these new and emerging hybrid architectures using XKaapi.
Jan, 14

Efficient Knowledge Extraction from Structured Data

Knowledge extraction from structured data aims for identifying valid, novel, potentially useful, and ultimately understandable patterns in the data. The core step of this process is the application of a data mining algorithm in order to produce an enumeration of particular patterns and relationships in large databases. Clustering is one of the major data mining […]
Jan, 14

An Integrated Framework for Feature Extraction, Object Recognition and Stereo Vision with GPU support

This paper investigates the integration of feature extraction, object recognition and 3D reconstruction by stereo vision into a unified framework. In doing so, stereo vision can be made more robust by applying feature extraction results to the stereo matching process, and object recognition can be extended through the integration of depth information as another feature […]
Jan, 14

Graphics Processor Clusters for High Speed Backpropagation

This paper describes the use of GPU clusters to accelerate backpropagation for Synthetic Aperture Sonar (SAS) systems. We extended a GPU-based implementation of backpropagation to support clusters of GPU-enhanced nodes. The GPU accelerated implementation formed a 3,936 x 3,936 SAS image from 60s of sonar data in under 12s using a single GTX480, and under […]
Jan, 14

Accelerating Bit Error Rate Simulation in MATLAB with Graphics Processors

Bit error rate simulations are used to estimate the error probability for a communications channel. Typically, many millions of trials must be run in order to have a reasonable estimate of the error probability. The Communications System Toolbox in MATLAB contains tools that allow the user to construct these simulations, but executing the required trials […]
Jan, 13

BarraCUDA – a fast short read sequence aligner using graphics processing units

BACKGROUND: With the maturation of next-generation DNA sequencing (NGS) technologies, the throughput of DNA sequencing reads has soared to over 600 gigabases from a single instrument run. General purpose computing on graphics processing units (GPGPU), extracts the computing power from hundreds of parallel stream processors within graphics processing cores and provides a cost-effective and energy […]
Jan, 13

Efficient Convex Optimization Approaches to Variational Image Fusion

Image fusion is an imaging technique to visualize information from multiple imaging sources by one single image, which is widely used in remote sensing, medical imaging etc. In this work, we study two variational approaches to image fusion which are closely related to the standard TV-L2 and TV-L1 image approximation methods. We investigate their convex […]
Jan, 13

DFG Implementation on Multi GPU Cluster with Computation-Communication Overlap

Nowadays, computers embed many CPUs and at least one GPU. Workstations can host several GPU cards, which are well suited for scientific and engineering computations. Such computers are linked through high bandwidth networks to compose clusters for HPC. These machines provide highly parallel multicore architectures while being cost-effective. Moreover, they significantly reduce dissipated power, and […]
Jan, 13

Acceleration and Optimisation of a Monte Carlo Code for Light Propagation in Sprays and Other Scattering Media

In this thesis several steps towards the optimization and acceleration of a Monte Carlo code for the simulation of light propagation in particulate scattering media have been taken. This is performed by parallelizing a Monte Carlo code originally written by E. Berrocal [1] and running the simulation on a modern computer graphic card; a process […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: