Posts

Oct, 30

Impact of GPU Memory Access Patterns on FDTD

The application of General Purpose computing on a GPU is an effective way to accelerate the FDTD method. This work explores the different domain decomposition techniques from the literature and extends the theoretically best approach with additional flexibility. We examine the performance on both Tesla and Fermi architecture GPUs and identify the best way to […]
Oct, 30

GPGPU Implementation of Matrix Formalism for Beam Dynamics Simulation

Matrix formalism is a map integration method for ODE solving. It allows the solution of the system to be presented as sums and products of two-index numeric matrices. This approach can be easily implemented in parallel codes. The GPU architecture was chosen as the most natural fit for matrix operations. The set of the methods for beam dynamics […]
Oct, 30

Characterization and Exploitation of GPU Memory Systems

Graphics Processing Units (GPUs) are workhorses of modern performance due to their ability to achieve massive speedups on parallel applications. The massive number of threads that can be run concurrently on these systems allows applications which have data-parallel computations to achieve better performance when compared to traditional CPU systems. However, the GPU is not perfect […]
Oct, 29

A GPU Approach to Fortran Legacy Systems

A large number of Fortran legacy programs are still running in production environments, and most of these applications are running sequentially. Multi- and many-core architectures are established as (almost) the only processing hardware available, and new programming techniques that take advantage of these architectures are necessary. In this report, we will explore the impact […]
Oct, 29

Parallelized Segmentation of CT-Angiography datasets using CUDA

Segmentation of CT-Angiography datasets is an important and difficult task. Several algorithms and approaches have already been invented and implemented to solve this problem. In this work, we present automatic algorithms for the segmentation of these CTA datasets, implemented in CUDA, and evaluate our results regarding speed and error rates. Starting with local approaches like […]
Oct, 29

Exploring computational capabilities of GPUs using H.264 prediction algorithms

In recent years, there has been a drive towards parallel architectures to further increase computational performance. The many-core architecture of modern Graphics Processing Units (GPU) can be used for general computations in addition to graphics processing, and provide good performance for data parallel computations. In this thesis, we explore the processing powers of two generations […]
Oct, 29

Silhouette Extraction using Graphics Processing Units

A silhouette gives an outline of a person or an object. When extracting silhouettes from an image or video, we acquire information about where in the image the foreground objects are located, their sizes and their shapes. This silhouette information can be useful in many ways. In computer vision, silhouette extraction plays an important role. […]
Oct, 28

Investigating Host-Device communication in a GPU-based H.264 encoder

Modern graphics processing units (GPUs) are powerful parallel processors, capable of running thousands of concurrent threads. While originally limited to graphics processing, newer generations can be used for general computing (GPGPU). Through frameworks such as NVIDIA's Compute Unified Device Architecture (CUDA) and OpenCL, GPU programs can be written using established programming languages (with minor extensions) […]
Oct, 28

Three dimensional tracking of gold nanoparticles using digital holographic microscopy

In this paper we present a digital holographic microscope to track gold colloids in three dimensions. We report observations of 100 nm gold particles in motion in water. The expected signal and the chosen method of reconstruction are described. We also discuss how to implement the numerical calculation to reach real-time 3D tracking.
Oct, 28

Historygrams: Enabling Interactive Global Illumination in Direct Volume Rendering using Photon Mapping

In this paper, we enable interactive volumetric global illumination by extending photon mapping techniques to handle interactive transfer function (TF) and material editing in the context of volume rendering. We propose novel algorithms and data structures for finding and evaluating parts of a scene affected by these parameter changes, and thus support efficient updates of […]
Oct, 28

Analysis & Design of Efficient Cryptographic Systems

Grain-128 is a stream cipher, which was proposed in the course of the eSTREAM project. This stream cipher is already in use today, for example in RFID applications. Therefore, the security of Grain-128 is of great interest. The Conditional Differential Analysis, the Static Cube Attack and the Dynamic Cube Attack are methods for the cryptanalysis […]
Oct, 28

Parallel computation of a SPECT projection operator for a content adaptative mesh model

In this paper we explore a parallel implementation for fast calculation of a tomographic projection operator for content-adaptive mesh model (CAMM) image reconstruction. Previously we introduced 2D and 3D tomographic image reconstruction using a CAMM for single photon emission computed tomography (SPECT). The proposed parallel method is fast and allows incorporation of a non-uniform attenuation […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
