
Posts

Nov, 27

A Note on Auto-tuning GEMM for GPUs

The development of high performance dense linear algebra (DLA) critically depends on highly optimized BLAS, and especially on the matrix multiplication routine (GEMM). This is especially true for Graphics Processing Units (GPUs), as evidenced by recently published results on DLA for GPUs that rely on highly optimized GEMM. However, the current best GEMM performance, e.g. […]
Nov, 27

Efficient Parallelization of Stochastic Simulation Algorithm for Chemically Reacting Systems on the Graphics Processing Unit

The small number of some reactant molecules in biological systems formed by living cells can result in dynamical behavior which cannot be captured by traditional deterministic models. In such a problem, a more accurate simulation can be obtained with discrete stochastic simulation (Gillespie’s stochastic simulation algorithm – SSA). Many stochastic realizations are required to capture […]
Nov, 27

Parallel View-Dependent Level-of-Detail Control

We present a scheme for view-dependent level-of-detail control that is implemented entirely on programmable graphics hardware. Our scheme selectively refines and coarsens an arbitrary triangle mesh at the granularity of individual vertices to create meshes that are highly adapted to dynamic view parameters. Such fine-grain control has previously been demonstrated using sequential CPU algorithms. However, […]
Nov, 27

Real-time virtual environment signal extraction and denoising using programmable graphics hardware

The sense of being within a three-dimensional (3D) space and interacting with virtual 3D objects in a computer-generated virtual environment (VE) often requires essential image, vision and sensor signal processing techniques such as differentiating and denoising. This paper describes novel implementations of Gaussian filtering for characteristic signal extraction and wavelet-based image denoising algorithms […]
Nov, 27

Implications of the Turing completeness of reaction-diffusion models, informed by GPGPU simulations on an XBox 360: cardiac arrhythmias, re-entry and the Halting problem

In the arsenal of tools that a computational modeller can bring to bear on the study of cardiac arrhythmias, the most widely used and arguably the most successful is that of an excitable medium, a special case of a reaction-diffusion model. These are used to simulate the internal chemical reactions of a cardiac cell and […]
Nov, 27

Modeling of tsunami waves and atmospheric swirling flows with graphics processing unit

The faster growth curves in the speed of graphics processing units (GPUs) relative to CPUs have spawned a new area of development in computational technology. There is much potential in utilizing GPUs for solving evolutionary partial differential equations and producing the attendant visualization. We are concerned with modeling tsunami waves, where computational time is of […]
Nov, 27

Accelerating Lattice Boltzmann Fluid Flow Simulations Using Graphics Processors

Lattice Boltzmann Methods (LBM) are used for the computational simulation of Newtonian fluid dynamics. LBM-based simulations are readily parallelizable; they have been implemented on general-purpose processors, field-programmable gate arrays (FPGAs), and graphics processing units (GPUs). Of the three methods, the GPU implementations achieved the highest simulation performance per chip. With memory bandwidth of up to […]
Nov, 27

Accelerating geoscience and engineering system simulations on graphics hardware

Many complex natural systems studied in the geosciences are characterized by simple local-scale interactions that result in complex emergent behavior. Simulations of these systems, often implemented in parallel using standard Central Processing Unit (CPU) clusters, may be better suited to parallel processing environments with large numbers of simple processors. Such an environment is found in […]
Nov, 27

Jitter analysis of PLL-generated clock propagation using Jitter Mitigation techniques with laser voltage probing

A new Jitter Mitigation feature in the latest generation laser voltage probing (LVP) tool effectively removes PLL jitter from LVP waveforms [Ng Yin S, Lo W, Wilsher K. Next generation laser voltage probing. In: Proceeding, international symposium on testing and failure analysis; 2008. p. 249]. It facilitates the probing of phase-locked loop (PLL) driven circuitry […]
Nov, 27

Optimizing the SUSAN corner detection algorithm for a high speed FPGA implementation

In many embedded systems for video surveillance, distinctive features are used for the detection of objects. In this contribution, a real-time FPGA implementation of a feature detector, namely the SUSAN algorithm, is described. As the original SUSAN algorithm performs poorly on non-synthetic images, a significant quality improvement of this algorithm is presented. The hardware accelerator […]
Nov, 27

Simulation of Shallow-Water systems using Graphics Processing Units

This paper addresses the speedup of the numerical solution of shallow-water systems in 2D domains by using modern Graphics Processing Units (GPUs). A first order well-balanced finite volume numerical scheme for 2D shallow water systems is considered. The potential data parallelism of this method is identified and the scheme is efficiently implemented on GPUs for […]
Nov, 27

A survey of BRDF models for computer graphics

To produce photo-realistic images in computer graphics, we must effectively describe the interactions between light and surfaces. In this paper, we focus on Bidirectional Reflectance Distribution Functions (BRDFs), which characterize these interactions. We survey most BRDF representations introduced so far and investigate their usage, importance and applications. We look in detail at their […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
