Posts

Dec, 14

Water Surface Animation using Damped Wave Equation and CUDA Acceleration

The damped wave equation is used for simulating water waves. The differential equation is approximated by finite differences. Explicit integration produces water height fields in real time. The CUDA framework is used to perform parallel computations on the GPU. It is shown that the GPU provides considerable speedup in comparison to the CPU.
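
As a rough illustration of the approach the abstract describes, the sketch below advances a height field with an explicit finite-difference step. It is not the paper's code: the form of the equation (u_tt + k u_t = c^2 ∇²u), the central-difference discretization, the fixed zero boundary, and every name and parameter value (`step`, `simulate`, `c`, `k`, `dt`, `dx`) are assumptions made for the example. It runs on the CPU in plain Python; because each interior cell depends only on values from the two previous time levels, the same update could be assigned to one GPU thread per cell.

```python
def step(prev, curr, c=1.0, k=0.5, dt=0.1, dx=1.0):
    """Advance the height field by one explicit time step.

    Assumed model: u_tt + k*u_t = c^2 * laplacian(u), discretized with
    central differences in time and space. Boundary cells stay at 0.
    """
    rows, cols = len(curr), len(curr[0])
    a = k * dt / 2.0            # damping term from the centered u_t difference
    r = (c * dt / dx) ** 2      # squared Courant number
    nxt = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            # 5-point Laplacian stencil (scaled by dx^2, absorbed into r)
            lap = (curr[i - 1][j] + curr[i + 1][j]
                   + curr[i][j - 1] + curr[i][j + 1]
                   - 4.0 * curr[i][j])
            nxt[i][j] = (2.0 * curr[i][j]
                         - (1.0 - a) * prev[i][j]
                         + r * lap) / (1.0 + a)
    return nxt


def simulate(n=16, steps=50):
    """Drop a unit bump in the middle of an n x n grid and integrate."""
    curr = [[0.0] * n for _ in range(n)]
    curr[n // 2][n // 2] = 1.0
    prev = [row[:] for row in curr]   # zero initial velocity
    for _ in range(steps):
        prev, curr = curr, step(prev, curr)
    return curr


if __name__ == "__main__":
    field = simulate()
    print(max(max(row) for row in field))
```

The default values keep c*dt/dx well below the 2D stability limit of 1/sqrt(2) for this explicit scheme; larger time steps would need to respect that bound.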
Dec, 13

Development of an unified FDTD-FEM library for electromagnetic analysis with CPU and GPU computing

We describe a C++ library for electromagnetics based on the Finite-Difference Time-Domain method for transient analysis, and the Finite Element Method for modal analysis. Both methods share the same core and are optimized for CPU and GPU computing. The FEM method is applied for solving Laplace’s equation and analyzes the relation between […]
Dec, 13

Automatic library generation for BLAS3 on GPUs

High-performance libraries, the performance-critical building blocks for high-level applications, will assume greater importance on modern processors as they become more complex and diverse. However, automatic library generators are still immature, forcing library developers to manually tune libraries to meet their performance objectives. We are developing a new script-controlled compilation framework to help domain experts reduce […]
Dec, 13

RaVioli: a GPU Supported High-Level Pseudo Real-time Video Processing Library

Real-time video processing applications such as intruder detection systems are now in demand and being developed. However, on general purpose computers, it is difficult to guarantee that enough CPU resources can be provided. We have proposed a pseudo real-time video processing library RaVioli for solving this problem. RaVioli conceals two types of resolutions, […]
Dec, 13

Parallel Implementations of Beamforming Design and Filtering for Microphone Array Applications

One of the main limitations of microphone array algorithms for audio applications has been their high computational cost in real acoustic environments when real-time signal processing is absolutely required. Regarding audio/speech signal processing, beamforming algorithms have been used for the recovery of acoustic signals from their observations when they are corrupted by noise, reverberation and […]
Dec, 13

Developing an OO Model for Generalized Matrix Multiplication: Preliminary Considerations

Recent changes in computational sciences force reevaluation of the role of dense matrix multiplication. Among others, this resulted in a proposal to consider generalized matrix multiplication, based on the theory of algebraic semirings. The aim of this note is to outline an initial object oriented model of the generalized matrix-multiply-add operation.
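
To make the idea of a generalized matrix-multiply-add concrete, here is a minimal sketch of C ⊕ (A ⊗ B) parameterized by a semiring. It is not the object-oriented model the note proposes: the `Semiring` class, the `gemm` function, and the two example instances are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, List


@dataclass(frozen=True)
class Semiring:
    """Hypothetical container for a semiring's operations."""
    add: Callable[[Any, Any], Any]   # the "sum" operation
    mul: Callable[[Any, Any], Any]   # the "product" operation
    zero: Any                        # identity element of add


def zeros(sr: Semiring, rows: int, cols: int) -> List[list]:
    """Matrix filled with the additive identity of the semiring."""
    return [[sr.zero] * cols for _ in range(rows)]


def gemm(sr: Semiring, A: List[list], B: List[list], C: List[list]) -> List[list]:
    """Return C (+) (A (x) B), where (+) and (x) come from the semiring."""
    n, m, p = len(A), len(B), len(B[0])
    out = [row[:] for row in C]      # do not modify the caller's C
    for i in range(n):
        for j in range(p):
            acc = out[i][j]
            for k in range(m):
                acc = sr.add(acc, sr.mul(A[i][k], B[k][j]))
            out[i][j] = acc
    return out


# Ordinary arithmetic: the classical matrix multiply-add.
PLUS_TIMES = Semiring(add=lambda x, y: x + y, mul=lambda x, y: x * y, zero=0)

# Min-plus (tropical) semiring: "multiplying" edge-weight matrices
# relaxes path lengths in a graph.
MIN_PLUS = Semiring(add=min, mul=lambda x, y: x + y, zero=float("inf"))
```

With `PLUS_TIMES` the function behaves like the usual dense multiply-add; with `MIN_PLUS`, calling `gemm(MIN_PLUS, W, W, W)` on a weight matrix `W` yields the shortest path lengths that use at most two edges, which is one common motivation for generalizing the operation.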
Dec, 13

Collision-Driven Volumetric Deformation on the GPU

We present a novel parallel algorithm to animate the deformation of a soft body in response to collision. Our algorithm incorporates elements of physically-based methods, and at the same time, it allows artistic control of general deformation behavior. Our solver has important benefits for practical use, such as evaluation of animation frames in an arbitrary […]
Dec, 13

Lattice Boltzmann Simulation of Binary Mixture Diffusion Using Modern Graphics Processors

A highly optimized implementation of binary mixture diffusion with no initial bulk velocity on graphics processors is presented. The lattice Boltzmann model is employed for simulating the binary diffusion of oxygen and nitrogen into each other with different initial concentration distributions. Simulations have been performed using the latest proposed lattice Boltzmann model that satisfies both […]
Dec, 13

High performance computing for linear acoustic wave simulation

Parallel computing techniques are applied to a linear acoustic wave model to reduce execution time. Three parallel computing models are developed to parallelize computations. The fork-and-join, SPMD and SIMT models define the execution of parallel computations. The precision and efficiency of the linear acoustic wave model are improved through substantial speedups in all implementations. Furthermore, […]
Dec, 13

Divergence Analysis with Affine Constraints

The rise of graphics processing units in high-performance computing is bringing renewed interest in code optimization techniques that target SIMD processors. Many of these optimizations rely on divergence analyses, which classify variables as uniform, if they have the same value on every thread, or divergent, if they might not. This paper introduces a new kind […]
Dec, 13

Graphics Processing Units: More Than the Pathway to Realistic Video-Games

The huge video games market has propelled the development of hardware and software focused on making the game environment more realistic. Among such developments are graphics processing units (GPUs). These devices are intended to relieve the central processing unit (CPU) of the host computer of the computation that creates "life" for video games. The GPUs […]
Dec, 12

Design and study of a massively multi threaded shared memory architecture

Most biocomputing problems require high processing power and large amounts of memory while offering opportunities for massive parallelism. Unfortunately, although advances are being made in software parallelism, current architectures do not provide a transparent way to use this parallelism to its full potential. We thus started to design a massively parallel megathreaded architecture that would match biocomputing […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
