Posts

Jul, 10

A Comparison of FPGA and GPU for Real-Time Phase-based Optical Flow, Stereo, and Local Image Features

Low level computer vision algorithms have extreme computational requirements. In this work we compare two real-time architectures developed using FPGA and GPU devices for the computation of phase-based optical flow, stereo and local image features (energy, orientation and phase). The presented approach requires a massive degree of parallelism to achieve real-time performance and allows us […]
Jul, 10

Two-Way Real Time Fluid Simulation Using a Heterogeneous Multicore CPU and GPU Architecture

Simulating natural phenomena such as water and smoke is an important way to increase real-time scene realism in video games. However, computational fluid simulation is an expensive task, since we must numerically solve the Navier-Stokes equations. Additionally, an immersive simulation requires interaction between the flow and the objects in the scene, increasing even […]
Jul, 10

Study on acceleration technique for calculating near field of horn antenna based on GPU

Horn antennas are extremely popular in the microwave region, so studying their near field has great practical significance. By Huygens' principle, the radiation of a horn antenna can be treated as equivalent to that of the surface current on its aperture. Since the dipole is the simplest and most familiar antenna, we use the array of dipoles to […]
Jul, 10

Synthesis and rendering of bidirectional texture functions on arbitrary surfaces

The bidirectional texture function (BTF) is a 6D function that describes the appearance of a real-world surface as a function of lighting and viewing directions. The BTF can model the fine-scale shadows, occlusions, and specularities caused by surface mesostructures. We present algorithms for efficient synthesis of BTFs on arbitrary surfaces and for hardware-accelerated rendering. For […]
Jul, 10

Implementation of stereophonic acoustic echo canceller on nVIDIA GeForce graphics processing unit

This paper presents an implementation of a stereophonic acoustic echo canceller on an nVIDIA GeForce graphics processor using the CUDA software development environment. For efficiency, fast shared memory has been used as much as possible. A tree adder is introduced to reduce the cost of summing up the thread outputs. The performance evaluation results suggest that even a […]
Jul, 10

Hardware-assisted visibility sorting for unstructured volume rendering

Harvesting the power of modern graphics hardware to solve the complex problem of real-time rendering of large unstructured meshes is a major research goal in the volume visualization community. While, for regular grids, texture-based techniques are well-suited for current GPUs, the steps necessary for rendering unstructured meshes are not so easily mapped to current hardware. […]
Jul, 10

FPGA and GPU implementation of large scale SpMV

Sparse matrix-vector multiplication (SpMV) is a fundamental operation for many applications. Many studies have implemented SpMV on different platforms, but few have focused on very large scale datasets with millions of dimensions. This paper addresses the challenges of implementing large scale SpMV with FPGA and GPU in the application of […]
Jul, 9

Implementation of usual computerized tomography methods on GPU using the Compute Unified Device Architecture (CUDA)

CUDA (Compute Unified Device Architecture) is an efficient architecture developed by NVIDIA for running parallel algorithms on the Graphics Processing Unit (GPU). Using the API associated with this architecture, we develop fast parallel algorithms to compute standard methods for computerized tomography. Computation times are compared to those of similar CPU implementations to illustrate the efficiency of […]
Jul, 9

GPU implementation of volume reconstruction and object detection in Digital Holographic Microscopy

Using Digital Holographic Microscopy (DHM), we can gather information from a whole volume and thus avoid the small depth-of-field constraint of conventional microscopes. In this way a volume inspection system can be constructed that is capable of finding, segmenting, collecting, and later classifying the objects that flow through an inspection chamber. […]
Jul, 9

Power and Performance Characterization of Computational Kernels on the GPU

Graphics Processing Units (GPUs) are gaining increasing popularity in high performance computing (HPC). While modern GPUs can offer much more computational power than CPUs, they also consume much more power. Energy efficiency is one of the most important factors that will affect a broader adoption of GPUs in HPC. In this paper, we systematically […]
Jul, 9

The use of overlapping subgrids to accelerate the FDTD on GPU devices

The Finite Difference Time Domain (FDTD) method is widely used in electromagnetic simulations to solve problems in microwave tomography, radar, and telecommunications. Since this method is both data intensive and computation intensive, there are many initiatives to improve the scalability and performance of FDTD. Despite the progress, performance in FDTD […]
Jul, 9

Accelerating data clustering on GPU-based clusters under shared memory abstraction

Many-core graphics processors play an important role today in the advancement of modern highly concurrent processors. Their ability to accelerate computation is being explored in several scientific fields. In the current paper we present the acceleration of a widely used data clustering algorithm, K-means, in the context of high performance GPU clusters. As opposed […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
