8245

Posts

Sep, 6

GPU-Accelerated First-Order Scattering Simulation for X-Ray CT Image Reconstruction

In recent years the GPU has become an increasingly popular tool in various fields. In this paper, we will introduce our preliminary work on first-order scatter simulation in X-ray imaging accelerated by GPU. As this is preliminary work, we explore the GPU accelerated scattering simulation in 2D space and test it with physics-based simulated data. […]
Sep, 6

GPU Acceleration of BCP Procedure for SAT Algorithms

The satisfiability problem (SAT) is widely applicable and one of the most basic NP-complete problems. This problem has been required to be solved as fast as possible because of its significance, but it takes exponential time in the worst case to solve. Therefore, we aim to save the computation time by parallel computing on a […]
Sep, 6

Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking

Emergence of new Graphical Processors for general purpose computing presents new challenges for application developers. Graphical Processors vary in terms of number of processor cores per chip, processor speed and memory subsystems. NVIDIA’s CUDA provides a C-like abstraction layer for software developers to implement their applications on GPUs often with little knowledge of the underlying […]
Sep, 6

CudaGIS: Report on the Design and Realization of a Massive Data Parallel GIS on GPUs

We report the design and realization of a highperformance parallel GIS, i.e., CudaGIS, based on the General Purpose computing on Graphics Processing Units (GPGPU) technologies. Still under active developments, CudaGIS currently supports major types of geospatial data (point, polyline, polygon and raster) and provides modules for spatial indexing, spatial join and other types of geospatial […]
Sep, 5

GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs

BACKGROUND: The analysis of biological networks has become a major challenge due to the recent development of high-throughput techniques that are rapidly producing very large data sets. The exploding volumes of biological data are craving for extreme computational power and special computing facilities (i.e. super-computers). An inexpensive solution, such as General Purpose computation based on […]
Sep, 5

Multi-user real-time speech recognition with a GPU

We have developed a multi-user large vocabulary speech recognition system employing a fully composed one-level weighted finite state transducer (WFST) based network on a Graphics Processing Unit (GPU). This system improves the overall throughput and latency of speech recognition engine which processes multiple users’ utterances at the same time with efficient scheduling, parameter sharing, and […]
Sep, 5

Accelerating and Characterizing Seam Carving Using a Heterogeneous CPU-GPU System

Seam carving has been widely used for contentaware resizing of images and videos with little to no perceptible distortion. Unfortunately, for high-resolution videos and large images it becomes computationally unfeasible to do the resizing in real-time using small-scale CPU systems. In this paper, we exploit the highly parallel computational capabilities of CUDA-enabled Graphics Processing Units […]
Sep, 5

A GPGPU Implementation of Approximate String Matching with Regular Expression Operators and Comparison with Its FPGA Implementation

In this paper, we propose an efficient GPGPU implementation of an algorithm for approximate string matching with regular expression operators, originally implemented on an FPGA, and compare the GPGPU, FPGA and CPU implementations by experiments. Approximate string matching with regular expression operators is used in various applications, such as full text database search and DNA […]
Sep, 5

GPU-accelerated Fourier-continuation solvers and physically exact computational boundary conditions for wave scattering problems

Many important engineering problems, ranging from antenna design to seismic imaging, require the numerical solution of problems of time-domain propagation and scattering of acoustic, electromagnetic, elastic waves, etc. These problems present several key difficulties, including numerical dispersion, the need for computational boundary conditions, and the extensive computational cost that arises from the extremely large number […]
Sep, 4

GPU implementation of a hybrid lattice Boltzmann method for non-isothermal flows

We propose a novel method to simulate non-isothermal flows. This method is ideally suited for the GPU architecture. The new algorithm is derived by coupling the lattice Boltzmann formulation for the flow with the finite difference scheme for the temperature field. We apply this algorithm to solve for the flow in the well known buoyancy […]
Sep, 4

Automated Enhanced Parallelization of Sequential C to Parallel OpenMP

The paper presents the work towards implementation of a technique to enhance parallel execution of auto-generated OpenMP programs by considering the architecture of on-chip cache memory, thereby achieving higher performance. It avoids false-sharing in ‘for-loops’ by generating OpenMP code for dynamically scheduling chunks by placing each core’s data cache line size apart. It has been […]
Sep, 4

Accelerating distance matrix calculations utilizing GPU

When modeling pedestrian movement, it is necessary to find a path to the target point. It is possible to use a distance matrix or derived gradient map for this purpose. Calculations of distance matrix for large areas and multiple targets are very time-consuming. Therefore this article focuses on acceleration of these calculations utilizing Graphics Processing […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: