3493

Posts

Mar, 29

Scandalously Parallelizable Mesh Generation

We propose a novel approach which employs random sampling to generate an accurate non-uniform mesh for numerically solving Partial Differential Equation Boundary Value Problems (PDE-BVP’s). From a uniform probability distribution U over a 1D domain, we sample M discretizations of size N where M>>N. The statistical moments of the solutions to a given BVP on […]
Mar, 29

Multi-mass solvers for lattice QCD on GPUs

Graphical Processing Units (GPUs) are more and more frequently used for lattice QCD calculations. Lattice studies often require computing the quark propagators for several masses. These systems can be solved using multi-shift inverters but these algorithms are memory intensive which limits the size of the problem that can be solved using GPUs. In this paper, […]
Mar, 28

GPU-Based Shooting and Bouncing Ray Method for Fast RCS Prediction

The shooting and bouncing ray (SBR) method is highly effective in the radar cross section (RCS) prediction. For electrically large and complex targets, computing scattered fields is still time-consuming in many applications like range profile and ISAR simulation. In this paper, we propose a GPU-based SBR that is fully implemented on the graphics processing unit […]
Mar, 28

An Empirically Optimized Radix Sort for GPU

In this paper, we propose an empirical optimization technique for one of the most important sorting routines on GPU, the radix sort, that generates highly efficient code for a number of representative NVIDIA GPUs with a wide variety of architectural specifications. Our study has been focused on the algorithmic parameters of radix sort that can […]
Mar, 28

GPU architecture evaluation for multispectral and hyperspectral image analysis

Graphical Processing Units (GPU) architectures are massively used for resource-intensive computation. Initially dedicated to imaging, vision and graphics, these architectures serve nowadays a wide range of multi-purpose applications. The GPU structure, however, does not suit all applications. This can lead to performance shortage. Among several applications, the aim of this work is to analyze GPU […]
Mar, 28

GPU accelerated real time polarimetric image processing through the use of CUDA

Recent advancements in semi-conductor fabrication has led to a dramatic increase in the size of data sets of advanced imaging sensors. While increased pixel counts leads to greater area coverage and higher resolution, it also results in higher image processing time. If real-time image processing is required, power and size requirements go up as large […]
Mar, 28

GPU Based Spot Noise Parallel Algorithm for 2D Vector Field Visualization

Graphic Processing Unit (GPU) has involved into a parallel computation for it’s massively multi threaded architecture. Due to its high computational power, GPU has been used to deal with many problems that can be easily parallelized. This paper will present a GPU based spot noise parallel algorithm for 2D vector field visualization. It uses spot […]
Mar, 28

A Chunking Method for Euclidean Distance Matrix Calculation on Large Dataset Using Multi-GPU

Calculating Euclidean distance matrix is a data intensive operation and becomes computationally prohibitive for large datasets. Recent development of Graphics Processing Units (GPUs) has produced superb performance on scientific computing problems using massive parallel processing cores. However, due to the limited size of device memory, many GPU based algorithms have low capability in solving problems […]
Mar, 28

GPU-Based Fast Minimum Spanning Tree Using Data Parallel Primitives

Minimum spanning tree is a classical problem in graph theory that plays a key role in a broad domain of applications. This paper proposes a minimum spanning tree algorithm using Prim’s approach on Nvidia GPU under CUDA architecture. By using new developed GPU-based Min-Reduction data parallel primitive in the key step of the algorithm, higher […]
Mar, 28

A Batched GPU Algorithm for Set Intersection

Intersection of inverted lists is a frequently used operation in search engine systems. Efficient CPU and GPU intersection algorithms for large problem size are well studied. We propose an efficient GPU algorithm for high performance intersection of inverted index lists on CUDA platform. This algorithm feeds queries to GPU in batches, thus can take full […]
Mar, 28

GMH: A Message Passing Toolkit for GPU Clusters

Driven by the market demand for high-definition 3D graphics, commodity graphics processing units (GPUs) have evolved into highly parallel, multi-threaded, many-core processors, which are ideal for data parallel computing. Many applications have been ported to run on a single GPU with tremendous speedups using general C-style programming languages such as CUDA. However, large applications require […]
Mar, 28

Two improved GPU acceleration strategies for force-directed graph layout

Force directed approach is one of the most widely used methods in graph drawing research. However, the running time is increased intolerablely along with the enlargement of the graph size, which restricts the algorithm’s practicability. By the aid of GPU (graphics processing unit) computing platform, we can speed-up the graph layout with low cost, but […]

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us: