Posts

Aug, 27

A new adaptive model for real-time fluid simulation with complex boundaries

In this paper, we present a new adaptive model for real-time fluid simulation with complex boundaries, based on the smoothed particle hydrodynamics (SPH) framework. First, we introduce an adaptive SPH framework based on our character field function, which is composed of four factors: geometrical complexity, boundary condition, physical complexity and complementary condition in terms of the […]
Aug, 26

Parallel Fast Gauss Transform

We present fast adaptive parallel algorithms to compute the sum of $N$ Gaussians at $N$ points. Direct sequential computation of this sum would take $O(N^2)$ time. The parallel time complexity estimates for our algorithms are $O(N/n_p)$ for uniform point distributions and $O((N/n_p)\log(N/n_p) + n_p \log n_p)$ for nonuniform distributions using $n_p$ CPUs. We incorporate a […]
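
For orientation, the direct $O(N^2)$ sum that the abstract above uses as its baseline can be sketched as follows. This is only the naive evaluation, not the paper's fast or parallel algorithm. The function name `direct_gauss_transform`, the per-source `weights`, and the bandwidth `h` are assumptions made for illustration; the excerpt does not specify them.

```python
import math

def direct_gauss_transform(sources, targets, weights, h):
    """Naive evaluation of G(y) = sum_i w_i * exp(-|y - x_i|^2 / h^2).

    Hypothetical helper for illustration: every target visits every
    source, so N sources and N targets cost O(N^2) kernel evaluations.
    """
    result = []
    for y in targets:
        total = 0.0
        for x, w in zip(sources, weights):
            # Squared Euclidean distance between source x and target y.
            sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
            total += w * math.exp(-sq_dist / (h * h))
        result.append(total)
    return result

# Two 1-D sources with unit weights, evaluated at one target.
values = direct_gauss_transform([(0.0,), (1.0,)], [(0.0,)], [1.0, 1.0], h=1.0)
```

The nested loop makes the quadratic cost explicit, which is the cost the fast algorithms aim to avoid.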
Aug, 26

PFunc: modern task parallelism for modern high performance computing

HPC today faces new challenges due to paradigm shifts in both hardware and software. The ubiquity of multi-cores, many-cores, and GPGPUs is forcing traditional serial as well as distributed-memory parallel applications to be parallelized for these architectures. Emerging applications in areas such as informatics are placing unique requirements on parallel programming tools that have not […]
Aug, 26

Challenging cloning related problems with GPU-based algorithms

Graphics Processing Units (GPUs) have been around for a while. Although they are primarily used for high-end 3D graphics processing, their use is now acknowledged for general massively parallel computing. This paper presents an original technique based on [10] to compute many instances of the longest common subsequence problem on a generic GPU architecture using […]
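
As a reminder of the problem named in the abstract above, here is a minimal sketch of the textbook dynamic-programming solution for the length of a longest common subsequence. It is a sequential CPU baseline, not the GPU technique from the paper or from [10], and the name `lcs_length` is an assumption made for illustration.

```python
def lcs_length(a, b):
    """Length of a longest common subsequence of sequences a and b.

    Textbook dynamic programming in O(len(a) * len(b)) time, keeping
    only the previous row of the table to save memory.
    """
    prev = [0] * (len(b) + 1)
    for item_a in a:
        curr = [0]
        for j, item_b in enumerate(b, start=1):
            if item_a == item_b:
                # Extend the best subsequence that ends before both items.
                curr.append(prev[j - 1] + 1)
            else:
                # Otherwise skip one item from either a or b.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]

# "Many instances" simply means many independent pairs.
pairs = [("ABCBDAB", "BDCABA"), ("GPU", "CPU")]
lengths = [lcs_length(x, y) for x, y in pairs]
```

Because each pair is independent of the others, a batch of instances like `pairs` is the kind of workload that can be spread across parallel hardware.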
Aug, 26

Considerations when evaluating microprocessor platforms

Motivated by recent papers comparing CPU and GPU performance, this paper explores the questions: Why do we compare microprocessors and by what means should we compare them? We distinguish two distinct perspectives from which to make comparisons: application developers and computer architecture researchers. We survey the distinct concerns of these groups, identifying essential information each […]
Aug, 26

Exploring graphics processing units as parallel coprocessors for online aggregation

Multidimensional aggregation is one of the most important computational building blocks and hence also a potential performance bottleneck in Online Analytic Processing (OLAP). In order to deliver fast query responses for interactive operations such as slicing, dicing, roll-up and drill-down, it is essential that aggregates along the relevant dimensions of a data cube can be […]
Aug, 26

Parallel Viewshed Analysis on GPU Using CUDA

Viewshed analysis is a long-established function of many geographical information systems that determines the cells of an input raster visible from one or more observers. Extending it to large-scale or higher-resolution rasters requires a parallel implementation to keep run times tolerable. In this paper, we describe a GPU parallelization of viewshed analysis using […]
Aug, 26

GPU Based Real-time Correction for Optical Distortions in Head-Mounted Displays

This paper presents a GPU-based real-time method to correct optical distortions in head-mounted displays (HMDs). The HMD to be corrected is a lightweight, wide field-of-view HMD system with a free-form-surface (FFS) prism, in which the image distortion is not rectilinear and centrosymmetric. A special predistortion model is constructed to correct the distortion of the HMD. […]
Aug, 26

Acceleration of an improved Retinex algorithm

Retinex is an image restoration method and the center/surround Retinex is appropriate for parallelization because it utilizes a convolution operation with large kernel size to achieve dynamic range compression and color/lightness rendition. However, its great capability for image enhancement comes with intensive computation. This paper presents a GPURetinex, which is a data parallel algorithm based […]
Aug, 26

Accelerating tetrahedral interpolation with data-level and Thread-Level Parallel optimization

The tetrahedral interpolation method for color space conversion consumes the longest time in the entire color management process. This makes it difficult to implement a purely software-based high-end image processing system. In this study, SIMD (Single Instruction Multiple Data) and GPGPU (General Purpose Graphics Processing Unit) based optimizations for tetrahedral interpolation are implemented. To exploit […]
Aug, 26

Multi-level parallelism, global arrays, GPGPU Programming: Unify programming paradigms on Grid computing with efficiency

As technology advances, computing resources also benefit in many respects: larger capacity, increased capability and greater speed. However, with heterogeneously distributed resources in a Grid computing environment, developing an application that fully utilizes the resources is a challenge. In particular, the computing resources themselves regularly upgrade their computing power, for example by recruiting General […]
Aug, 25

TH-1: China’s first petaflop supercomputer

In recent years, heterogeneous systems and cooperative computing have become popular research directions in the field of high performance computing. With fast scaling of the size of high performance computer systems, problems such as power consumption and reliability come to the forefront. The aim of high performance computing has thus shifted from merely seeking peak […]

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors