high performance computing on graphics processing units: hgpu.org

Posts

Jan, 11

Toward a Generic Hybrid CPU-GPU Parallelization of Divide-and-Conquer Algorithms

In the last few years, the development of programming languages for general-purpose computing on Graphics Processing Units (GPUs) has led to the design and implementation of fast parallel algorithms for this architecture for a large spectrum of applications. Given the streaming-processing characteristics of GPUs, most practical applications consist of tasks that admit highly data-parallel […]

OpenCL

Jan, 10

An octree-based proxy for collision detection in large-scale particle systems

Particle systems are an important building block for simulating vivid and detail-rich effects in virtual worlds. One of the most difficult aspects of particle systems has been detecting collisions between particles and mesh surfaces. Because of the heavy computation involved, a variety of proxy-based approaches have been proposed recently to perform visually correct simulation. However, all either limit […]

CUDA

Jan, 10

Integrating Occlusion Culling with Parallel LOD for Rendering Complex 3D Environments on GPU

Real-time rendering of complex 3D models is still a very challenging task. Recently, many GPU-based level-of-detail (LOD) algorithms have been proposed to decrease the complexity of 3D models in a parallel fashion. However, LOD approaches alone are not sufficient to reduce the amount of geometry data for interactive rendering of massive-scale models. Visibility-based culling, […]

CUDA • OpenGL

Jan, 10

Saddle Vertex Graph (SVG): A Novel Solution to the Discrete Geodesic Problem

This paper presents the Saddle Vertex Graph (SVG), a novel solution to the discrete geodesic problem. The SVG is a sparse undirected graph that encodes complete geodesic distance information: a geodesic path on the mesh is equivalent to a shortest path on the SVG, which can be solved efficiently using the shortest path algorithm (e.g., […]

CUDA

Jan, 10

GPU Accelerated 3-D Modeling and Simulation of a Blended Kinetic Impact and Nuclear Subsurface Explosion

This paper develops a modeling and simulation framework for determining mission effectiveness of a two-body Hypervelocity Asteroid Intercept Vehicle (HAIV). This HAIV concept has been developed to overcome key difficulties in coupling energy from nuclear explosives to an asteroid target at high relative velocities of 5–30 km/s. It does so by blending […]

CUDA

Jan, 10

Parallel Algorithm for BSDEs Based High Dimensional American Option Pricing on the GPU

In this paper, we explore the opportunity for solving high-dimensional Backward Stochastic Differential Equations (BSDEs) on the GPU, with application to high-dimensional American option pricing. A numerical algorithm for solving the BSDEs, based on the Least Squares Monte Carlo method, is studied and summarized in four phases. For the parallel GPU algorithms of different phases, […]

CUDA

Jan, 9

Nengo: a Python tool for building large-scale functional brain models

Neuroscience currently lacks a comprehensive theory of how cognitive processes can be implemented in a biological substrate. The Neural Engineering Framework (NEF) proposes one such theory, but has not yet gathered significant empirical support, partly due to the technical challenge of building and simulating large-scale models with the NEF. Nengo is a software tool that […]

OpenCL

Jan, 9

On-the-fly Generation and Rendering of Infinite Cities on the GPU

In this paper, we present a new approach for shape-grammar-based generation and rendering of huge cities in real-time on the graphics processing unit (GPU). Traditional approaches rely on evaluating a shape grammar and storing the geometry produced as a preprocessing step. During rendering, the pregenerated data is then streamed to the GPU. By interweaving generation […]

CUDA • OpenGL

Jan, 9

Efficient Emission Computation in Hidden Semi-Markov Models on Diverse Hardware

Hidden Semi-Markov Models (HSMMs) are powerful generalizations of Hidden Markov Models that have been effectively employed in tasks such as machine translation and optical character recognition. A principal computational bottleneck in these systems as applied to optical character recognition [5] is the need to compute emission probabilities for a large number of possible model states. […]

CUDA

Jan, 9

Real Time KAP Systems for Image Enhancement/Reconstruction of Remote Sensing Imagery

In this study, an implementation of a Kite Aerial Photography (KAP) system for real-time image enhancement/reconstruction of remote sensing (RS) imagery is presented. The system comprises three stages: first, a gyro-stabilized mechatronic platform for the image acquisition is developed; second, the multispectral images are transmitted via RF to a ground station; and […]

CUDA

Jan, 9

Fast Effective Deterministic Primality Test Using CUDA/GPGPU

There is great interest in understanding the manner in which the prime numbers are distributed throughout the integers. Prime numbers have been used in secret codes for more than 60 years. Computer security authorities use extremely large prime numbers when they devise cryptosystems, such as the RSA (short for Rivest, Shamir, and Adleman) algorithm, for protecting […]

CUDA

Jan, 6

Multicore Computing: Algorithms, Architectures, and Applications

Every area of science and engineering today has to process voluminous data sets. Using exact, or even approximate, algorithms to solve intractable problems in critical areas, such as computational biology, takes time that is exponential in some of the underlying parameters. Parallel computing addresses this issue and has become affordable with the advent of multicore […]

CUDA