high performance computing on graphics processing units: hgpu.org

Posts

Jun, 27

Efficient implementation of the overlap operator on multi-GPUs

Lattice QCD calculations were one of the first applications to show the potential of GPUs in the area of high performance computing. Our interest is to find ways to effectively use GPUs for lattice calculations using the overlap operator. The large memory footprint of these codes requires the use of multiple GPUs in parallel. In […]

Jun, 26

Scientific and Engineering Computing Using ATI Stream Technology

This continuing exploration of GPU technology examines ATI Stream technology and its use in scientific and engineering applications.

Jun, 26

Improving processing time for visual measurements of displacements of IPMC actuators using CUDA

This work presents a Graphics Processing Unit (GPU) based measuring system that processes camera images to provide displacement information. The proposed approach has been developed for measuring small movements of micro robotic systems based on synthetic IPMC (Ionic Polymer Metal Composites) materials using CUDA (Compute Unified Device […]

CUDA

Jun, 26

A Personal Surround Environment: Projective Display with Correction for Display Surface Geometry and Extreme Lens Distortion

Projectors equipped with wide-angle lenses can have an advantage over traditional projectors in creating immersive display environments since they can be placed very close to the display surface to reduce user shadowing issues while still producing large images. However, wide-angle projectors exhibit severe image distortion requiring the image generator to correctively pre-distort the output image. […]

Jun, 26

A Low-Cost Solution For Excavator Simulation With Realistic Visual Effect

A low-cost excavator simulator has been developed in this paper for training human operators and evaluating control strategies for heavy-duty hydraulic machines. In such a system, the operator controls a virtual excavator by means of a joystick while experiencing realistic operating feelings through force feedback, graphical displays, and sound effects in virtual operating environments. A […]

Jun, 26

Importance-Driven Particle Techniques for Flow Visualization

Particle tracing has been established as a powerful visualization technique to show the dynamics of 3D flows. Particle tracing in 3D, however, quickly overextends the viewer due to the massive amount of visual information that is typically produced by this technique. In this paper, we present strategies to reduce this amount at the same time […]

Jun, 26

Visual Simulation of Breaking Waves in Shallow Water

We introduce a composite method for simulating and rendering breaking waves in computer graphics applications. The method generates not only a simple wave belt, as in previous work, but also a model that can be manipulated by the influence of the wind. The particle system we adopted combines the POINT primitive as the basic particle and a fat particle as the characteristic particle […]

Jun, 26

Kd-Jump: a Path-Preserving Stackless Traversal for Faster Isosurface Raytracing on GPUs

Stackless traversal techniques are often used to circumvent memory bottlenecks by avoiding a stack and replacing return traversal with extra computation. This paper addresses whether the stackless traversal approaches are useful on newer hardware and technology (such as CUDA). To this end, we present a novel stackless approach for implicit kd-trees, which exploits the benefits […]

CUDA

Jun, 26

A real-time coarse-to-fine multiview capture system for all-in-focus rendering on a light-field display

We present an end-to-end system capable of capturing and displaying, in real time and with full horizontal parallax, high-quality 3D video content on a cluster-driven multiprojector light-field display. The capture component is an array of low-cost USB cameras connected to a single PC. Raw M-JPEG data coming from the software-synchronized cameras are multicast over Gigabit Ethernet to the back-end nodes […]

CUDA • OpenGL

Jun, 26

Acceleration of the Method of Moments Calculations by Using Graphics Processing Units

The graphics processing unit (GPU) has been used to speed up conventional method of moments (MoM) calculations for electromagnetic scattering from arbitrary three-dimensional conducting objects. The acceleration ratio for filling the impedance matrix reaches 30, while the total acceleration ratio (including iteration) is about 20. Moreover, a matrix splitting algorithm is developed to break […]

Jun, 26

CUDA-based acceleration and algorithm refinement for volume image registration

In this paper, we propose a GPU-based acceleration method to speed up volume image registration using Compute Unified Device Architecture (CUDA). We introduce a novel CUDA-based method for joint histogram computation, which is also valuable for 2D image registration and other general graphics applications. Additionally, an algorithm refinement is proposed to improve the […]

CUDA

Jun, 25

Parallel Processing for Normal Mixture Models of Hyperspectral Data Using a Graphics Processor

Multivariate normal mixture models, where a complex statistical distribution is represented by a weighted sum of several multivariate normal probability distributions, have many potential applications including anomaly detection (AD) in hyperspectral (HS) images. The high computational cost of mixture models requires hardware and/or algorithmic acceleration to make AD run in real time. In this paper […]

CUDA

Recent source codes

CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization

LC Framework

pplx-garden: Perplexity open source garden for inference technology

Atlas CLI: Machine Learning (ML) Lifecycle & Transparency Manager

transformers_tvm: Implementation of Encoder Decoder transformer on TVM

OpScanner

INT v.s. FP: A framework to compare low-bit integer and float-point formats

AutoDock-GPU: AutoDock for GPUs and other accelerators

NCCLX: collective communication framework

Tutoring LLM into a Better CUDA Optimizer
