high performance computing on graphics processing units: hgpu.org

Posts

Jun, 29

Adaptation of algorithms for underwater sonar data processing to GPU-based systems

In this master's thesis, algorithms for acoustic simulations in underwater environments are ported to GPUs. The GPU parallel computing platforms used are CUDA, OpenCL and SkePU. The purpose of the thesis is to adapt the algorithms and evaluate the ported versions' performance on two modern NVIDIA GPUs, the Tesla K20 and the Quadro K5000. Several optimizations, described […]

CUDA, OpenCL

Jun, 29

Hinomiyagura Infrastructure Competition TDP: Platform of rescue simulation using GPGPU

We propose a new platform that consists of a new traffic simulator and a scenario generator. The traffic simulation system uses GPGPU, which makes it possible to run rescue and evacuation simulations with more agents and at higher speed than the present system. It can also simulate agents’ motions in a three-dimensional map. Our proposal provides a platform to widen the […]

OpenCL

Jun, 29

Betatron tune measurement with the LHC damper using a GPU

This thesis studies a possible future implementation of a betatron tune measurement in the Large Hadron Collider (LHC) at the European Organization for Nuclear Research (CERN) using a General Purpose Graphics Processing Unit (GPGPU) to analyse data acquired with the LHC transverse damper (ADT). The present hardware and future possible implementations using ADT acquisitions and […]

CUDA

Jun, 29

Efficient computation of constrained parameterizations on parallel platforms

Constrained isometric planar parameterizations are central to a broad spectrum of applications. In this work, we present a nonlinear solver developed on OpenCL that is efficiently parallelizable on modern massively parallel architectures. We establish how parameterization relates to mesh smoothing and show how to efficiently and robustly solve the planar mesh parameterization problem with […]

OpenCL

Jun, 28

An Embedding Method for Interactive Simulation on Dynamic Surfaces

Numerical simulation on curved surfaces enables the dynamic texturing of three-dimensional objects. In this thesis, I introduce methods for the realtime simulation and visualization of intrinsic fluid dynamics on deforming surfaces. These novel techniques support arbitrary, including open or nonorientable, surfaces and are universally applicable to a wide range of partial differential equation (PDE) problems […]

Jun, 28

Towards Rapid Prototyping of Parallel and HPC Applications (GPU Focus)

Developing on highly parallel architectures is hard, time-consuming and error-prone, and it takes a lot of developers’ focus and effort to produce a production-quality application. This is counterproductive, and it is not known in advance whether the results are worth going through such an experience. In this work, we will take a complete overview of […]

CUDA

Jun, 28

High-Performance Computing using GPUs

In the last few years, the emergence of High-Performance Computing has largely influenced computer technology in the fields of financial analytics, data mining, image/signal processing, simulation and modeling, etc. Multi-threading, hyper-threading and other parallel programming technologies, multicore machines, clusters, etc. have helped achieve high performance with high availability and high throughput. However, hybrid clusters have been […]

Jun, 28

GPU-CC: a Reconfigurable GPU Architecture with Communicating Cores

GPUs have evolved to programmable, energy efficient compute accelerators for massively parallel applications. Still, compute power is lost in many applications because of cycles spent on data movement and control instead of computations on actual data. Additional cycles can be lost as well on pipeline stalls due to long latency operations. To improve performance and […]

CUDA

Jun, 28

Point-wise Adaptive Filtering for Fast Monte Carlo Noise Reduction

Monte Carlo based photorealistic image synthesis has proven to be one of the most flexible and powerful rendering techniques, but is plagued with undesirable artifacts known as Monte Carlo noise. We present an adaptive filtering method designed for Monte Carlo rendering systems that counteracts noise while respecting sharp features. The filter operates as a post-process […]

CUDA

Jun, 27

The 28th IEEE International Parallel & Distributed Processing Symposium, IPDPS 2014

Authors are invited to submit manuscripts that present original unpublished research in all areas of parallel and distributed processing, including the development of experimental or commercial systems. Work focusing on emerging technologies is especially welcome. Topics of interest include, but are not limited to: Parallel and distributed algorithms, focusing on topics such as: numerical, combinatorial, […]

Jun, 26

Optimization procedures during parallelization of specialized software for fluid flow simulations

Modern fluid flow simulations can be extremely complex and computationally demanding. On GPUs (graphics processing units), they can execute up to several tens of times faster, and the simulations can be observed interactively. In this study, the basic principles of GPU programming are applied to the implementation of the lattice Boltzmann (LB) method. The software that […]

CUDA

Jun, 26

Modeling of the behavior of ²²²Rn progeny in diffusion chamber using CUDA

A parallel program has been developed for simulating radon progeny behavior in a diffusion chamber. The program executes on a general-purpose graphics processing unit using the CUDA platform. The algorithm of the sequential version, based on Brownian motion and diffusion, has been rewritten for parallel use. During development, special attention has been paid to instruction execution […]

CUDA
