high performance computing on graphics processing units: hgpu.org

Posts

Jun, 28

An Embedding Method for Interactive Simulation on Dynamic Surfaces

Numerical simulation on curved surfaces enables the dynamic texturing of three-dimensional objects. In this thesis, I introduce methods for the realtime simulation and visualization of intrinsic fluid dynamics on deforming surfaces. These novel techniques support arbitrary, including open or nonorientable, surfaces and are universally applicable to a wide range of partial differential equation (PDE) problems […]

Jun, 28

Towards Rapid Prototyping of Parallel and HPC Applications (GPU Focus)

Developing on highly parallel architectures is hard, time consuming, error prone, takes a lot of developers’ focus and effort to producing a production quality application. This is counter productive and results are unknown in advance whether it is worth it to go through such experience. In this work, we will take a complete overview of […]

CUDA

Jun, 28

High-Performance Computing using GPUs

In the last few years, emergence of High-Performance Computing has largely influenced computer technology in the field of financial analytics, data mining, image/signal processing, simulations and modeling etc. Multi-threading, hyper-threading and other parallel programming technologies, multicore machines, clusters etc. have helped achieve high performance with high availability and high throughput. However, hybrid clusters have been […]

Jun, 28

GPU-CC: a Reconfigurable GPU Architecture with Communicating Cores

GPUs have evolved to programmable, energy efficient compute accelerators for massively parallel applications. Still, compute power is lost in many applications because of cycles spent on data movement and control instead of computations on actual data. Additional cycles can be lost as well on pipeline stalls due to long latency operations. To improve performance and […]

CUDA

Jun, 28

Point-wise Adaptive Filtering for Fast Monte Carlo Noise Reduction

Monte Carlo based photorealistic image synthesis has proven to be one of the most flexible and powerful rendering techniques, but is plagued with undesirable artifacts known as Monte Carlo noise. We present an adaptive filtering method designed for Monte Carlo rendering systems that counteracts noise while respecting sharp features. The filter operates as a post-process […]

CUDA

Jun, 27

The 28th IEEE International Parallel & Distributed Processing Symposium, IPDPS 2014

Authors are invited to submit manuscripts that present original unpublished research in all areas of parallel and distributed processing, including the development of experimental or commercial systems. Work focusing on emerging technologies is especially welcome. Topics of interest include, but are not limited to: Parallel and distributed algorithms, focusing on topics such as: numerical, combinatorial, […]

Jun, 26

Optimization procedures during parallelization of specialized software for fluid flow simulations

Modern fluid flow simulations can be extremely complex and computationally demanding. Using GPU devices (Graphics Processing Unit) they can execute up to several tens of times faster and simulations can be observed interactively. In this study the basic principles of GPU programming are applied to the implementation of lattice Boltzmann (LB) method. The software that […]

CUDA

Jun, 26

Modeling of the behavior of 222 Rn progeny in diffusion chamber using CUDA

Parallel program has been developed for simulation of Radon progeny behavior in the diffusion chamber. The program executes on general purpose graphics processing unit based on CUDA platform. Algorithm of the sequential version based on Brownian motion and diffusion has been rewritten for parallel use. During development, special attention has been paid to instruction execution […]

CUDA

Jun, 26

Large Scale Physical Modeling Sound Synthesis

Sound synthesis based on physical models of musical instruments is, ultimately, an exercise in numerical simulation. As such, for complex systems of the type seen in musical acoustics, simulation can be a computationally costly undertaking, particularly if simplifying hypotheses, such as those of traveling wave or mode decompositions are not employed. In this paper, large […]

CUDA

Jun, 26

Using of New Possibilities of Fermi Architecture by Development of GPGPU Programs

Description of additional functions of hardware and software, which are presented in the structure of new architecture of FERMI graphic processors made by company NVIDIA, was given. Recommendations of their use within the realization of algorithms of scientific and technical calculations by means of the graphic processors were given. Application of the new possibilities of […]

CUDA

Jun, 26

GPU-Accelerated Real-Time Visualization and Interaction for Coupled Fluid Dynamics

For real-time applications (dynamic data-driven applications systems like computer-assisted surgery, command and control, etc.), it is necessary to design fast or strongly-accelerated computational approaches. Reduced-order modeling (ROM) is a candidate methodology that summarizes all the parameter-dependent PDE solutions into an easy-to-compute condensed form. ROM usually requires an offline learning process that returns the essential components […]

CUDA

Jun, 25

Room acoustics modelling using GPU-accelerated finite difference and finite volume methods on a face-centered cubic grid

In this paper, a room acoustics simulation using a finite difference approximation on a face-centered cubic (FCC) grid with finite volume impedance boundary conditions is presented. The finite difference scheme is accelerated on an Nvidia Tesla K20 graphics processing unit (GPU) using the CUDA programming language. A performance comparison is made between 27-point finite difference […]

CUDA

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

* * *

high performance computing on graphics processing units: hgpu.org

Posts

An Embedding Method for Interactive Simulation on Dynamic Surfaces

Towards Rapid Prototyping of Parallel and HPC Applications (GPU Focus)

High-Performance Computing using GPUs

GPU-CC: a Reconfigurable GPU Architecture with Communicating Cores

Point-wise Adaptive Filtering for Fast Monte Carlo Noise Reduction

The 28th IEEE International Parallel & Distributed Processing Symposium, IPDPS 2014

Optimization procedures during parallelization of specialized software for fluid flow simulations

Modeling of the behavior of 222 Rn progeny in diffusion chamber using CUDA

Large Scale Physical Modeling Sound Synthesis

Using of New Possibilities of Fermi Architecture by Development of GPGPU Programs

GPU-Accelerated Real-Time Visualization and Interaction for Coupled Fluid Dynamics

Room acoustics modelling using GPU-accelerated finite difference and finite volume methods on a face-centered cubic grid

Recent source codes

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Most viewed papers (last 30 days)