high performance computing on graphics processing units: hgpu.org

Posts

Nov, 26

Iterative Solution of Linear Systems in Electromagnetics (and not only): Experiences with CUDA

In this paper, we propose the use of graphics processing units as a low-cost and efficient solution of electromagnetic (and other) numerical problems. Based on the software platform CUDA (Compute Unified Device Architecture), a solver for unstructured sparse matrices with double precision complex data has been implemented and tested for several practical cases. Benchmark results […]

CUDA

Nov, 26

A framework for network traffic analysis using GPUs

During the last years the computer networks have become an important part of our society. Networks have kept growing in size and complexity, making more complex its management and traffic monitoring and analysis processes, due to the huge amount of data and calculations involved. In the last decade, several researchers found effective to use graphics […]

CUDA

Nov, 26

Fast Hydraulic and Thermal Erosion on GPU

Computer games, TV series, movies, simulators, and many other computer graphics applications use external scenes where a realistic looking terrain is a vital part of the viewing experience. Creating such terrains is a challenging task. In this paper we propose a method that generates realistic virtual terrains by simulation of hydraulic and thermal erosion on […]

Nov, 26

A Short Note on Gaussian Process Modeling for Large Datasets using Graphics Processing Units

The graphics processing unit (GPU) has emerged as a powerful and cost effective processor for high performance computing. GPUs are capable of an order of magnitude more floating point operations per second as compared to modern central processing units (CPUs), and thus provide a great deal of promise for computationally intensive statistical applications (Brodtkorb et […]

CUDA

Nov, 26

High performance system for the Interactive rendering of a 3D Model into MPEG-4

The goal of the project is to create a Multiprocessor system capable of rendering a 3D model into an MPEG-4 stream. This paper outlines the design, software architecture and hardware setup for the system. Preliminary success in the previous setup[1] helped us gain experience as well as motivation for this highly optimized and powerful second […]

CUDA

•

OpenGL

Nov, 26

High Performance Iterative Solver for Linear System using Multi GPU

The variable preconditioned (VP) Krylov subspace method on multi Graphics Processing Unit (GPU) is numerically investigated. Besides, the linear system obtained by finite element method with an edge element is adopted for the problem. The results of computations show that VP conjugate gradient method on multi GPU demonstrated significant achievement than that of CPU. Especially, […]

CUDA

Nov, 26

Partitioned Memory Parallel Programming Framework

We present a framework for parallel programming. It consists of a distributed shared memory based simplified programming model, which leaves the application developer to focus mainly on task decomposition. This is a unified model for many-core processors (e.g., CPUs and GPUs), multiple processors on a system, as well as multiple systems. We also present a […]

Nov, 26

Scaling Results for a Discontinuous Galerkin Finite-Element Wave Solver on Multi-GPU Systems

Among the various techniques for solving hyperbolic partial differential equations with inhomogeneous, irregularly-shaped domains, a relatively new type of finite element method has grown in popularity because of its flexibility and scalability across many parallel cores. Discont inuous Galerkin (DG) methods have shown themselves to be an effective scheme for the simulation of wave-propagation problems […]

CUDA

Nov, 25

Seismic Wave Propagation Simulation Using Support Operator Method on multi-GPU system

The method of Support Operator(SOM) is a numerical method based on finite difference method. We use SOM to simulate seismic wave propagation by solving the three dimension viscoelastic equations. The Support Operator Rupture Dynamics(SORD) has been proved to be highly scalable in large-scale multi-processors computing. This paper discusses accelerating SORD on multi-GPU system using NVIDIA […]

CUDA

Nov, 25

Parallel GMRES implementation for solving sparse linear systems on GPU clusters

In this paper, we propose an efficient parallel implementation of the GMRES method for GPU clusters. This implementation requires us to parallelize the GMRES algorithm between the CPUs of the cluster. Hence, all parallel and intensive computations on local data are performed on GPUs and reduction operations to compute global results are carried out by […]

CUDA

Nov, 25

Design of 3D FFT on Multi-GPU Clusters

Turbulence simulations are computationally expensive and typically take weeks to complete. With the advent of GPGPU(General Purpose computing on GPUs), one can achieve significant speedup. Spectral method is one of the several known methods used in turbulence simulations. It is based on Fast Fourier Transforms(FFT) to an extent that it spends 70-80% time in calculating […]

CUDA

Nov, 25

Fastplay: A Parallelization Model and Implementation of SMC on CUDA Based GPU Cluster Architecture

We propose a four-tiered parallelization model for acceleration of the secure multiparty computation (SMC) on the CUDA based Graphic Processing Unit (GPU) cluster architecture. Specification layer is the top layer, which adopts the SFDL of Fairplay for specification of secure computations. The SHDL file generated by the SFDL compiler of Fairplay is used as inputs […]

CUDA

* * *

high performance computing on graphics processing units: hgpu.org

Posts

Iterative Solution of Linear Systems in Electromagnetics (and not only): Experiences with CUDA

A framework for network traffic analysis using GPUs

Fast Hydraulic and Thermal Erosion on GPU

A Short Note on Gaussian Process Modeling for Large Datasets using Graphics Processing Units

High performance system for the Interactive rendering of a 3D Model into MPEG-4

High Performance Iterative Solver for Linear System using Multi GPU

Partitioned Memory Parallel Programming Framework

Scaling Results for a Discontinuous Galerkin Finite-Element Wave Solver on Multi-GPU Systems

Seismic Wave Propagation Simulation Using Support Operator Method on multi-GPU system

Parallel GMRES implementation for solving sparse linear systems on GPU clusters

Design of 3D FFT on Multi-GPU Clusters

Fastplay: A Parallelization Model and Implementation of SMC on CUDA Based GPU Cluster Architecture

Recent source codes

Specx: Speculative task-based runtime system

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

KISim: Kubernetes Intelligent Scheduling Simulator

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

exa-AMD: Exascale Accelerated Materials Discovery

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

Most viewed papers (last 30 days)