high performance computing on graphics processing units: hgpu.org

Posts

Aug, 12

A Parallel Algorithm for Calculation of Large Determinants with High Accuracy for GPUs and MPI clusters

We present a parallel algorithm for calculating very large determinants with arbitrary precision on computer clusters. This algorithm minimises data movements between the nodes and computes not only the determinant but also all minors corresponding to a particular row or column at a little extra cost, and also the determinants and minors of all submatrices […]
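As a rough illustration of what "arbitrary precision" buys for determinants, the sketch below computes a determinant and the minors along one row using exact rational arithmetic. It is a serial, naive sketch and not the parallel algorithm from the paper; the function names `exact_det` and `row_minors` are hypothetical, and the minors are recomputed from scratch rather than obtained at little extra cost as the abstract describes.

```python
from fractions import Fraction


def exact_det(matrix):
    """Determinant via Gaussian elimination over exact rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    n = len(a)
    det = Fraction(1)
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero pivot.
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)  # singular matrix
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def row_minors(matrix, row):
    """Minors from deleting `row` and each column in turn (naive recomputation)."""
    n = len(matrix)
    rest = [list(r) for i, r in enumerate(matrix) if i != row]
    return [exact_det([r[:j] + r[j + 1:] for r in rest]) for j in range(n)]
```

With entries near 10^20, such as `[[10**20 + 1, 10**20], [10**20, 10**20 - 1]]`, double-precision floats cannot represent the intermediate products exactly, whereas the rational version returns the exact determinant.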

CUDA

Aug, 10

Heterogeneous Computing for Real-Time Stereo Matching

Stereo matching is used in many computer vision applications such as 3D reconstruction, robot navigation, robotic surgery, 3D video surveillance, and tracking objects in 3D space. Real-time stereo matching is difficult due to the heavy computation required for matching algorithms. In this paper, a CPU/GPU heterogeneous computing platform is used to accelerate the processing […]

CUDA

Aug, 10

Analyzing Optimization Techniques for Power Efficiency on Heterogeneous Platforms

Graphics processing units (GPUs) have become widely accepted as the computing platform of choice in many high performance computing domains. Programming standards such as OpenCL are used to leverage the inherent parallelism offered by GPUs. Source code optimizations such as loop unrolling and tiling, when targeted to heterogeneous applications, have reported large […]

OpenCL

Aug, 10

PixelPie: Maximal Poisson-disk Sampling with Rasterization

We present PixelPie, a highly parallel geometric formulation of the Poisson-disk sampling problem on the graphics pipeline. Traditionally, generating a distribution by throwing darts and removing conflicts has been viewed as an inherently sequential process. In this paper, we present an efficient Poisson-disk sampling algorithm that uses rasterization in a highly parallel manner. Our technique […]
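For context, the traditional sequential approach mentioned above (throwing darts and rejecting conflicts) can be sketched as follows. This is a minimal illustration of the classic baseline, not PixelPie's rasterization-based method; the function name `dart_throwing`, the unit-square domain, and the fixed attempt budget are assumptions made for the example, and a fixed budget does not guarantee a maximal sampling.

```python
import math
import random


def dart_throwing(radius, attempts, seed=0):
    """Sequential dart throwing in the unit square (hypothetical helper)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(attempts):
        x, y = rng.random(), rng.random()
        # Reject the dart if it lands within `radius` of an accepted sample.
        if all(math.hypot(x - sx, y - sy) >= radius for sx, sy in samples):
            samples.append((x, y))
    return samples


points = dart_throwing(radius=0.1, attempts=2000)
```

Each new dart depends on every previously accepted sample, which is why the process has been viewed as inherently sequential.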

CUDA • OpenGL

Aug, 10

Sequence Data Indexing Method Exploiting the Parallel Processing Resources of GPGPU

In this paper, we propose a new sequence retrieval method that utilizes the parallel processing power of General-Purpose Computation on Graphics Processing Units (GPGPU). Although GPGPU offers high parallel computing power, its SIMD nature means a method must be carefully designed to attain that power. Existing CPU-based sequence retrieval method does […]

CUDA

Aug, 10

Finding Next Best Views for Autonomous UAV Mapping through GPU-Accelerated Particle Simulation

This paper presents a novel algorithm capable of generating multiple next best views (NBVs), sorted by achievable information gain. Although designed for waypoint generation in autonomous airborne mapping of outdoor environments, it works directly on raw point clouds and thus can be used with any sensor generating spatial occupancy information (e.g. LIDAR, Kinect or […]

CUDA • OpenGL

Aug, 9

GPU-Based Real-Time Imaging Software Suite for Medical Ultrasound

We developed a GPU-based real-time imaging software suite for medical ultrasound imaging to provide a fast real-time imaging platform for various probe geometries and imaging schemes. The imaging software receives raw RF data from a data acquisition system and processes it on the GPU to reconstruct real-time images. The most general-purpose imaging program in the suite […]

CUDA

Aug, 9

A GPU implementation of massively parallel direction splitting for the incompressible Navier-Stokes equations

Guermond and Minev proposed a directional splitting algorithm to solve the incompressible Stokes equations. Their algorithm applies the alternating direction implicit method to the viscosity term. The pressure update uses a direction splitting method in order to enforce the incompressibility constraint, as opposed to commonly used projection methods that require the solution of a Poisson […]

OpenCL • OpenGL

Aug, 9

A GPGPU based program to solve the TDSE in intense laser fields through the finite difference approach

We present a general-purpose computing on graphics processing units (GPGPU) based computational program and framework for the electronic dynamics of atomic systems under intense laser fields. We present our results using the case of hydrogen; however, the code is trivially extensible to tackle problems within the single-active electron (SAE) approximation. Building on our previous work, […]

OpenCL

Aug, 9

Multi-core CPU or GPU-accelerated Multiscale Modeling for Biomolecular Complexes

Multi-scale modeling plays an important role in understanding the structure and biological functionalities of large biomolecular complexes. In this paper, we present an efficient computational framework to construct multi-scale models from atomic resolution data in the Protein Data Bank (PDB), which is accelerated by multi-core CPUs and programmable graphics processing units (GPUs). A multi-level summation […]

CUDA

Aug, 9

High-Performance Holistic XML Twig Filtering Using GPUs

The current state of the art in information dissemination consists of publishers broadcasting XML-coded documents, which are in turn selectively forwarded to interested subscribers. The deployment of XML at the heart of this setup greatly increases the expressive power of the profiles listed by subscribers, using the XPath language. On the other hand, with great expressive power comes […]

CUDA

Aug, 8

GPU implementation of a shell element structural solver aimed at fluid-structure interaction problems

The study of thin structures is very common nowadays and useful in different fields. An important example is the analysis of sail dynamics. In this context, accurate simulations of the interaction between the sail and the wind are also required. However, fluid-structure interaction problems of this kind are very computationally expensive. The first objective of this […]

CUDA

* * *

Recent source codes

Specx: Speculative task-based runtime system

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

KISim: Kubernetes Intelligent Scheduling Simulator

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

exa-AMD: Exascale Accelerated Materials Discovery

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler
