high performance computing on graphics processing units: hgpu.org

Posts

May, 26

Parallel Parametric Optimisation with Firefly Algorithms on Graphical Processing Units

Parametric optimisation techniques such as Particle Swarm Optimisation (PSO), Firefly algorithms (FAs), genetic algorithms (GAs) are at the centre of attention in a range of optimisation problems where local minima plague the parameter space. Variants of these algorithms deal with the problems presented by local minima in a variety of ways. A salient feature in […]

CUDA

May, 26

Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born

We present an implementation of generalized Born implicit solvent all-atom classical molecular dynamics (MD) within the AMBER program package that runs entirely on CUDA enabled NVIDIA graphics processing units (GPUs). We discuss the algorithms that are used to exploit the processing power of the GPUs and show the performance that can be achieved in comparison […]

CUDA

May, 26

Fast and accurate digital signal processing realized with GPGPU technology

An idea of the so-called quasi-maximum accuracy computations for improvement of precision of the floating-point digital signal processing with graphic processing units (GPUs) is presented in this paper. In the presented approach, the increase of the precision of computations does not need any increase of the length of the data words. Special attention has been […]

CUDA

May, 26

Parallelization of the Local Threshold and Boolean Function Based Edge Detection Algorithm Using CUDA

In this paper we present a parallelized algorithm for edge detection for gray scale images. The chosen method is the local threshold and boolean function based edge detection. This method differs from common edge detectors in the use of bit map patterns instead of analyzing gradient changes in the image for edge recognition. The parallelization […]

CUDA

May, 25

The Third International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering, PARENG2013

The conference will consider mathematical, computer science and engineering developments that impact on the use of HPC in engineering analysis, design, and simulation. Engineering is interpreted in its widest sense to include aeronautical, civil, mechanical, electrical, materials, bioengineering, geotechnical, structural and environmental fields. The range of topics considered by the Conference will include: The mathematical […]

May, 25

The 3rd International Workshop of GPU Solutions to Multiscale Problems in Science and Engineering, 2012, GPU-SMP’ 2012

This international conference in Shenzhen will focus on understanding the potential usage of GPU and MIC from a computational scientific user point of view, particularly for multiscale problems in science on engineering. It brings together experts from China, Japan, and bordering Pacific countries such as the USA, Korea，Australia and Singapore. In addition to algorithmic research, […]

May, 25

Using Compute Unified Device Architecture (CUDA) in Parallelizing Different Digital Image Processing Techniques

Graphics Processing Units (GPUs) have been conventionally used in the acceleration of 2D, 3D graphics and video rendering. Because of its performance and capability, the GPU has evolved into a highly parallel programmable processor that specializes in memory bandwith utilization and intensive computation. For operations involving graphics, GPUs offer a lot of gigaflops of processing […]

CUDA

May, 25

On the Simulations of Evolution-Communication P Systems with Energy without Antiport Rules for GPUs

In this report, we present our initial proposal on simulating computations on a restricted variant of Evolution-Communication P system with energy (ECPe system) which will then be implemented in Graphics Processing Units (GPUs). This ECPe systems variant prohibits the use of antiport rules for communication. Several possible levels of parallelizations for simulating ECPe systems computations […]

CUDA

May, 25

Effective Sparse Matrix Representation for the GPU Architectures

General purpose computation on graphics processing unit (GPU) is prominent in the high performance computing era of this time. Porting or accelerating the data parallel applications onto GPU gives the default performance improvement because of the increased computational units. Better performances can be seen if application specific fine tuning is done with respect to the […]

CUDA

May, 25

Accelerating In-Memory Graph Database traversal using GPGPUS

The paper aims to provide a comparitive analysis on the performance of in memory databases as opposed to a customised graph database written ground up whose joins(searches) are performed on a GPGPU. This is done primarily to serve as a proof of concept on how databases that are represented as graphs can benefit by fostering […]

CUDA

May, 25

Parallel simulation of mixed-abstraction SystemC models on GPUs and multicore CPUs

This work presents a methodology that parallelizes the simulation of mixed-abstraction level SystemC models across multicore CPUs, and graphics processing units (GPUs) for improved simulation performance. Given a SystemC model, we partition it into processes suitable for GPU execution and CPU execution. We convert the processes identified for GPU execution into GPU kernels with additional […]

CUDA

May, 24

Java on CUDA architecture

Traditional CPU is able to run only a few complex threads concurrently. On the other side, a GPU allows a concurrent execution of hundreds or thousands of simpler threads. The GPU was originally designed for a computer graphics, but nowadays it is being used for general-purpose calculations using a GPGPU technology. CUDA, one of the […]

CUDA

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations

microSYCL: SYCL micro-benchmarks repository

Exploring SYCL as a Portability Layer for High-Performance Computing on CPUs

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Posts

Parallel Parametric Optimisation with Firefly Algorithms on Graphical Processing Units

Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born

Fast and accurate digital signal processing realized with GPGPU technology

Parallelization of the Local Threshold and Boolean Function Based Edge Detection Algorithm Using CUDA

The Third International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering, PARENG2013

The 3rd International Workshop of GPU Solutions to Multiscale Problems in Science and Engineering, 2012, GPU-SMP’ 2012

Using Compute Unified Device Architecture (CUDA) in Parallelizing Different Digital Image Processing Techniques

On the Simulations of Evolution-Communication P Systems with Energy without Antiport Rules for GPUs

Effective Sparse Matrix Representation for the GPU Architectures

Accelerating In-Memory Graph Database traversal using GPGPUS

Parallel simulation of mixed-abstraction SystemC models on GPUs and multicore CPUs

Java on CUDA architecture

Recent source codes

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

exa-AMD: Exascale Accelerated Materials Discovery

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

Most viewed papers (last 30 days)