high performance computing on graphics processing units: hgpu.org

Posts

Dec, 28

Efficient parallel lists intersection and index compression algorithms using graphics processing units

Major web search engines answer thousands of queries per second requesting information about billions of web pages. The data sizes and query loads are growing at an exponential rate. To manage the heavy workload, we consider techniques for utilizing a Graphics Processing Unit (GPU). We investigate new approaches to improve two important operations of search […]

CUDA

Dec, 28

Real-Time Rendering of Temporal Volumetric Data on a GPU

Real-time rendering of static volumetric data is generally known to be a memory- and computation-intensive process. With advances in graphics hardware, especially GPUs, it is now possible to do this on desktop computers. However, with the evolution of real-time CT and MRI technologies, volumetric rendering is an even bigger challenge. The first one […]

Dec, 27

PFAC Library: GPU-based string matching algorithm

The PFAC algorithm efficiently exploits the parallelism of the Aho-Corasick algorithm by creating an individual thread for each byte of an input stream to identify any pattern starting at the thread’s starting position. The number of threads created by the PFAC algorithm is equal to the length of an input stream.
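The thread-per-byte idea described above can be sketched on the CPU as follows. This is a minimal illustration, not the PFAC library's code or API: the function names (`build_trie`, `match_from`, `pfac_like_search`) are hypothetical, patterns are assumed to be distinct and non-empty, and an ordinary loop stands in for the GPU threads. Each "thread" walks a plain trie with no failure links, starting at its own byte, and stops at the first missing transition.

```python
def build_trie(patterns):
    """Build a trie without failure links.

    transitions[s] maps a byte value to the next state;
    outputs[s] holds the id of the pattern ending at state s, or None.
    """
    transitions = [{}]
    outputs = [None]
    for pattern_id, pattern in enumerate(patterns):
        state = 0
        for byte in pattern:
            nxt = transitions[state].get(byte)
            if nxt is None:
                transitions.append({})
                outputs.append(None)
                nxt = len(transitions) - 1
                transitions[state][byte] = nxt
            state = nxt
        outputs[state] = pattern_id
    return transitions, outputs


def match_from(data, start, transitions, outputs):
    """Work of one hypothetical thread: find patterns beginning exactly at `start`."""
    hits = []
    state = 0
    for pos in range(start, len(data)):
        state = transitions[state].get(data[pos])
        if state is None:
            break  # no valid transition: this thread terminates
        if outputs[state] is not None:
            hits.append((start, outputs[state]))
    return hits


def pfac_like_search(data, patterns):
    """Run one simulated thread per input byte and collect (start, pattern_id) pairs."""
    transitions, outputs = build_trie(patterns)
    results = []
    for start in range(len(data)):  # each iteration stands in for one GPU thread
        results.extend(match_from(data, start, transitions, outputs))
    return results
```

For example, `pfac_like_search(b"ushers", [b"he", b"she", b"hers", b"his"])` reports `she` at offset 1 and both `he` and `hers` at offset 2. Because every simulated thread only reports matches that begin at its own offset, no thread depends on another, which is what lets the per-byte work run in parallel.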

CUDA

Dec, 27

A GPU Accelerated High Performance Cloud Computing Infrastructure for Grid Computing Based Virtual Environmental Laboratory

Numerical models play a major role in the earth sciences, filling the gap between experimental and theoretical approaches. The computational approach is now widely recognized as a complement to scientific analysis. Meanwhile, the huge amount of observed/modelled data, and the need to store, process, and refine them, often makes the use of high […]

CUDA

Dec, 27

Enabling New Uses for GPUs

As graphics processing unit (GPU) architects have made their pipelines more programmable in recent years, GPUs have become increasingly general-purpose. As a result, more and more general-purpose, non-graphics applications are being ported to GPUs. Past work has focused on applications that map well to the data parallel GPU programming model. These applications are usually embarrassingly […]

CUDA

Dec, 27

A GPU implementation of EGSnrc’s Monte Carlo photon transport for imaging applications

EGSnrc is a well-known Monte Carlo simulation package for coupled electron-photon transport that is widely used in medical physics applications. This paper proposes a parallel implementation of the photon transport mechanism of EGSnrc for graphics processing units (GPUs) using NVIDIA’s Compute Unified Device Architecture (CUDA). The implementation is specifically designed for imaging applications in the […]

CUDA

Dec, 27

MIDeA: a multi-parallel intrusion detection architecture

Network intrusion detection systems face the challenge of identifying diverse attacks in extremely high-speed networks. For this reason, they must operate at multi-Gigabit speeds while performing highly complex per-packet and per-flow data processing. In this paper, we present a multi-parallel intrusion detection architecture tailored for high-speed networks. To cope with the increased […]

CUDA

Dec, 27

GPU Pro 2

This book focuses on advanced rendering techniques that run on the DirectX and/or OpenGL run-time with any shader language available. It includes articles on the latest and greatest techniques in real-time rendering, including MLAA, adaptive volumetric shadow maps, light propagation volumes, wrinkle animations, and much more. The book emphasizes techniques for handheld programming to reflect […]

OpenGL

Dec, 27

OpenCL Programming Guide

Using the new OpenCL (Open Computing Language) standard, you can write applications that access all available programming resources: CPUs, GPUs, and other processors such as DSPs and the Cell/B.E. processor. Already implemented by Apple, AMD, Intel, IBM, NVIDIA, and other leaders, OpenCL has outstanding potential for PCs, servers, handheld/embedded devices, high performance computing, and even […]

OpenCL

Dec, 27

GPU-accelerated Chemical Similarity Assessment for Large Scale Databases

The assessment of chemical similarity between molecules is a basic operation in chemoinformatics, a computational area concerned with the manipulation of chemical structural information. Comparing molecules is the basis for a wide range of applications such as searching in chemical databases, training prediction models for virtual screening or aggregating clusters of similar compounds. However, currently […]

CUDA

Dec, 27

Parallel Algorithms for GPU accelerated Probabilistic Inference

Real world data is likely to contain an inherent structure. Those structures may be represented with graphs which encode independence assumptions within the data. Performing inference in those models is nearly intractable on mobile devices or casual workstations. This work introduces and compares two approaches for accelerating the inference in graphical models by using GPUs […]

CUDA

Dec, 27

Hybrid CPU and GPGPU Volunteer Computing Framework over the Extensible Messaging and Presence Protocol for Parallel Branch and Bound Optimization of Truss Structures

This paper presents a new fault-tolerant message passing interface framework for hybrid parallel computing in distributed volunteer-based networks. It is a new core programming library able to link together many CPUs and promising General Purpose Graphic Processing Unit (GPGPU) devices over the Internet. We successfully developed a high-level programming interface on […]

CUDA