5966

Posts

Oct, 13

A rendering method for simulated emission nebulae

Emission nebulae are some of the most beautiful stellar phenomena. The newly formed hot stars inside the nebulae ionize the surrounding gas making it glow in variety of colors. The focus of this work is to find a method for interactive rendering of simulated emission nebulae. A rendering program has been developed to render and […]
Oct, 13

Introduction to GPU Radix Sort

Radix sort is one of the fastest sorting algorithms. It is fast especially for a large problem size. Radix sort is not a comparison sort but a counting sort. When we sort n bit keys, 2^n counters are prepared for each number.
Oct, 13

Input Sensitivity of GPU Program Optimizations

Graphic Processing Units (GPU) have become increasingly adopted for the enhancement of computing throughput. However, the development of a high-quality GPU application is challenging, due to the large optimization space and complex unpredictable effects of optimizations on GPU program performance. Many recent efforts have been employing empirical search-based auto-tuners to tackle the problem, but few […]
Oct, 13

gpustats: GPU Library for Statistical Computing in Python

In this talk we will discuss gpustats, a new Python library for assisting in "big data" statistical computing applications, particularly Monte Carlobased inference algorithms. The library provides a general code generation / metaprogramming framework for easily implementing discrete and continuous probability density functions and random variable samplers. These functions can be utilized to achieve more […]
Oct, 13

Seamless Dynamic Runtime Reconfiguration in a Software-Defined Radio

We discuss implementation aspects of a software-defined radio system that allows for dynamic waveform reconfiguration during runtime without interrupting dataflow processing. Traditional software-defined radio systems execute a waveform statically, exactly as it is programmed. Reconfiguration is provided by executing a different waveform, which requires the system to stop processing data while reconfiguration occurs, and also […]
Oct, 13

Developing a High Performance GPGPU Compiler Using Cetus

In this paper we present our experience in developing an optimizing compiler for general purpose computation on graphics processing units (GPGPU) based on the Cetus compiler framework. The input to our compiler is a naive GPU kernel procedure, which is functionally correct but without any consideration for performance optimization. Our compiler applies a set of […]
Oct, 13

PlinkGPU: A Framework for GPU Acceleration of Whole Genome Data Analysis

Genome-wide association studies (GWAS) are performed in order to detect the genetic variations associated with physical traits (e.g. diseases), and Plink is a popular software system for analyzing the data of GWAS. Due to the large datasets involved, the task of data processing can be very time-consuming. Although GPUs (graphics processing units) are not generally […]
Oct, 13

State of The Art Report on GPU

This report aims to provide a beginner’s introduction to GPUs, from both a hardware and a software angle. We look at the evolution of specialist graphics hardware from the early days of PC graphics cards to the present day. We describe the currently available hardware from NVIDIA and AMD/ATI, and the current software from both […]
Oct, 12

Optimizing a High Energy Physics (HEP) Toolkit on Heterogeneous Architectures

A desired trend within high energy physics is to increase particle accelerator luminosities, leading to production of more collision data and higher probabilities of finding interesting physics results. A central data analysis technique used to determine whether results are interesting or not is the maximum likelihood method, and the corresponding evaluation of the negative log-likelihood, […]
Oct, 12

GPU-Based Translation-Invariant 2D Discrete Wavelet Transform for Image Processing

The Discrete Wavelet Transform (DWT) is applied to various signal and image processing applications. However the computation is computational expense. Therefore plenty of approaches have been proposed to accelerate the computation. Graphics processing units (GPUs) can be used as stream processor to speed up the calculation of the DWT. In this paper, we present a […]
Oct, 12

High Performance Computing with Accelerators

High-performance computing (HPC) uses supercomputers and computer clusters to solve advanced computation problems. HPC has come to be applied to business uses of cluster-based supercomputers, such as data warehouses, line-of-business (LOB) applications, and transaction processing. In the past few years, a new class of HPC systems has emerged. These systems employ unconventional processor architectures-such as […]
Oct, 12

Ray Tracing on Graphics Hardware

Ray tracing is one of the important elements in photo-realistic image synthesis. Since ray tracing is computationally expensive, a large body of research has been devoted to improve the performance of ray tracing. One of the recent developments on efficient ray tracing is the implementation on graphics hardware. Similar to general purpose CPUs, recent graphics […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: