Posts
Jan, 21
OpenCL for Database Query Processing
In recent years, graphics processing units (GPUs) have evolved into powerful devices with significant computational performance and memory throughput. Efforts to exploit their potential to tackle problems from various scientific domains with high computational requirements have proven quite successful. In addition, previous research suggests that database query processing algorithms can be accelerated with the utilisation […]
Jan, 21
Markov Chain Monte Carlo on the GPU
Markov chains are a useful tool in statistics that allow us to sample and model a large population of individuals. We can extend this idea to the challenge of sampling solutions to problems. Using Markov chain Monte Carlo (MCMC) techniques we can also attempt to approximate the number of solutions with a certain confidence based […]
Jan, 21
A Practical Visualization Strategy for Large-Scale Supernovae CFD Simulations
Simulating the expansion of a Type II supernova using an adaptive computational fluid dynamics (CFD) engine yields a complex mixture of turbulent flow with dozens of physical properties. The dataset shown in this sketch was initially simulated on iVEC’s EPIC supercomputer (a 9600-core Linux cluster) using FLASH [Fryxell et al. 2000] to model the […]
Jan, 21
Parallel FEM Simulation Using GPUs
This paper presents a research concept for parallel finite element (FE) simulation of moving boundary and adaptive refinement problems using a graphics processing unit (GPU). The main concern in this study is to improve the numerical performance of continuous FE simulation using recent data-parallel computing technology (GPU-CUDA). The computational time for our existing simulations is […]
Jan, 21
EASEA: A Generic Optimization Tool for GPU Machines in Asynchronous Island Model
Very recently, we presented an efficient implementation of Evolutionary Algorithms (EAs) using Graphics Processing Units (GPUs) for solving microporous crystal structures. Because of both the inherent complexity of zeolitic materials and the constant pressure to accelerate R&D solutions, an asynchronous island model running on clusters of machines equipped with GPU cards, i.e. the current trend […]
Jan, 21
Plenoptic Rendering With Interactive Performance Using GPUs
Processing and rendering of plenoptic camera data requires significant computational power and memory bandwidth. At the same time, real-time rendering performance is highly desirable so that users can interactively explore the infinite variety of images that can be rendered from a single plenoptic image. In this paper we describe a GPU-based approach for lightfield processing […]
Jan, 21
Direct Visualization of Particle-Partition of Unity Data
Direct visualization of higher-order data provides manifold advantages over the traditional approach, which is based on resampling and subsequent visualization by interpolation-based techniques. Most importantly, it avoids excessive computation and consumption of memory, and prevents artifacts by pixel-accurate visualization at interactive rates. This work addresses particle-partition of unity simulation data, where fields are modeled both […]
Jan, 21
The State of the Art in Interactive Global Illumination
The interaction of light and matter in the world surrounding us is of striking complexity and beauty. Since the very beginning of computer graphics, adequate modeling of these processes and their efficient computation have been an intensively studied research topic, and the problem is still not solved. The inherent complexity stems from the underlying physical processes as well […]
Jan, 21
Fast Graph Cuts using Shrink-Expand Reparameterization
Global optimization of MRF energy using graph cuts is widely used in computer vision. As images are getting larger, faster graph cuts are needed without sacrificing optimality. Initializing or reparameterizing a graph using results of a similar one has provided efficiency in the past. In this paper, we present a method to speed up graph […]
Jan, 20
On the Correctness of the SIMT Execution Model of GPUs
GPUs are becoming a primary resource of computing power. They use a single instruction, multiple threads (SIMT) execution model that executes batches of threads in lockstep. If the control flow of threads within the same batch diverges, the different execution paths are scheduled sequentially; once the control flows reconverge, all threads are executed in lockstep […]
Jan, 20
Automated Techniques for Enabling Efficient MPI Application Migration
Applications that use the MPI standard have additional dependencies related to the MPI implementation. When migrating an MPI code to a new computing site, the binary will not run if these dependencies are not resolved by properly configuring the new site. In this work, we present techniques that automatically resolve dependencies before runtime and enable […]
Jan, 20
Experiences in Teaching a Specialty Multicore Computing Course
We detail the design and experiences in delivering a specialty multicore computing course whose materials are openly available. The course ambitiously covers three multicore programming paradigms: shared memory (OpenMP), device (CUDA) and message passing (RCCE), and involves significant practical work on their respective platforms: an UltraSPARC T2, Fermi GPU and the Single-Chip Cloud Computer. Specialized […]