Posts
Nov, 26
Fast Hydraulic and Thermal Erosion on GPU
Computer games, TV series, movies, simulators, and many other computer graphics applications use external scenes where a realistic looking terrain is a vital part of the viewing experience. Creating such terrains is a challenging task. In this paper we propose a method that generates realistic virtual terrains by simulation of hydraulic and thermal erosion on […]
Nov, 26
A Short Note on Gaussian Process Modeling for Large Datasets using Graphics Processing Units
The graphics processing unit (GPU) has emerged as a powerful and cost effective processor for high performance computing. GPUs are capable of an order of magnitude more floating point operations per second as compared to modern central processing units (CPUs), and thus provide a great deal of promise for computationally intensive statistical applications (Brodtkorb et […]
Nov, 26
High performance system for the Interactive rendering of a 3D Model into MPEG-4
The goal of the project is to create a Multiprocessor system capable of rendering a 3D model into an MPEG-4 stream. This paper outlines the design, software architecture and hardware setup for the system. Preliminary success in the previous setup[1] helped us gain experience as well as motivation for this highly optimized and powerful second […]
Nov, 26
High Performance Iterative Solver for Linear System using Multi GPU
The variable preconditioned (VP) Krylov subspace method on multi Graphics Processing Unit (GPU) is numerically investigated. Besides, the linear system obtained by finite element method with an edge element is adopted for the problem. The results of computations show that VP conjugate gradient method on multi GPU demonstrated significant achievement than that of CPU. Especially, […]
Nov, 26
Partitioned Memory Parallel Programming Framework
We present a framework for parallel programming. It consists of a distributed shared memory based simplified programming model, which leaves the application developer to focus mainly on task decomposition. This is a unified model for many-core processors (e.g., CPUs and GPUs), multiple processors on a system, as well as multiple systems. We also present a […]
Nov, 26
Scaling Results for a Discontinuous Galerkin Finite-Element Wave Solver on Multi-GPU Systems
Among the various techniques for solving hyperbolic partial differential equations with inhomogeneous, irregularly-shaped domains, a relatively new type of finite element method has grown in popularity because of its flexibility and scalability across many parallel cores. Discont inuous Galerkin (DG) methods have shown themselves to be an effective scheme for the simulation of wave-propagation problems […]
Nov, 25
Seismic Wave Propagation Simulation Using Support Operator Method on multi-GPU system
The method of Support Operator(SOM) is a numerical method based on finite difference method. We use SOM to simulate seismic wave propagation by solving the three dimension viscoelastic equations. The Support Operator Rupture Dynamics(SORD) has been proved to be highly scalable in large-scale multi-processors computing. This paper discusses accelerating SORD on multi-GPU system using NVIDIA […]
Nov, 25
Parallel GMRES implementation for solving sparse linear systems on GPU clusters
In this paper, we propose an efficient parallel implementation of the GMRES method for GPU clusters. This implementation requires us to parallelize the GMRES algorithm between the CPUs of the cluster. Hence, all parallel and intensive computations on local data are performed on GPUs and reduction operations to compute global results are carried out by […]
Nov, 25
Design of 3D FFT on Multi-GPU Clusters
Turbulence simulations are computationally expensive and typically take weeks to complete. With the advent of GPGPU(General Purpose computing on GPUs), one can achieve significant speedup. Spectral method is one of the several known methods used in turbulence simulations. It is based on Fast Fourier Transforms(FFT) to an extent that it spends 70-80% time in calculating […]
Nov, 25
Fastplay: A Parallelization Model and Implementation of SMC on CUDA Based GPU Cluster Architecture
We propose a four-tiered parallelization model for acceleration of the secure multiparty computation (SMC) on the CUDA based Graphic Processing Unit (GPU) cluster architecture. Specification layer is the top layer, which adopts the SFDL of Fairplay for specification of secure computations. The SHDL file generated by the SFDL compiler of Fairplay is used as inputs […]
Nov, 25
Fast hough transform on GPUs: exploration of algorithm trade-offs
The Hough transform is a commonly used algorithm to detect lines and other features in images. It is robust to noise and occlusion, but has a large computational cost. This paper introduces two new implementations of the Hough transform for lines on a GPU. One focuses on minimizing processing time, while the other has an […]
Nov, 25
Decryption-decompression of AES protected ZIP files on GPUs
AES is a strong encryption system, so decryption-decompression of AES encrypted ZIP files requires very large computing power and techniques of reducing the password space. This makes implementations of techniques on common computing system not practical. In [1], we reduced the original very large password search space to a much smaller one which surely containing […]