Posts
Dec, 4
Computing Strongly Connected Components in Parallel on CUDA
The problem of decomposing a directed graph into its strongly connected components is a fundamental graph problem inherently present in many scientific and commercial applications. In this paper we show how some of the existing parallel algorithms can be reformulated in order to be accelerated by NVIDIA CUDA technology. In particular, we design a new […]
Dec, 4
Implementing CFD (Computational Fluid Dynamics) in OpenCL for Building Simulation
Though researchers in computer graphics have started to use the GPGPU (General Purposed Graphics Processing Unit) method to speed up their procedural programs, these techniques are seldom used in the building simulation field. It is possible to apply the GPGPU method to many simulation scenarios (i.e. human evacuation, shadow simulation) to speed up performance. In […]
Dec, 4
Global Point Mascon Models for Simple, Accurate and Parallel Geopotential Computation
High-fidelity geopotential calculation using spherical harmonics (SH) is expensive and relies on recursive non-parallel relations. Here, a global point mascon (PMC) model is proposed that is memory light, extremely simple to implement (at any derivative level), and is naturally amenable to parallelism. The gravity inversion problem is posed classically as a large and dense least […]
Dec, 4
Multi-Directional Optimisation on the GPU
The multi-directional (MD) technique is a general purpose tool for optimisation, that is, finding the global maxima or minima of some objective function in a given domain. Any function that produces a relatively continuous surface may therefore be suitable. Using a graphics processing unit (GPU) for MD optimisation demonstrates an increase in speed of up […]
Dec, 4
Multi-GPU Load Balancing for In-situ Visualization
Real-time visualization is an important tool for immediately inspecting results for scientific simulations. Graphics Processing Units (GPUs) as commodity computing devices offer massive parallelism that can greatly improve performance for data-parallel applications. However, a single GPU provides limited support which is only suitable for smaller scale simulations. Multi-GPU computing, on the other hand, allows concurrent […]
Dec, 4
Solving Rigid Multibody Physics Dynamics Using Proximal Point Functions on the GPU
Physical simulation is important for a wide range of problems, particularly so in the field of robotics. The need for faster simulation to provide larger amounts of data is increasingly growing. The trend in computing is growing towards more cores as opposed to faster cores, and the graphical processing unit, or GPU, shows great promise […]
Dec, 3
Programming hybrid systems with implicit memory based synchronization
In the last years CPU performance increases came with an increase in software development complexity. One of the next big changes in CPU architecture may be so-called hybrid multicore chips, which combine both multicore and manycore technologies on the same chip. Unfortunately, this increase in performance again may lead to an increase in development complexity. […]
Dec, 3
Computation of Large Covariance Matrices by SAMMY on Graphical Processing Units and Multicore CPUs
Computational power of Graphical Processing Units and multicore CPUs was harnessed by the nuclear data evaluation code SAMMY to speed up computations of large Resonance Parameter Covariance Matrices (RPCMs). This was accomplished by linking SAMMY to vendor-optimized implementations of the matrix-matrix multiplication subroutine of the Basic Linear Algebra Library to compute the most time-consuming step. […]
Dec, 3
A Multi-View Stereo Implementation on Massively Parallel Hardware
In recent years, we have seen several approaches to implement hardware-accelerated multi-view stereo (MVS) algorithms employing the graphics processing unit (GPU) for fast and parallel computation. To our knowledge, all of them resort to various rendering passes to perform their computations. In contrast, modern GPU compute frameworks give access to the massively parallel compute capability […]
Dec, 3
Impact of Floating-Point Precision on Boundary Layer Instabilities Modeled on Fermi GPU
We have implemented two-dimensional and three-dimensional Rayleigh-Benard convection for infinite Prandtl number, appropriate for the Earth’s mantle, on a single Fermi GPU by utilizing a second-order finite-difference method. The code was written in C for CUDA and heavily itilized optimized CUBLAS routines. These implementations enjoyed performance on the order 535 GFLOP/s and 100 GFLOP/s in […]
Dec, 3
Implementations of a Parallel Algorithm for Computing Euclidean Distance Map in Multicore Processors and GPUs
Given a 2-D binary image of size nxn, Euclidean Distance Map (EDM) is a 2-D array of the same size such that each element is storing the Euclidean distance to the nearest black pixel. It is known that a sequential algorithm can compute the EDM in O(n2) and thus this algorithm is optimal. Also, work-time […]
Dec, 3
Design and Optimization of Hybrid MD5-Blowfish Encryption on GPUs
Nowadays, data has been playing an indispensable role in almost all industrial areas. Data integrity and security over Internet, other types of media and applications have become the major concerns in computer world. If confidential or sensitive data is forged, juggled or wiretapped by an attacker, capital losses might occur. Encryption is one of the […]