Posts
Jun, 14
Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET
Scatter correction is very important in 3D PET reconstruction due to a large scatter contribution in measurements. Currently, one of the most popular methods is so called single scatter simulation (SSS), which considers single Compton scattering contributions from many randomly distributed scatter points. The SSS enables a fast calculation of scattering with a relatively high […]
Jun, 14
Computing virtual acoustics using the 3D finite difference time domain method and Kepler architecture GPUs
The computation of virtual acoustics for physical modelling synthesis using the finite difference time domain is a computationally expensive process, especially at audio rates such as 44.1kHz. However, the high level of dataindependence is well suited to parallel architectures such as those provided by graphics processing units. This paper describes the use of the latest […]
Jun, 14
Research on a Parallel BD-tree Index Structure
The BD-tree is an efficient database index structure which has good random access performance like hashing methods and can also provide range search and key sequential access like the B+-tree. In order to further improve the operational performance of the BD-tree, we adapted the traditional BD-tree hash function to realize the BD-tree parallel processing. By […]
Jun, 13
7th International ICST Conference on Simulation Tools and Techniques, SIMUTools 2014
SIMUTools 2014 is the seventh edition of the annual International Conference on Simulation Tools and Techniques. The conference focuses on a broad range of research challenges in the field of simulation, modeling and analysis, addressing current and future trends in simulation techniques, models, practices and software. The conference is dedicated to fostering interdisciplinary collaborative research […]
Jun, 13
Use of CUDA Parallel Computing Technology in Modeling of Solid Mineral Deposits
This article describes CUDA technology of high-performance parallel computing and its application in the modeling of mineral deposits. The article describes CUDA technology of high-performance parallel computing. It provides the concept, features of the technology architecture, the principle of development of applications based on CUDA. The article presents algorithms for modeling mineral resources deposits using […]
Jun, 13
Cost-Effective Soft-Error Protection for SRAM-Based Structures in GPGPUs
The general-purpose computing on graphics processing units (GPGPUs) are increasingly used to accelerate parallel applications. This makes reliability a growing concern in GPUs as they are originally designed for graphics processing with relaxed requirements for execution correctness. With CMOS processing technologies continuously scaling down to the nano-scale, on-chip soft error rate (SER) has been predicted […]
Jun, 13
Acceleration of calculation of Third Party Risk around an airport using OpenCL
During the past two decades, the Dutch National Aerospace Laboratory has developed a model to calculate the risk for third parties around airports. This Third Party Risk model is used in the decision making with respect to airport development and land use planning. Due to the increase of air traffic, the availability of improved individual […]
Jun, 13
An Analysis of Conventional and Heterogeneous Workloads on Production Supercomputing Resources
Cloud computing setups are a huge investment of resources and personnel to maintain. As the workload on a system is a major contributing factor to both the performance of the system and a representation of the needs of system users, a clear understanding of the workload is critical to organizations that support supercomputing systems. In […]
Jun, 13
Real-time planar flow velocity measurements using an optical flow algorithm implemented on GPU
This paper presents a high speed implementation of an optical flow algorithm which computes planar velocity fields in an experimental flow. Real-time computation of the flow velocity field allows the experimentalist to have instantaneous access to quantitative features of the flow. This can be very useful in many situations: fast evaluation of the performances and […]
Jun, 12
The Hierarchical Memory Machine Model for GPUs
The Discrete Memory Machine (DMM) and the Unified Memory Machine (UMM) are theoretical parallel computing models that capture the essence of the shared memory access and the global memory access of GPUs. The main contribution of this paper is to introduce the Hierarchical Memory Machine (HMM), which consists of multiple DMMs and a single UMM. […]
Jun, 12
FastSpMM: An Efficient Library for Sparse Matrix Matrix Product on GPUs
Sparse matrix matrix (SpMM) multiplication is involved in a wide range of scientific and technical applications. The computational requirements for this kind of operation are enormous, especially for large matrices. This paper analyzes and evaluates a method to efficiently compute the SpMM product in a computing environment that includes graphics processing units (GPUs). Some libraries […]
Jun, 12
FFT-SPA Non-Binary LDPC Decoding on GPU
It is well known that non-binary LDPC codes outperform the BER performance of binary LDPC codes for the same code length. The superior BER performance of non-binary codes comes at the expense of more complex decoding algorithms that demand higher computational power. In this paper, we propose parallel signal processing algorithms for performing the FFT-SPA […]