9680

Posts

Jun, 13

7th International ICST Conference on Simulation Tools and Techniques, SIMUTools 2014

SIMUTools 2014 is the seventh edition of the annual International Conference on Simulation Tools and Techniques. The conference focuses on a broad range of research challenges in the field of simulation, modeling and analysis, addressing current and future trends in simulation techniques, models, practices and software. The conference is dedicated to fostering interdisciplinary collaborative research […]
Jun, 13

Use of CUDA Parallel Computing Technology in Modeling of Solid Mineral Deposits

This article describes CUDA technology of high-performance parallel computing and its application in the modeling of mineral deposits. The article describes CUDA technology of high-performance parallel computing. It provides the concept, features of the technology architecture, the principle of development of applications based on CUDA. The article presents algorithms for modeling mineral resources deposits using […]
Jun, 13

Cost-Effective Soft-Error Protection for SRAM-Based Structures in GPGPUs

The general-purpose computing on graphics processing units (GPGPUs) are increasingly used to accelerate parallel applications. This makes reliability a growing concern in GPUs as they are originally designed for graphics processing with relaxed requirements for execution correctness. With CMOS processing technologies continuously scaling down to the nano-scale, on-chip soft error rate (SER) has been predicted […]
Jun, 13

Acceleration of calculation of Third Party Risk around an airport using OpenCL

During the past two decades, the Dutch National Aerospace Laboratory has developed a model to calculate the risk for third parties around airports. This Third Party Risk model is used in the decision making with respect to airport development and land use planning. Due to the increase of air traffic, the availability of improved individual […]
Jun, 13

An Analysis of Conventional and Heterogeneous Workloads on Production Supercomputing Resources

Cloud computing setups are a huge investment of resources and personnel to maintain. As the workload on a system is a major contributing factor to both the performance of the system and a representation of the needs of system users, a clear understanding of the workload is critical to organizations that support supercomputing systems. In […]
Jun, 13

Real-time planar flow velocity measurements using an optical flow algorithm implemented on GPU

This paper presents a high speed implementation of an optical flow algorithm which computes planar velocity fields in an experimental flow. Real-time computation of the flow velocity field allows the experimentalist to have instantaneous access to quantitative features of the flow. This can be very useful in many situations: fast evaluation of the performances and […]
Jun, 12

The Hierarchical Memory Machine Model for GPUs

The Discrete Memory Machine (DMM) and the Unified Memory Machine (UMM) are theoretical parallel computing models that capture the essence of the shared memory access and the global memory access of GPUs. The main contribution of this paper is to introduce the Hierarchical Memory Machine (HMM), which consists of multiple DMMs and a single UMM. […]
Jun, 12

FastSpMM: An Efficient Library for Sparse Matrix Matrix Product on GPUs

Sparse matrix matrix (SpMM) multiplication is involved in a wide range of scientific and technical applications. The computational requirements for this kind of operation are enormous, especially for large matrices. This paper analyzes and evaluates a method to efficiently compute the SpMM product in a computing environment that includes graphics processing units (GPUs). Some libraries […]
Jun, 12

FFT-SPA Non-Binary LDPC Decoding on GPU

It is well known that non-binary LDPC codes outperform the BER performance of binary LDPC codes for the same code length. The superior BER performance of non-binary codes comes at the expense of more complex decoding algorithms that demand higher computational power. In this paper, we propose parallel signal processing algorithms for performing the FFT-SPA […]
Jun, 12

OpenCL Implementation of a Color Based Object Tracking

In this paper we present an algorithm for realtime object tracking based on color. Firstly, a two-layer perceptron is trained aimed at coping with scene illumination changes. Based on this training, a piece of OpenCL code is generated for the purpose of harnessing the power of GPU computing. Then, color based object tracking is done […]
Jun, 12

Performance of a GPU-based Direct Summation Algorithm for Computation of Small Angle Scattering Profile

Small Angle Scattering (SAS) of X-rays or neutrons is an experimental technique that provides valuable structural information for biological macromolecules under physiological conditions and with no limitation on the molecular size. In order to refine molecular structure against experimental SAS data, ab initio prediction of the scattering profile must be recomputed hundreds of thousands of […]
Jun, 10

OCLoptimizer: An Iterative Optimization Tool for OpenCL

Nowadays, computers include several computational devices with parallel capacities, such as multicore processors and Graphic Processing Units (GPUs). OpenCL enables the programming of all these kinds of devices. An OpenCL program consists of a host code which discovers the computational devices available in the host system and it queues up commands to the devices, and […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: