Posts
Sep, 19
Simulating spiking neural networks on GPU
Modern graphics cards contain hundreds of cores that can be programmed for intensive calculations. They are beginning to be used for spiking neural network simulations. The goal is to make parallel simulation of spiking neural networks available to a large audience, without the requirements of a cluster. We review the ongoing efforts towards this goal, […]
Sep, 19
Parallelization of a Block-Matching Algorithm
In this work we present a parallelization technique, together with its GPU implementation, for the full-search block-matching algorithm. This problem consists in finding the block that best matches a given reference template in terms of some photometric measure within a predefined search area. Block matching is a fundamental processing step for many signal-processing applications. Its […]
Sep, 19
Beauty And The Beast: Exploiting GPUs In Haskell
In this paper we compare a Haskell system that exploits a GPU back end using Obsidian against a number of other GPU/parallel processing systems. Our examples demonstrate two major results. Firstly they show that the Haskell system allows the applications programmer to exploit GPUs in a manner that eases the development of parallel code by […]
Sep, 19
Gauge fixing using overrelaxation and simulated annealing on GPUs
We adopt CUDA-capable Graphic Processing Units (GPUs) for Coulomb, Landau and maximally Abelian gauge fixing in 3+1 dimensional SU(3) lattice gauge field theories. The local overrelaxation algorithm is perfectly suited for highly parallel architectures. Simulated annealing preconditioning strongly increases the probability to reach the global maximum of the gauge functional. We give performance results for […]
Sep, 18
Implementation of QR Updating Algorithms on the GPU
The least squares problem is an extremely useful device to represent an approximate solution to overdetermined systems, and a QR factorisation is a common method for solving least squares problems. It is often the case that multiple least squares solutions have to be computed with only minor changes in the underlying data. In this case, […]
Sep, 18
The Architecture and Evolution of CPU-GPU Systems for General Purpose Computing
GPU computing has emerged in recent years as a viable execution platform for throughput oriented applications or regions of code. GPUs started out as independent units for program execution but there are clear trends towards tight-knit CPU-GPU integration. In this work, we will examine existing research directions and future opportunities for chip integrated CPU-GPU systems. […]
Sep, 18
Quasi-real-time analysis of dynamic near field scattering data using a graphics processing unit
We present an implementation of the analysis of dynamic near field scattering (NFS) data using a graphics processing unit (GPU). We introduce an optimized data management scheme thereby limiting the number of operations required. Overall, we reduce the processing time from hours to minutes, for typical experimental conditions. Previously the limiting step in such experiments, […]
Sep, 18
High-throughput Execution of Hierarchical Analysis Pipelines on Hybrid Cluster Platforms
We propose, implement, and experimentally evaluate a runtime middleware to support high-throughput execution on hybrid cluster machines of large-scale analysis applications. A hybrid cluster machine consists of computation nodes which have multiple CPUs and general purpose graphics processing units (GPUs). Our work targets scientific analysis applications in which datasets are processed in application-specific data chunks, […]
Sep, 18
Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines
In this paper, we address the problem of efficient execution of a computation pattern, referred to here as the irregular wavefront propagation pattern (IWPP), on hybrid systems with multiple CPUs and GPUs. The IWPP is common in several image processing operations. In the IWPP, data elements in the wavefront propagate waves to their neighboring elements […]
Sep, 17
The 4rd International Workshop of GPU and MIC Solutions to Multiscale Problems in Science and Engineering (GPU-SMP’2013), 2013
TOPICS OF INTEREST AT THE CONFERENCE: Some topics are mentioned below but are not restricted to 1. Large-scale problems using GPU and hybrid systems 2. physical, chemical, biological, geological and industrial applications 3. Techniques for optimizing kernels in GPU and other many-core systems (MIC) 4. mixed precision computing 5. Benchmarking and performance evaluation for GPU, MIC, and hybrid systems 6. Visualization tools and techniques […]
Sep, 17
Seismic damage simulation for urban buildings based on high-performance GPU computing
Refined models have been an important development trend of urban regional seismic damage prediction. However, the application of refined models has been limited due to their high computational cost if implemented on traditional Central Processing Unit (CPU) platforms. In recent years, Graphics Processing Unit (GPU) technology has been developed and applied rapidly due to its […]
Sep, 17
A Simulation Framework for Scheduling Performance Evaluation on CPU-GPU Heterogeneous System
Modern PCs are equipped with multi-many core capabili-ties which enhance their computational power and address important issues related to the efficiency of the scheduling processes of the modern operating system in such hybrid architectures. The aim of our work is to implement a simulation framework devoted to the study of the scheduling process in hybrid […]