Posts
Jun, 15
Evaluating CP2K on Exascale Hardware: Intel Xeon Phi
CP2K, a popular open-source European atomistic simulation package has been ported to the Intel Xeon Phi architecture, requiring no code modifications except minor bug fixes. Benchmarking of a small molecular dynamics simulation has been carried out using CP2K’s existing MPI, OpenMP and mixed-mode MPI/OpenMP implementations to achieve full utilisation of the Xeon Phi’s 240 virtual […]
Jun, 15
Airborne radar clutter simulation using GPU (CUDA)
Radar is an object detection system. Airborne radar is meant to search, detect and track aerial objects. Clutter is an unwanted echo that interferes with the observation of signal on radar screen. This paper discusses the use of GPU and CUDA. Graphic Processor Unit or GPU computing is the use of GPU together with CPU […]
Jun, 15
A Parallel Algorithm of PCA-SIFT Based on CUDA
PCA-SIFT is an algorithm to extract invariant features from images, it has been widely applied to many application fields including image processing, computer vision and pattern recognition. However, the execution of PCA-SIFT is time-consuming. A parallel algorithm of PCA-SIFT based on Compute Unified Device Architecture (CUDA) is proposed in this paper, in which each step […]
Jun, 15
Scalable Lattice Boltzmann Solvers for CUDA GPU Clusters
The lattice Boltzmann method (LBM) is an innovative and promising approach in computational fluid dynamics. From an algorithmic standpoint it reduces to a regular data parallel procedure and is therefore well-suited to high performance computations. Numerous works report efficient implementations of the LBM for the GPU, but very few mention multi-GPU versions and even fewer […]
Jun, 14
Image Denoising Using Wavelet Transform and CUDA
The discrete wavelet transform has a huge number of applications in science, engineering, mathematics and computer science. Most notably, it is used for signal coding to represent a discrete signal in a more redundant form, often as a preconditioning for data compression. Beginning in the 1990s, wavelets have been found to be a powerful tool […]
Jun, 14
Dynamic loop vectorization for executing OpenCL kernels on CPUs
Heterogeneous computing platforms are becoming increasingly important in supercomputing. Many systems now integrate CPUs and GPUs cooperating together on a single node. Much effort is invested in tuning GPU-kernels. However, it can be the case that some systems may not have GPUs or the GPUs are busy. Maintaining two versions of the same code for […]
Jun, 14
A GPU Implementation of Large Neighborhood Search for Solving Constraint Optimization Problems
Constraint programming has gained prominence as an effective and declarative paradigm for modeling and solving complex combinatorial problems. In particular, techniques based on local search have proved practical to solve real-world problems, providing a good compromise between optimality and efficiency. In spite of the natural presence of concurrency, there has been relatively limited effort to […]
Jun, 14
Parallel technologies for solving system of the linear equations by the conjugate gradient method
The main purpose of this work is to show the clear advantages of using modern parallel tools in solving the SLAE. The BiCGStab method was used for solving system of the linear equations. This paper contains some details about this method. To accelerate computations on the GPU several technologies (such as CUBLAS, OpenACC) were used. […]
Jun, 14
Unified Development for Mixed Multi-GPU and Multi-Coprocessor Environments using a Lightweight Runtime Environment
Many of the heterogeneous resources available to modern computers are designed for different workloads. In order to efficiently use GPU resources, the workload must have a greater degree of parallelism than a workload designed for multicore-CPUs. And conceptually, the Intel Xeon Phi coprocessors are capable of handling workloads somewhere in between the two. This multitude […]
Jun, 13
International Conference on Computer Engineering, ICOCE 2014
Publication All papers, both invited and contributed, will be reviewed by two or three experts from the PC. After a careful reviewing process, all accepted paper will be publishedin WIT Transactions on Information and Communication Technologies (ISSN: 1743-3517), which will be indexed by EI Compendex, Scopus and ISI. Topics: The topics of ICOCE2014 are included […]
Jun, 13
Realizing Accelerated Cost-Effective Distributed RAID
The exponential growth in user and application data entails new means for providing fault tolerance and protection against data loss. High Performance Computing (HPC) storage systems, which are at the forefront of handling the data deluge, typically employ hardware RAID at the backend. However, such solutions are costly, do not ensure end-to-end data integrity, and […]
Jun, 13
LTE Physical Layer Implementation Using GPU Based High Performance Computing
In recent years Graphics Processing Unit (GPU) has evolved as a high performance data processing technology allowing users to compute large blocks of parallel data using an array of low complexity processors. This paper proposes the implementation of compute intensive portions of 3rd Generation Partnership Project (3GPP) Long Term Evolution (LTE) physical layer using GPU. […]