Posts
Aug, 9
A GPU implementation of massively parallel direction splitting for the incompressible Navier-Stokes equations
Guermond and Minev proposed a directional splitting algorithm to solve the incompressible Stokes equations. Their algorithm applies the alternating direction implicit method to the viscosity term. The pressure update uses a direction splitting method in order to enforce the incompressibility constraint, as opposed to commonly used projection methods that require the solution of a Poisson […]
Aug, 9
A GPGPU based program to solve the TDSE in intense laser fields through the finite difference approach
We present a general-purpose computing on graphics processing units (GPGPU) based computational program and framework for the electronic dynamics of atomic systems under intense laser fields. We present our results using the case of hydrogen; however, the code is trivially extensible to tackle problems within the single-active electron (SAE) approximation. Building on our previous work, […]
Aug, 9
Multi-core CPU or GPU-accelerated Multiscale Modeling for Biomolecular Complexes
Multi-scale modeling plays an important role in understanding the structure and biological functionalities of large biomolecular complexes. In this paper, we present an efficient computational framework to construct multi-scale models from atomic resolution data in the Protein Data Bank (PDB), which is accelerated by multi-core CPU and programmable Graphics Processing Units (GPU). A multi-level summation […]
Aug, 9
High-Performance Holistic XML Twig Filtering Using GPUs
The current state of the art in information dissemination comprises publishers broadcasting XML-coded documents, which are in turn selectively forwarded to interested subscribers. The deployment of XML at the heart of this setup greatly increases the expressive power of the profiles listed by subscribers, using the XPath language. On the other hand, with great expressive power comes […]
Aug, 8
GPU implementation of a shell element structural solver aimed at fluid-structure interaction problems
The study of thin structures is very common nowadays and useful in different fields. An important example is the analysis of sail dynamics. In this context, accurate simulations of the interaction between the sail and the wind are also required. However, this kind of fluid-structure interaction problem is computationally very expensive. The first objective of this […]
Aug, 8
An Energy Optimization of a GPU Application by Grid Design Space Exploration
Power and energy consumption are also becoming important design criteria. Consequently, software designers have to consider power and energy consumption together with performance when developing software. In this paper, we perform a design space exploration on a commercial GPU, the NVIDIA GTX 660, to investigate the best configuration of a kernel grid structure in a […]
Aug, 8
Levy Flights for Particle Swarm Optimisation Algorithms on Graphical Processing Units
Particle Swarm Optimisation (PSO) is a powerful algorithm for space search problems such as parametric optimisation. Particles with Levy flights have a long-tailed probability of outlier jumps in the problem space that provide a good compromise between local space exploration and local minima avoidance. Generating many particles and their trajectories with Levy random deviates is […]
Aug, 8
Compiler-based Data Prefetching and Streaming Non-temporal Store Generation for the Intel Xeon Phi Coprocessor
The Intel Xeon Phi coprocessor has software prefetching instructions to hide memory latencies and special store instructions to save bandwidth on streaming nontemporal store operations. In this work, we provide details on compiler-based generation of these instructions and evaluate their impact on the performance of the Intel Xeon Phi coprocessor using a wide range of […]
Aug, 8
Improving the GPU space of computation under triangular domain problems
There is a stage in the GPU computing pipeline where a grid of thread-blocks is mapped to the problem domain. Normally, this grid is a k-dimensional bounding box that covers a k-dimensional problem no matter its shape. Threads that fall inside the problem domain perform computations; otherwise, they are discarded at runtime. For problems with […]
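To make the cost of the bounding-box mapping concrete, here is a minimal Python sketch for a 2D lower-triangular domain. It is not taken from the paper: the function names are hypothetical, and the square-root index map is one well-known way to enumerate triangular blocks, assumed here only for illustration.

```python
import math

def bounding_box_blocks(n, b):
    """Blocks launched when a square grid of b x b blocks covers an n x n domain."""
    m = math.ceil(n / b)
    return m * m

def triangular_blocks(n, b):
    """Blocks that overlap the lower triangle (row >= col), diagonal included."""
    m = math.ceil(n / b)
    return m * (m + 1) // 2

def block_index_to_triangle(k):
    """Map a linear block index k to (row, col) in the lower triangle, row by row."""
    row = (math.isqrt(8 * k + 1) - 1) // 2   # largest row with row*(row+1)/2 <= k
    col = k - row * (row + 1) // 2
    return row, col

if __name__ == "__main__":
    n, b = 1024, 16
    box, tri = bounding_box_blocks(n, b), triangular_blocks(n, b)
    print(f"bounding box: {box} blocks, triangle: {tri} blocks")
    print(f"blocks doing no useful work: {box - tri}")
```

With a bounding box, close to half of the launched blocks lie entirely outside the triangle, which is the waste the abstract refers to.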
Aug, 7
Exploring Microcontrollers in GPUs
Recent graphics processing units (GPUs) integrate wimpy microcontrollers on a chip. They are often used to execute firmware code configuring the functional units of GPUs. This paper opens up the programming of these microcontrollers and explores how to utilize them for GPU resource management. Our prototype system provides a compiler suite for NVIDIA’s GPU microcontrollers […]
Aug, 7
Finite Difference Time-Domain Modelling of Metamaterials: GPU Implementation of Cylindrical Cloak
The finite difference time-domain (FDTD) technique can be used to model metamaterials by treating them as dispersive materials. A Drude or Lorentz model can be incorporated into the standard FDTD algorithm for modelling negative permittivity and permeability. The FDTD algorithm is readily parallelisable and can take advantage of GPU acceleration to achieve speed-ups of 5x-50x depending on hardware […]
Aug, 7
Fast Morphological Image Processing on GPU using CUDA
Mathematical morphology is used as a tool for extracting image components that are useful in the representation and description of region shape. The mathematical morphology operations of dilation, erosion, opening, and closing are important building blocks of many other image processing algorithms. Data-parallel programming provides an opportunity for performance acceleration using highly […]
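For readers unfamiliar with the four operations named above, the following is a minimal CPU reference sketch in plain Python, not the paper's CUDA implementation. It assumes a flat 3x3 structuring element and ignores neighbours that fall outside the image; both choices, and all function names, are illustrative assumptions.

```python
def _apply(img, pick):
    """Apply pick (max or min) over each pixel's 3x3 neighbourhood, clipped at the border."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [img[j][i]
                      for j in range(max(0, y - 1), min(h, y + 2))
                      for i in range(max(0, x - 1), min(w, x + 2))]
            out[y][x] = pick(window)
    return out

def dilate(img):
    return _apply(img, max)   # dilation: neighbourhood maximum

def erode(img):
    return _apply(img, min)   # erosion: neighbourhood minimum

def opening(img):
    return dilate(erode(img))  # removes features smaller than the element

def closing(img):
    return erode(dilate(img))  # fills gaps smaller than the element

if __name__ == "__main__":
    img = [[0] * 5 for _ in range(5)]
    img[2][2] = 1              # a single bright pixel
    for row in dilate(img):
        print(row)
```

Each output pixel depends only on a small neighbourhood of the input, which is why these operations map naturally onto one GPU thread per pixel.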