12389

Posts

Jun, 23

Runtime Visualization of Application Progress and Monitoring of a GPU-enabled Parallel Environment

The paper presents the design, implementation, and real-life uses of a visualization subsystem for a distributed framework that parallelizes workflow-based computations among clusters whose nodes feature both CPUs and GPUs. Firstly, the proposed system presents a graphical view of the infrastructure with clusters, nodes and compute devices along with parameters and runtime graphs […]
Jun, 23

A Scala Prototype to Generate Multigrid Solver Implementations for Different Problems and Target Multi-Core Platforms

Many problems in computational science and engineering involve partial differential equations and thus require the numerical solution of large, sparse (non)linear systems of equations. Multigrid is known to be one of the most efficient methods for this purpose. However, the concrete multigrid algorithm and its implementation highly depend on the underlying problem and hardware. Therefore, […]
Jun, 23

Coupled Vlasov and two-fluid codes on GPUs

We present a way to combine Vlasov and two-fluid codes for the simulation of a collisionless plasma in large domains while keeping full information of the velocity distribution in localized areas of interest. This is made possible by solving the full Vlasov equation in one region while the remaining area is treated by a 5-moment […]
Jun, 22

The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC Applications

Accelerators are used in about 13% of the systems on the current Top500 List. The number of supercomputers leveraging accelerators grew by a factor of 2.2 in 2012, and such systems are expected to completely dominate the Top500 by 2015. Though most of these deployments use NVIDIA GPGPU accelerators, Intel’s Xeon Phi architecture will likely grow in popularity in the coming years. Unfortunately, […]
Jun, 22

The Fast and Wideband MoM Based on GPU and Two-Path AFS Acceleration

In this paper, a graphics processing unit (GPU) accelerated full-wave method of moments (MoM) is combined with a two-path adaptive frequency sampling (AFS) approach to analyze the wideband characteristics of body-wire structures. An equivalence principle is employed to treat the wire as a surface so that the model which is analyzed based on the electric-field […]
Jun, 22

Solving the Caputo Fractional Reaction-Diffusion Equation on GPU

We present a parallel GPU solution of the Caputo fractional reaction-diffusion equation in one spatial dimension with an explicit finite difference approximation. The parallel solution, which is implemented with the CUDA programming model, consists of three procedures: preprocessing, parallel solver, and postprocessing. The parallel solver involves the parallel tridiagonal matrix-vector multiplication, vector-vector addition, and constant vector […]
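
To illustrate two of the building blocks named in this abstract, here is a minimal sketch of a tridiagonal matrix-vector product and a vector-vector addition. It is written in plain Python for readability and is not the authors' CUDA implementation; the function names `tridiag_matvec` and `vector_add` and the three-array storage layout are assumptions made for this example. Each row of the product depends only on its neighbors in the input vector, which is why the operation maps naturally onto one GPU thread per row.

```python
def tridiag_matvec(lower, diag, upper, x):
    """Return y = A x for a tridiagonal matrix A stored as three diagonals.

    Assumed (hypothetical) layout for this sketch:
      lower[i] is A[i][i-1]  (lower[0] is unused)
      diag[i]  is A[i][i]
      upper[i] is A[i][i+1]  (upper[-1] is unused)
    """
    n = len(x)
    y = [0.0] * n
    # Every iteration is independent of the others, so on a GPU each
    # value of i could be handled by its own thread.
    for i in range(n):
        acc = diag[i] * x[i]
        if i > 0:
            acc += lower[i] * x[i - 1]
        if i < n - 1:
            acc += upper[i] * x[i + 1]
        y[i] = acc
    return y


def vector_add(a, b):
    """Element-wise sum of two equal-length vectors."""
    return [ai + bi for ai, bi in zip(a, b)]
```

In an explicit scheme of this general shape, one time step could be composed as `vector_add(tridiag_matvec(lower, diag, upper, u), source)`, where `u` and `source` are hypothetical names for the current solution and a source term.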
Jun, 22

Real-Time Deformation of Subdivision Surfaces from Object Collisions

We present a novel real-time approach for fine-scale surface deformations resulting from collisions. Deformations are represented by a high-resolution displacement function. When two objects collide, these offsets are updated directly on the GPU based on a dynamically generated binary voxelization of the overlap region. Consequently, we can handle collisions with arbitrary animated geometry. Our approach […]
Jun, 22

Comparison of Random Number Generators in Particle Swarm Optimization Algorithm

Intelligent optimization algorithms are very effective at tackling complex problems that would be difficult or impossible to solve exactly. A key component of these algorithms is the random number generator (RNG), which provides random numbers to drive the stochastic search process. Much effort is devoted to developing efficient RNGs with good statistical properties, and many […]
Jun, 20

Acceleration of GPU-based ultrasound simulation via data compression

The realistic simulation of ultrasound wave propagation is computationally intensive. The large size of the grid and the low degree of data reuse mean that it places a great demand on memory bandwidth. Graphics Processing Units (GPUs) have attracted attention for performing scientific calculations due to their potential for efficiently performing large numbers of floating […]
Jun, 20

GPU based FDTD method for investigation on the electromagnetic scattering from 1-D rough soil surface

In this paper, a graphics processing unit (GPU) implementation of the finite-difference time-domain (FDTD) algorithm is presented to investigate the electromagnetic (EM) scattering from a one-dimensional (1-D) Gaussian rough soil surface. The FDTD lattices are truncated by a uniaxial perfectly matched layer (UPML), in which the finite-difference equations are carried out for the total computation […]
Jun, 20

A Fast Mixed-Band Lifting Wavelet Transform on the GPU

The discrete wavelet transform (DWT) has been widely used in many image compression applications, such as JPEG2000 and compressive sensing MRI. Even though a lifting scheme [1] has been widely adopted to accelerate the DWT, only a handful of studies have addressed its efficient implementation on many-core accelerators, such as graphics processing units (GPUs). Moreover, […]
Jun, 20

MIC-SVM: Designing A Highly Efficient Support Vector Machine For Advanced Modern Multi-Core and Many-Core Architectures

The Support Vector Machine (SVM) has been widely used in data-mining and Big Data applications as modern commercial databases attach increasing importance to analytic capabilities. In recent years, SVM has been adapted to the field of High Performance Computing for power/performance prediction, auto-tuning, and runtime scheduling. However, even at the risk of losing […]

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org