Posts
Oct, 4
Heterogeneous GPU&CPU cluster for High Performance Computing in cryptography
This paper addresses issues associated with distributed computing systems and the application of mixed GPU&CPU technology to data encryption and decryption algorithms. We describe a heterogeneous cluster, HGCC, formed by two types of nodes: Intel processor with NVIDIA graphics processing unit and AMD processor with AMD graphics processing unit (formerly ATI), and a novel software […]
Oct, 4
Computing resultants on Graphics Processing Units: Towards GPU-accelerated computer algebra
In this article we report on our experience in computing resultants of bivariate polynomials on Graphics Processing Units (GPU). Following the outline of Collins’ modular approach [6], our algorithm starts by mapping the input polynomials to a finite field for sufficiently many primes m. Next, the GPU algorithm evaluates the polynomials at a number of […]
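The first two steps named in this abstract (reducing the inputs modulo a prime, then evaluating at a point) can be sketched as below. This is a minimal CPU-side illustration, not the authors' GPU algorithm: the sparse dictionary representation and the function names `reduce_mod` and `evaluate_at_x` are assumptions made for the example.

```python
# Hypothetical representation: a bivariate polynomial is a dict mapping
# (i, j) -> integer coefficient of x^i * y^j.

def reduce_mod(poly, p):
    """Map the integer coefficients into Z/pZ, dropping terms that vanish."""
    reduced = {}
    for (i, j), c in poly.items():
        r = c % p
        if r:
            reduced[(i, j)] = r
    return reduced

def evaluate_at_x(poly, x0, p):
    """Substitute x = x0 (mod p); return the coefficients in y, lowest degree first."""
    deg_y = max((j for (_, j) in poly), default=0)
    coeffs = [0] * (deg_y + 1)
    for (i, j), c in poly.items():
        coeffs[j] = (coeffs[j] + c * pow(x0, i, p)) % p
    return coeffs

# f(x, y) = 3*x^2*y + 10*y^2 - 7
f = {(2, 1): 3, (0, 2): 10, (0, 0): -7}
f5 = reduce_mod(f, 5)            # {(2, 1): 3, (0, 0): 3}
g = evaluate_at_x(f5, 2, 5)      # [3, 2], i.e. 3 + 2*y over Z/5Z
```

Each (prime, evaluation point) pair is independent of the others, which is presumably what makes the approach attractive for a GPU.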
Oct, 4
Parallel computations on GPU in 3D using the vortex particle method
The paper presented the Vortex in Cell (VIC) method for solving the fluid motion equations in 3D and its implementation for parallel computation in the multicore architecture of the Graphics Processing Unit (GPU). One of the most important components of the VIC method algorithm is the solution of the Poisson equation. Multigrid and full multigrid methods were chosen for its solution […]
Oct, 4
Towards accelerating Smoothed Particle Hydrodynamics simulations for free-surface flows on multi-GPU clusters
Starting from the single graphics processing unit (GPU) version of the Smoothed Particle Hydrodynamics (SPH) code DualSPHysics, a multi-GPU SPH program is developed for free-surface flows. The approach is based on a spatial decomposition technique, whereby different portions (sub-domains) of the physical system under study are assigned to different GPUs. Communication between devices is achieved […]
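As a rough illustration of spatial decomposition, the sketch below assigns particles to devices by cutting the domain into equal-width slabs along one axis. The slab layout, the function name `assign_subdomains`, and its parameters are assumptions for the example; the abstract does not say how DualSPHysics partitions the domain.

```python
def assign_subdomains(xs, x_min, x_max, n_devices):
    """Return, for each device, the indices of the particles inside its slab.

    Assumes equal-width slabs along x; particles on or beyond the outer
    boundaries are clamped into the first or last slab.
    """
    width = (x_max - x_min) / n_devices
    slabs = [[] for _ in range(n_devices)]
    for idx, x in enumerate(xs):
        k = int((x - x_min) / width)
        k = min(max(k, 0), n_devices - 1)
        slabs[k].append(idx)
    return slabs

xs = [0.1, 0.9, 0.45, 0.55, 1.0]
slabs = assign_subdomains(xs, 0.0, 1.0, 2)   # [[0, 2], [1, 3, 4]]
```

In a real multi-GPU code, particles near a slab boundary would also have to be exchanged with the neighbouring device, which this sketch leaves out.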
Oct, 3
KERNELGEN – A Toolchain for Automatic GPU-centric Applications Porting
KernelGen is a toolchain for porting existing source code to the GPU that does not involve inserting annotations or manually programming kernels, but instead moves as much of the target source to the GPU as possible, enabling automatic adaptation of large codebases, e.g. numerical models. Separate kernels are generated for parallel loops, and the rest of the […]
Oct, 3
GPU-based infrared thermography for NDE of minefields
Infrared thermography is an attractive technique for non-destructive evaluation processes and particularly for detecting shallowly buried mines. Its use consists of subjecting the area under inspection to a source of natural or artificial heating/cooling process and studying the soil’s response by means of the analysis of its thermal evolution given by a temporal sequence of […]
Oct, 3
Performance Analysis of an Ultrasound Reconstruction Algorithm for Non Destructive Testing
The CIVA software platform developed by CEA-LIST offers various simulation and data processing modules dedicated to non-destructive testing (NDT). In particular, ultrasonic imaging and reconstruction tools are provided for the purpose of localizing echoes and identifying and sizing the detected defects. Because of the complexity of the data processed, computation time is now a limitation for […]
Oct, 3
Shape Modeling and GPU Based Image Warping
This project addresses the problems of manually placing facial landmarks on a portrait and finding a fast way to warp the annotated image of a face. While there are many approaches to automatically find facial landmarks, most of them provide insufficient results in uncontrolled environments. Thus I introduce a method to manually adjust a non-rigid […]
Oct, 3
Orthogonalization on a general purpose graphics processing unit with double double and quad double arithmetic
Our problem is to accurately solve linear systems of modest dimensions (typically, the number of variables equals 32) on a general purpose graphics processing unit. The linear systems originate from the application of Newton’s method on polynomial systems of (moderately) large degrees. Newton’s method is applied as a corrector in a path following method, so […]
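For readers unfamiliar with double double arithmetic, the sketch below shows the basic idea: a value is stored as an unevaluated sum of two doubles (`hi`, `lo`), and an error-free transformation (Knuth's two-sum) recovers the rounding error of each addition. This is a simplified addition routine written for illustration, not the implementation used in the paper; the name `dd_add` and the tuple representation are assumptions.

```python
def two_sum(a, b):
    """Return (s, err) with s = fl(a + b) and a + b == s + err exactly."""
    s = a + b
    bb = s - a
    err = (a - (s - bb)) + (b - bb)
    return s, err

def dd_add(x, y):
    """Add two double-doubles given as (hi, lo) tuples (simplified variant)."""
    s, e = two_sum(x[0], y[0])
    e += x[1] + y[1]          # fold the low-order parts into the error term
    return two_sum(s, e)      # renormalise so that hi carries the leading bits

# In plain doubles the small term is lost entirely:
plain = (1.0 + 1e-20) - 1.0                  # 0.0

# With double-doubles it survives the round trip:
t = dd_add((1.0, 0.0), (1e-20, 0.0))         # (1.0, 1e-20)
r = dd_add(t, (-1.0, 0.0))                   # (1e-20, 0.0)
```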
Oct, 2
CADDIES: A New Framework for Rapid Development of Parallel Cellular Automata Algorithms for Flood Simulation
A recent trend in the development of flood simulation algorithms shows the move toward fast simplified models instead of slow full hydrodynamic models. CADDIES is a research project that aims to develop a real/near-real time pluvial urban flood simulation model using the computational speed of cellular automata (CA) algorithms. This paper presents a component of […]
Oct, 2
Material Removal Simulation and Cutting Force Prediction of Multi-Axis Machining Processes on General-Purpose Graphics Processing Units
The efficient planning of automated machining processes is unthinkable without the use of offline CAM systems. Though machining programs can be written and input manually, right at the machine controller, if the workpiece geometry is complex, or if the machined features are numerous, the help of CAM software is essential for generating the program both […]
Oct, 2
Exploiting Limited Access Distance of ODE Systems for Parallelism and Locality in Explicit Methods
The solution of initial value problems of large systems of ordinary differential equations (ODEs) is computationally intensive and demands efficient parallel solution techniques that take into account the complex architectures of modern parallel computer systems. This article discusses implementation techniques suitable for ODE systems with a special coupling structure, called limited access distance, which […]