Posts
Dec, 1
Developing Extensible Lattice-Boltzmann Simulators for General-Purpose Graphics-Processing Units
Lattice-Boltzmann methods are versatile numerical modeling techniques capable of reproducing a wide variety of fluid-mechanical behavior. These methods are well suited to parallel implementation, particularly on the single-instruction, multiple-data (SIMD) parallel processing environments found in computer graphics processing units (GPUs). Although recent programming tools dramatically improve the ease with which GPU-based applications can be […]
Dec, 1
OpenCL Numerical Simulations of Two-Fluid Compressible Flows With a 2D Random Choice Method
In this paper, we propose a new, very simple numerical method for solving liquid-gas compressible flows. Such flows are difficult to simulate because classical conservative finite volume schemes generate pressure oscillations at the liquid-gas interface. We extend to several dimensions the random choice scheme that we have previously constructed. The extension is performed through Strang […]
Dec, 1
CUDA-OpenGL Interoperability to Visualize Electromagnetic Fields Calculated by FDTD
In this contribution, a compute unified device architecture (CUDA) implementation of a two-dimensional finite-difference time-domain (FDTD) program is presented along with the OpenGL interoperability to visualize electromagnetic fields as an animation while an FDTD simulation is running. CUDA, which runs on a graphics processing unit (GPU) card, is used for electromagnetic field data generation and […]
Dec, 1
CPUless PCs inside networked control systems
This paper presents results that extend our previous WSEAS paper [1] and sets out the basics of a design framework for building hard real-time control systems on Unix and Unix-like operating systems. The framework was developed as part of a research project supported by the Slovak Research and Development Agency under contract No. VMSP-II-0034-09. This framework contains layer […]
Dec, 1
Multidimensional Dataflow Graph Modeling and Mapping for Efficient GPU Implementation
Multidimensional synchronous dataflow (MDSDF) provides an effective model of computation for a variety of multidimensional DSP systems that have static dataflow structures. In this paper, we develop new methods for optimized implementation of MDSDF graphs on embedded platforms that employ multiple levels of parallelism to enhance performance at different levels of granularity. Our approach allows […]
Nov, 29
Using GPUs for Realtime Prediction of Optical Forces on Microsphere Ensembles
Laser beams can be used to create optical traps that can hold and transport small particles. Optical trapping has been used in a number of applications ranging from prototyping at the microscale to biological cell manipulation. Successfully using optical tweezers requires predicting optical forces on the particle being trapped and transported. Reasonably accurate theory and […]
Nov, 29
Parallel Nonbinary LDPC Decoding on GPU
Nonbinary Low-Density Parity-Check (LDPC) codes are a class of error-correcting codes constructed over the Galois field GF(q) for q > 2. As extensions of binary LDPC codes, nonbinary LDPC codes can provide better error-correcting performance when the code length is short or moderate, but at a cost of higher decoding complexity. This paper proposes a […]
Nov, 29
clOpenCL – Supporting Distributed Heterogeneous Computing in HPC Clusters
Clusters that combine heterogeneous compute device architectures, coupled with novel programming models, have created a true alternative to traditional (homogeneous) cluster computing, making it possible to improve the performance of parallel applications. In this paper we introduce clOpenCL, a platform that supports the simple deployment and efficient running of OpenCL-based parallel applications that may span several cluster […]
Nov, 29
GPU Ray Tracing – Comparative Study of Ray-Triangle Intersection Algorithms
We present a comparative study of GPU ray tracing implemented for two different types of ray-triangle intersection algorithms used with a BVH (Bounding Volume Hierarchy) spatial data structure, evaluated for performance on three static scenes. We study how the number of triangles placed in a BVH leaf node affects rendering performance. We propose GPU-optimized SIMD ray-triangle intersection […]
Nov, 28
Formal Semantics of Heterogeneous CUDA-C: A Modular Approach with Applications
We extend an off-the-shelf, executable formal semantics of C (Ellison and Rosu’s K Framework semantics) with the core features of CUDA-C. The hybrid CPU/GPU computation model of CUDA-C presents challenges not just for programmers, but also for practitioners of formal methods. Our formal semantics helps expose and clarify these issues. We demonstrate the usefulness of […]
Nov, 28
Applying Contact Angle to a Two-Dimensional Smoothed Particle Hydrodynamics (SPH) model on a Graphics Processing Unit (GPU) Platform
A parallel, GPU-compatible, Lagrangian mesh-free particle solver for multiphase fluid flow based on the SPH scheme is developed and used to capture the interface evolution during droplet impact. Surface tension is modeled using the multiphase scheme of Hu et al. (2006). In order to precisely simulate the wetting phenomena, a method based on the […]
Nov, 28
Chest CT automatic analysis for lung nodules detection implemented on a GPU computing system
The aim of this work is the efficient implementation of Hessian-based filters. These filters are commonly used in medical image analysis and are employed in the Voxel Based Neural Approach (VBNA) lung CAD (Computer Aided Detection) system for lung nodule detection. This work mainly focuses on the optimization of the filter devoted to […]