
Posts

Oct, 16

High-Performance Computing Algorithms for Constructing Inverted Files on Emerging Multicore Processors

Current trends in processor architectures increasingly include more cores on a single chip and more complex memory hierarchies, and such a trend is likely to continue in the foreseeable future. These processors offer unprecedented opportunities for speeding up demanding computations if the available resources can be effectively utilized. Simultaneously, parallel programming languages such as OpenMP […]
Oct, 16

Particle Filters on Multi-Core Processors

The particle filter is a Bayesian estimation technique based on Monte Carlo simulation. The nonparametric nature of particle filters makes them ideal for non-linear, non-Gaussian dynamic systems. Particle filtering has many applications: in computer vision, robotics, and econometrics to name just a few. Although superior to Kalman filters, particle filters have higher computational requirements, which […]
Oct, 16

Analysis of Single Phase Fluid Flow and Heat Transfer in Slip Flow Regime by Parallel Implementation of Lattice Boltzmann Method on GPUs

In this thesis, fluid flow and heat transfer in two-dimensional microchannels are studied numerically. A computer code based on the Lattice Boltzmann Method (LBM) is developed for this purpose. The code is written using MATLAB and Jacket software and has the important feature of being able to run in parallel on Graphics Processing Units (GPUs). The […]
Oct, 15

CUDA Based Multi Objective Parallel Genetic Algorithms: Adapting Evolutionary Algorithms for Document Searches

This paper introduces a Multi Objective Parallel Genetic Algorithm (MOPGA) using the Compute Unified Device Architecture (CUDA) hardware for parallel processing. The algorithm demonstrates significant speed gains using affordable, scalable and commercially available hardware. The algorithm implements a document search using techniques such as Term Frequency Inverse Document Frequency (TF-IDF), Latent Semantic Analysis (LSA), Multi […]
Oct, 15

Geometry Based Visualization with OpenCL

This work targets the design and implementation of an isosurface extraction solution capable of handling large datasets. The Marching Cubes algorithm is the method used to extract the isosurfaces. These are graphical representations of points with a constant value (e.g. matter density) within volumetric datasets. A very useful approach to visualize particular regions of such […]
Oct, 15

Parallel Programming using OpenCL on Modern Architectures

This report is intended as a quick introduction to the OpenCL framework, and the aim is to facilitate a smooth transition to the use of OpenCL C for developers with previous GPGPU experience. The purpose of OpenCL is to allow developers to use all compute resources available on a heterogeneous hardware platform. As well as […]
Oct, 15

Iterative Krylov solution methods for geophysical electromagnetic simulations on throughput-oriented processing units

Many geoscientific applications involve boundary value problems arising in simulating electrostatic and electromagnetic fields for geophysical prospecting and subsurface imaging of electrical resistivity. Modeling complex geological media with three-dimensional finite difference grids gives rise to large sparse linear systems of equations. For such systems, we have implemented three common iterative Krylov solution methods on graphics […]
Oct, 15

Kinetics of liquid-solid phase transition in large nickel clusters

In this paper we have explored computationally the solidification process of large nickel clusters. This process has the characteristic features of the first order phase transition occurring in a finite system. The focus of our research is placed on the elucidation of correlated dynamics of a large ensemble of particles in the course of the […]
Oct, 14

Graphics Processing Unit based searching the critical slip surface of slopes by the Vector Sum Analysis Method

The Vector Sum Analysis Method (VSAM), with its clear physical and mechanical meaning in slope stability analysis, can be effectively applied to calculate the slope safety factor and to search for the critical slip surface of a slope. In current research on VSAM, most people focus on the solutions or applications in slope engineering without considering […]
Oct, 14

Real-Time Use of GPUs in NA62 Experiment

We describe a pilot project for the use of GPUs in a real-time triggering application in the early trigger stages at the CERN NA62 experiment, and the results of the first field tests together with a prototype data acquisition (DAQ) system. This pilot project within NA62 aims at integrating GPUs into the central L0 trigger […]
Oct, 14

Exploiting Data Parallelism in GPUs

Mainstream microprocessor design no longer delivers performance boosts by increasing the processor clock frequency due to power and thermal constraints. Nonetheless, advances in semiconductor fabrication still allow the transistor density to increase at the rate of Moore’s law. This has resulted in the proliferation of many-core parallel architectures and accelerators, among which GPUs (graphics processing […]
Oct, 14

Lattice Group Models: GPU Acceleration and Numerics

Lattice group models (LGpM) are kinetic models on integer lattices derived from the automorphism group of the lattice. In recent decades it was too expensive to simulate large systems (100 – 1000 velocities in a 2D or 3D model), with complex physical two- or three-dimensional domains, on normal computers or clusters within an […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
