Posts
Oct, 17
Platform-independent parallelization of the Lattice Boltzmann method with OpenCL
Simulations, such as those in fluid dynamics, are computationally intensive. Since the Lattice Boltzmann method uses a discrete grid of cells to simulate the flow, there are no dependencies between individual cells during the computation of one time step. Therefore, the computation can easily be done in parallel. In recent years, multi-CPU computers have […]
Oct, 17
Design and Performance Evaluation of a Software Framework for Multi-Physics Simulations on Heterogeneous Supercomputers
Despite several decades of experience, numerical simulation in computational fluid dynamics is still an enormously challenging and active research field. Most simulation tasks of scientific and industrial relevance require the modeling of multiple physical effects and complex numerical algorithms, and have to be executed on supercomputers due to their high computational demands. Facing these […]
Oct, 17
Enhancing Productivity and Performance Portability of General-Purpose Parallel Programming
This work focuses on compiler and run-time techniques for improving the productivity and the performance portability of general-purpose parallel programming. More specifically, we focus on shared-memory task-parallel languages, where the programmer explicitly exposes parallelism in the form of short tasks that may outnumber the cores by orders of magnitude. The compiler, the run-time, and the […]
Oct, 17
Generalized Resource Allocation for the Cloud
Resource allocation is an integral, evolving part of many data center management problems such as virtual machine placement in data centers, network virtualization, and multi-path network routing. Since the problems are inherently NP-Hard, most existing systems use custom-designed heuristics to find a suitable solution. However, such heuristics are often rigid, making it difficult to extend […]
Oct, 17
Performance Analysis Cluster and GPU Computing Environment on Molecular Dynamic Simulation of BRV-1 and REM2 with GROMACS
One application that needs high-performance computing resources is molecular dynamics. Several software packages perform molecular dynamics simulations; one of them is the well-known GROMACS. Our previous experiment simulating the molecular dynamics of Indonesian-grown herbal compounds showed sufficient speedup on a 32-node cluster computing environment. In order to […]
Oct, 16
Accelerating Fully Homomorphic Encryption on GPUs
In a major breakthrough, in 2009 Gentry introduced the first plausible construction of a fully homomorphic encryption (FHE) scheme. FHE allows the evaluation of arbitrary functions directly on encrypted data on untrusted servers. In 2010, Gentry and Halevi presented the first FHE implementation on an IBM x3500 server. However, this implementation remains impractical due to […]
Oct, 16
Optimal structure of face detection algorithm using GPU architecture
This article describes a parallel algorithm for face detection in images on GPU architectures. The algorithm extends one from the OpenCV library. A computational structure is presented for the developed algorithm. In addition, a scheduling algorithm was developed to balance the workload among the GPU's threads.
Oct, 16
High-Performance Computing Algorithms for Constructing Inverted Files on Emerging Multicore Processors
Current trends in processor architectures increasingly include more cores on a single chip and more complex memory hierarchies, and such a trend is likely to continue in the foreseeable future. These processors offer unprecedented opportunities for speeding up demanding computations if the available resources can be effectively utilized. Simultaneously, parallel programming languages such as OpenMP […]
Oct, 16
Particle Filters on Multi-Core Processors
The particle filter is a Bayesian estimation technique based on Monte Carlo simulation. The nonparametric nature of particle filters makes them ideal for non-linear, non-Gaussian dynamic systems. Particle filtering has many applications in computer vision, robotics, and econometrics, to name just a few. Although superior to Kalman filters, particle filters have higher computational requirements, which […]
Oct, 16
Analysis of Single Phase Fluid Flow and Heat Transfer in Slip Flow Regime by Parallel Implementation of Lattice Boltzmann Method on GPUs
In this thesis, fluid flow and heat transfer in two-dimensional microchannels are studied numerically. A computer code based on the Lattice Boltzmann Method (LBM) is developed for this purpose. The code is written using MATLAB and Jacket software and has the important feature of being able to run in parallel on Graphics Processing Units (GPUs). The […]
Oct, 15
CUDA Based Multi Objective Parallel Genetic Algorithms: Adapting Evolutionary Algorithms for Document Searches
This paper introduces a Multi Objective Parallel Genetic Algorithm (MOPGA) using the Compute Unified Device Architecture (CUDA) hardware for parallel processing. The algorithm demonstrates significant speed gains using affordable, scalable and commercially available hardware. The algorithm implements a document search using techniques such as Term Frequency Inverse Document Frequency (TF-IDF), Latent Semantic Analysis (LSA), Multi […]
Oct, 15
Geometry Based Visualization with OpenCL
This work targets the design and implementation of an isosurface extraction solution capable of handling large datasets. The Marching Cubes algorithm is the method used to extract the isosurfaces. These are graphical representations of points with a constant value (e.g. matter density) within volumetric datasets. A very useful approach to visualize particular regions of such […]