7177

Posts

Feb, 6

GPU-based Swendsen-Wang multi-cluster algorithm for the simulation of two-dimensional classical spin systems

We present the GPU calculation with the common unified device architecture (CUDA) for the Swendsen-Wang multi-cluster algorithm of two-dimensional classical spin systems. We adjust the two connected component labeling algorithms recently proposed with CUDA for the assignment of the cluster in the Swendsen-Wang algorithm. Starting with the q-state Potts model, we extend our implementation to […]
Feb, 6

A toolkit to describe and interactively display three-manifolds embedded in four-space

A data structure and toolkit are presented here that allow for the description and manipulation of mathematical models of three-manifolds and their interactive display from multiple viewpoints via the OpenGL 3D graphics package. The data structure and vector math package can be extended to support an arbitrary number of Euclidean spatial dimensions. A model in […]
Feb, 5

Comparing Linear and Convex Relaxations for Stereo and Motion

We provide an analysis of several linear programming relaxations for the problems of stereo disparity estimation and motion estimation. The problems are cast as integer linear programs and their relaxations are solved approximately either by block coordinate descent (TRW-S and MPLP) or by smoothing and convex optimization techniques. We include a comparison to graph cuts. […]
Feb, 5

A Fast Parallel Implementation of Queue-based Morphological Reconstruction using GPUs

In this paper we develop and experimentally evaluate a novel GPU-based implementation of the morphological reconstruction operation. This operation is commonly used in the segmentation and feature computation steps of image analysis pipelines, and often used as a component in other image processing operations. Our implementation builds on a fast hybrid CPU algorithm, which employs […]
Feb, 5

Accelerating Outlier Detection with Uncertain Data using Graphics Processors

Outlier detection (also known as anomaly detection) is a common data mining task in which data points that lie outside expected patterns in a given dataset are identified. This is useful in areas such as fault detection, intrusion detection and in pre-processing before further analysis. There are many approaches already in use for outlier detection, […]
Feb, 5

Efficient Computation of SOM for Outage Database

This paper describes a utilization of the Self Organizing Map (SOM) method for the analysis of power outage data. SOM, to be already used in many fields, is based on the Kohonen self-organizing neural network and it is known to capture underlying concepts. We apply this method for a unified database of power outages to […]
Feb, 5

Performance Evaluation of Particle Swarm Optimization Algorithms on GPU Using CUDA

Particle Swarm Optimization (PSO) may be easy but powerful optimization algorithm relying on the social behavior of the particles. PSO has become popular due to its simplicity and its effectiveness in wide range of application with low computational cost. The main objective of this paper is to implement a parallel Asynchronous version and Synchronous versions […]
Jan, 31

Raytracing Dynamic Scenes on GPU

Raytracing dynamic scenes at interactive rates to realtime rates has received a lot of attention recently. In this dissertation, We present a few strategies for high performance ray tracing on an off-theshelf commodity Graphics Processing Unit (GPU) traditionally used for accelerating gaming and other graphics applications. We utilize the Grid datastructure for spatially arranging the […]
Jan, 31

Decompilation of LLVM IR

Recently, in many important domains, high-level languages have become the code representations with widest platform support surpassing any low-level language in their area with respect to completeness and importance as exchange format (e.g. OpenCL for data-parallel computing, GLSL/HLSL for shader programs, JavaScript for the web). The code representations of many actively-developed compiler frameworks [JVM,LLVM,FIRM] are […]
Jan, 31

The Virtual OpenCL (VCL) Cluster Platform

Heterogeneous computing systems can dramatically increase the performance of parallel applications on clusters. Currently, applications that utilize GPU and APU devices, run their device-specific code only on devices of the same computer were the application runs. This paper presents the Virtual OpenCL (VCL) cluster platform that can run unmodified OpenCL applications transparently on clusters with […]
Jan, 31

Graphical processing unit implementation of an integrated shape-based active contour: Application to digital pathology

Commodity graphics hardware has become a cost-effective parallel platform to solve many general computational problems. In medical imaging and more so in digital pathology, segmentation of multiple structures on high-resolution images, is often a complex and computationally expensive task. Shape-based level set segmentation has recently emerged as a natural solution to segmenting overlapping and occluded […]
Jan, 31

An OpenCL implementation for the solution of TDSE on GPU and CPU architectures

Open Computing Language (OpenCL) is a parallel processing language that is ideally suited for running parallel algorithms on Graphical Processing Units (GPUs). In the present work we report the development of a generic parallel single-GPU code for the numerical solution of a system of first-order ordinary differential equations (ODEs) based on the openCL model. We […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: