11082

Posts

Dec, 8

A Highly Extensible Framework for Molecule Dynamic Simulation on GPUs

Molecular dynamics (MD) was widely used in chemistry and bio molecules. Numerous attempts have been made to accelerate MD simulations. CUDA enabled NVIDIA Graphics processing units (GPUs) use as a general purpose parallel computer chips as CPU. But it is not easy to port a program to GPU. We present a highly extensible framework for […]
Dec, 8

Partial Parallelization of the Successive Projections Algorithm using Compute Unified Device Architecture

This paper proposes a partial parallelization for the Successive Projections Algorithm (SPA), which is a variable selection technique designed for use with Multiple Linear Regression. This implementation is aimed at improving the computational efficiency of SPA, without changing the outcome of the algorithm. For this purpose, a new strategy of inverse matrix calculation is employed. […]
Dec, 8

A GPU-based Multiresolution Pipeline for Compressed Volume Rendering

The recent improvements in data-acquisition methods have resulted in the emergence of increasingly larger volumetric datasets. The design of GPU volume rendering solutions must have into account this trend while dealing with the limited available memory in a graphics card. In this work, we present a pipeline for volume rendering that stores a compressed version […]
Dec, 8

Waste Not… Efficient Co-Processing of Relational Data

The variety of memory devices in modern computer systems holds opportunities as well as challenges for data management systems. In particular, the exploitation of Graphics Processing Units (GPUs) and their fast memory has been studied quite intensively. However, current approaches treat GPUs as systems in their own right and fail to provide a generic strategy […]
Dec, 8

GPU-Accelerated Crack Path Computation Based on a Phase Field Approach for Brittle Fracture

In recent years, a new approach to analyze fracturing has been developed. The so-called phase field models approximate cracks by a scalar, macroscopic field variable that distinguishes between broken and undamaged material. The phase field approach to fracture has significant advantages over more established methods. However it is necessary to solve a coupled set of […]
Dec, 6

A Distributed Data Mining Framework Accelerated with Graphics Processing Units

In the context of processing high volumes of data, the recent developments have led to numerous models and frameworks of distributed processing running on clusters of commodity hardware. On the other side, the Graphics Processing Unit (GPU) has seen much enthusiastic development as a device for general-purpose intensive parallel computation. In this paper we propose […]
Dec, 6

A Quantitative Comparison of Emulated Shared Memory Architectures to Current Multicore CPUs and GPUs

The performance of current multicore CPUs and GPUs is limited in computations making frequent use of communication/synchronization between the subtasks executed in parallel. This is because the directory-based cache systems scale weakly and/or the cost of synchronization is high. The Emulated Shared Memory (ESM) architectures relying on multithreading and efficient synchronization mechanism have been developed […]
Dec, 6

Parallel tree-ensemble algorithms for GPUs using CUDA

We present two new parallel implementations of the tree-ensemble algorithms Random Forest (RF) and Extremely randomized trees (ERT) for emerging many-core platforms, e.g., contemporary graphics cards suitable for general-purpose computing (GPGPU). Random Forest and Extremely randomized trees are ensemble learners for classification and regression. They operate by constructing a multitude of decision trees at training […]
Dec, 6

Speeding up the small progress measures algorithm for parity games using the GPU

Solving parity games is interesting because it is equivalent to model checking for mu-calculus. The small progress measures (SPM) algorithm by Jurdzinski is originally a sequential algorithm for solving parity games. The nature of this algorithm allows easy parallelization, and previous research has already adapted it to work on multi-core machines. Here, SPM is adapted […]
Dec, 6

PyFR: An Open Source Framework for Solving Advection-Diffusion Type Problems on Streaming Architectures using the Flux Reconstruction Approach

High-order numerical methods for unstructured grids combine the superior accuracy of high-order spectral or finite difference methods with the geometric flexibility of low-order finite volume or finite element schemes. The Flux Reconstruction (FR) approach unifies various high-order schemes for unstructured grids within a single framework. Additionally, the FR approach exhibits a significant degree of element […]
Dec, 6

Similarity Search in Metric Spaces on Parallel multi-core and multi-GPU Platforms

This thesis has proposed a set of algorithms and strategies to solve similarity searches in metric spaces using different parallel platforms. In the first part of the thesis, we have used a multi-core platform, where we found that particular strategies are more suitable depending on the traffic query, obtaining a high speed-up (up to 7.9x […]
Dec, 6

A Fast Implementation of Parallel Discrete-Event Simulation on GPGPU

Modern General Purpose Graphics Processing Units(GPGPUs) offer much more computational power than recent CPUs by providing a vast number of simple, data parallel, multithreaded cores. In this study, we focus on the use of a GPGPU to perform parallel discrete-event simulation. Our approach is to use a modified service time distribution function to allow more […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: