11104

Posts

Dec, 11

Multilayered Abstractions for Partial Differential Equations

How do we build maintainable, robust, and performance-portable scientific applications? This thesis argues that the answer to this software engineering question in the context of the finite element method is through the use of layers of Domain-Specific Languages (DSLs) to separate the various concerns in the engineering of such codes. Performance-portable software achieves high performance […]
Dec, 11

Job Parallelism using Graphical Processing Unit Individual Multi-Processors and Localised Memory

Graphical Processing Units(GPUs) are usually programmed to provide data-parallel acceleration to a host processor. Modern GPUs typically have an internal multi-processor (MP) structure that can be exploited in an unusual way to offer semi-independent task parallelism providing the MPs can operate within their own localised memory and apply data-parallelism to their own problem subset. We […]
Dec, 11

Runtime Support toward Transparent Memory Access in GPU-accelerated Heterogeneous Systems

GPU has become a popular parallel accelerator in modern heterogeneous systems for its great parallelism and superior energy efficiency. However, it also extremely complicates programing the memory system in such heterogeneous systems, due to the non-continuous memory spaces on CPU and GPU, and a two-level memory hierarchy on a GPU itself. The complexity of this […]
Dec, 11

A New Software Based GPU Framework

A software based GPU design, where most of the 3D pipeline is executed in software on shaders, with minimal support from custom hardware blocks, provides three benefits, it: (1) simplifies the GPU design, (2) turns 3D graphics into a general purpose application, and (3) opens the door for applying compiler optimization to the whole 3D […]
Dec, 11

A GPU-Accelerated Framework for Image Processing and Computer Vision

This paper presents and briefly describes the state of the art of accelerating image processing with graphics hardware (GPU) and discusses some of its caveats. Then it describes GpuCV, an open source multiplatform library for GPU-accelerated image processing and Computer Vision operators and applications. It is meant for computer vision scientist not familiar with GPU […]
Dec, 11

High Performance Poisson Equation Solver for Hybrid CPU/GPU Systems

We investigated the possible way for treatment of electrostatic interactions by solving numerically Poisson’s equation using Conjugate Gradient method and Stabilized BiConjugate Gradient method. The aim of the research was to test the execution time of prototype programs running on BLueGene/P and CPU/GPU system. The results show that the tested methods are applicable for electrostatics […]
Dec, 11

GPU Accelerated Parallel Iris Localization

Iris recognition is quite a computation intensive task with huge amounts of pixel processing. After the image acquisition of the eye, Iris recognition is basically divided into Iris localization, Feature Extraction and Matching steps. Each of these tasks involves a lot of processing. It thus becomes essential to improve the performance of each step to […]
Dec, 11

Evaluating tradeoff between recall and performance of GPU permutation index

Query-by-content, by means of similarity search, is a fundamental operation for applications that deal with multimedia data. For this kind of query it is meaningless to look for elements exactly equal to a given one as query. Instead, we need to measure the dissimilarity between the query object and each database object. This search problem […]
Dec, 11

An Analytical Approach of Mars Rovers by Using GPU Technology and Genetic Algorithm

Mars Rovers are the unmanned machines on planet MARS which are send to analyze and provide details about the planet. GPU and Genetic Algorithms are upcoming technologies used in Mars Rovers for analyzing and sending the data back to the Earth base station. GPU stands for Graphics Processing Unit in which Image compression is the […]
Dec, 11

Learning Random Forests on the GPU

Random Forests are a popular and powerful machine learning technique, with several fast multi-core CPU implementations. Since many other machine learning methods have seen impressive speedups from GPU implementations, applying GPU acceleration to random forests seems like a natural fit. Previous attempts to use GPUs have relied on coarse-grained task parallelism and have yielded inconclusive […]
Dec, 9

Integrated GPUs: how useful are they in HPC?

Due to their potential computation power, GPUs are often used for high performance computing. However, discrete GPUs are connected to the CPU via the PCIe bus, which can cause bottlenecks due to high latency and low bandwidth to the CPU. Lately, integrated GPUs have become more common, and due to being integrated on the CPU-chip, […]
Dec, 9

Efficient Volume Rendering in CUDA Path Tracer

Volume rendering has always been a popular topic in computer graphics as it captures more realistic rendering results. However, most of the rendering targets are focused on participating media distributed in the scene. In this project we integrate an Octree data structure with our path tracer in CUDA to achieve efficient volume rendering of models. […]

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us: