2181

Posts

Dec, 13

Visual-model-based, real-time 3D pose tracking for autonomous navigation: methodology and experiments

This paper presents a novel 3D-model-based computer-vision method for tracking the full six degree-of-freedom (dof) pose (position and orientation) of a rigid body, in real-time. The methodology has been targeted for autonomous navigation tasks, such as interception of or rendezvous with mobile targets. Tracking an object’s complete six-dof pose makes the proposed algorithm useful even […]
Dec, 13

GPU-ABiSort: Optimal Parallel Sorting on Stream Architectures

In this paper, we present a novel approach for parallel sorting on stream processing architectures. It is based on adaptive bitonic sorting. For sorting n values utilizing p stream processor units, this approach achieves the optimal time complexity O((n log n)/p). While this makes our approach competitive with common sequential sorting algorithms not only from […]
Dec, 13

Toward Acceleration of RSA Using 3D Graphics Hardware

Demand in the consumer market for graphics hardware that accelerates rendering of 3D images has resulted in commodity devices capable of astonishing levels of performance. These results were achieved by specifically tailoring the hardware for the target domain. As graphics accelerators become increasingly programmable however, this performance has made them an attractive target for other […]
Dec, 13

Remotely Keyed Cryptographics Secure Remote Display Access Using (Mostly) Untrusted Hardware

Software that covertly monitors a user’s actions, also known as spyware, has become a first-level security threat due to its ubiquity and the difficulty of detecting and removing it. Such software may be inadvertently installed by a user that is casually browsing the web, or may be purposely installed by an attacker, or even by […]
Dec, 13

A Framework for Megascale Agent Based Model Simulations on Graphics Processing Units

Agent-based modeling is a technique for modeling dynamic systems from the bottom up. Individual elements of the system are represented computationally as agents. The systemlevel behaviors emerge from the micro-level interactions of the agents. Contemporary stateof-the-art agent-based modeling toolkits are essentially discrete-event simulators designed to execute serially on the Central Processing Unit (CPU). They simulate […]
Dec, 13

A Framework for Megascale Agent Based Model Simulations on the GPU

This paper presents a series of efficient, data parallel algorithms for simulating agent based models. These include methods for handling environment updates, agent interactions and replication. One of the most important techniques presented in this work is a novel stochastic allocator which enables parallel agent replication in O(1) average time. These techniques can be easily […]
Dec, 13

Accelerating Compute-Intensive Applications with GPUs and FPGAs

Accelerators are special purpose processors designed to speed up compute-intensive sections of applications. Two extreme endpoints in the spectrum of possible accelerators are FPGAs and GPUs, which can often achieve better performance than CPUs on certain workloads. FPGAs are highly customizable, while GPUs provide massive parallel execution resources and high memory bandwidth. Applications typically exhibit […]
Dec, 13

Real-world comparison of CPU and GPU implementations of SNPrank: a network analysis tool for GWAS

MOTIVATION: Bioinformatics researchers have a variety of programming languages and architectures at their disposal, and recent advances in graphics processing unit (GPU) computing have added a promising new option. However, many performance comparisons inflate the actual advantages of GPU technology. In this study, we carry out a realistic performance evaluation of SNPrank, a network centrality […]
Dec, 13

New Row-grouped CSR format for storing the sparse matrices on GPU with implementation in CUDA

In this article we present a new format for storing sparse matrices. The format is designed to perform well mainly on the GPU devices. We present its implementation in CUDA. The performance has been tested on 1,600 different types of matrices and we compare our format with the Hybrid format. We give detailed comparison of […]
Dec, 13

Toward a multicore architecture for real-time ray-tracing

Significant improvement to visual quality for real-time 3D graphics requires modeling of complex illumination effects like soft-shadows, reflections, and diffuse lighting interactions. The conventional Z-buffer algorithm driven GPU model does not provide sufficient support for this improvement. This paper targets the entire graphics system stack and demonstrates algorithms, a software architecture, and a hardware architecture […]
Dec, 13

Fast Soft Self-Shadowing on Dynamic Height Fields

We present a new, real-time method for rendering soft shadows from large light sources or lighting environments on dynamic height fields. The method first computes a horizon map for a set of azimuthal directions. To reduce sampling, we compute a multi-resolution pyramid on the height field. Coarser pyramid levels are indexed as the distance from […]
Dec, 13

Extensions of Parallel Coordinates for Interactive Exploration of Large Multi-Timepoint Data Sets

Parallel coordinate plots (PCPs) are commonly used in information visualization to provide insight into multi-variate data. These plots help to spot correlations between variables. PCPs have been successfullyapplied to unstructured datasets up to a few millions of points. In this paper, we present techniques to enhance the usability of PCPs forthe exploration of large, multi-timepoint […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: