Posts

Feb, 27

Efficient hierarchical parallel genetic algorithms using grid computing

In this paper, we present an efficient Hierarchical Parallel Genetic Algorithm framework using Grid computing (GE-HPGA). The framework is developed using standard Grid technologies, and has two distinctive features: (1) an extended GridRPC API to conceal the high complexity of the Grid environment, and (2) a metascheduler for seamless resource discovery and selection. To assess […]
Feb, 27

GPU ray casting of virtual globes

Accurately rendering an ellipsoid is a fundamental problem for virtual globes in GIS and aerospace applications where the Earth’s standard reference surface is non-spherical. The traditional approach of tessellating an ellipsoid into triangles and rendering via rasterization has several drawbacks [Miller and Gaskins 2009]. Geodetic grid tessellations oversample at the poles (2a), which leads to […]
Feb, 27

Data Mining and Machine Learning in Astronomy

We review the current state of data mining and machine learning in astronomy. ‘Data Mining’ can have a somewhat mixed connotation from the point of view of a researcher in this field. If used correctly, it can be a powerful approach, holding the potential to fully exploit the exponentially increasing amount of available data, promising […]
Feb, 27

Data Buffering Optimization Methods toward a Uniform Programming Interface for GPU-based Applications

The massive computational power available in off-the-shelf Graphics Processing Units (GPUs) can pave the way for their usage in general-purpose applications. Current interfaces to program GPU operation are still oriented towards graphics processing. This paper focuses on disparities among those programming interfaces and proposes an extension to the recently developed Caravela […]
Feb, 27

A Method for Accelerating Bronchoscope Tracking Based on Image Registration by GPGPU

This paper presents an acceleration method for tracking a bronchoscope based on image registration. This method tracks a bronchoscope by image registration between real bronchoscopic images and virtual ones derived from CT images. However, since the computation cost of image registration, especially generating virtual bronchoscopic (VB) images, is quite high, it is difficult to track […]
Feb, 27

CUDAlign: using GPU to accelerate the comparison of megabase genomic sequences

Biological sequence comparison is a very important operation in Bioinformatics. Even though there do exist exact methods to compare biological sequences, these methods are often neglected due to their quadratic time and space complexity. In order to accelerate these methods, many GPU algorithms were proposed in the literature. Nevertheless, all of them restrict the size […]
Feb, 27

Size Matters: Space/Time Tradeoffs to Improve GPGPU Applications Performance

GPUs offer drastically different performance characteristics compared to traditional multicore architectures. To explore the tradeoffs exposed by this difference, we refactor MUMmer, a widely-used, highly-engineered bioinformatics application which has both CPU- and GPU-based implementations. We synthesize our experience as three high-level guidelines to design efficient GPU-based applications. First, minimizing the communication overheads is as important […]
Feb, 27

Accelerating CUDA Graph Algorithms at Maximum Warp

Graphs are powerful data representations favored in many computational domains. Modern GPUs have recently shown promising results in accelerating computationally challenging graph problems but their performance suffers heavily when the graph structure is highly irregular, as most real-world graphs tend to be. In this study, we first observe that the poor performance is caused by […]
Feb, 27

A Domain-Specific Approach To Heterogeneous Parallelism

Exploiting heterogeneous parallel hardware currently requires mapping application code to multiple disparate programming models. Unfortunately, general-purpose programming models available today can yield high performance but are too low-level to be accessible to the average programmer. We propose leveraging domain-specific languages (DSLs) to map high-level application code to heterogeneous devices. To demonstrate the potential of this […]
Feb, 26

StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines

Multicore machines equipped with accelerators are becoming increasingly popular. The TOP500-leading RoadRunner machine is probably the most famous example of a parallel computer mixing IBM Cell Broadband Engines and AMD Opteron processors. Other architectures, featuring GPU accelerators, are expected to appear in the near future. To fully tap into the potential of these hybrid machines, […]
Feb, 26

A Tuning Framework for Software-Managed Memory Hierarchies

New architectures are emerging at a rapid pace: architectures with multiple processing units on a chip and with deep memory hierarchies have become pervasive, while architectures with software-managed memory hierarchies (such as the Sony/Toshiba/IBM Cell processor) have gained popularity. Due to the increased complexity of architectures, re-targeting a legacy application to a new architecture requires […]
Feb, 26

Real-Time Approaches to Computer Vision

Perhaps our extensive reliance on visual sensory input makes the use of artificial visual sensors seem like an intuitive choice. Thus, Machine Vision or Computer Vision has become an exciting field of research, finding its way into many industrial applications. The results from Computer Vision research can be incorporated in autonomous machine navigation, industrial […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
