Posts
Feb, 27
Efficient hierarchical parallel genetic algorithms using grid computing
In this paper, we present an efficient Hierarchical Parallel Genetic Algorithm framework using Grid computing (GE-HPGA). The framework is developed using standard Grid technologies, and has two distinctive features: (1) an extended GridRPC API to conceal the high complexity of the Grid environment, and (2) a metascheduler for seamless resource discovery and selection. To assess […]
Feb, 27
GPU ray casting of virtual globes
Accurately rendering an ellipsoid is a fundamental problem for virtual globes in GIS and aerospace applications where the Earth’s standard reference surface is non-spherical. The traditional approach of tessellating an ellipsoid into triangles and rendering via rasterization has several drawbacks [Miller and Gaskins 2009]. Geodetic grid tessellations oversample at the poles (2a), which leads to […]
Feb, 27
Data Mining and Machine Learning in Astronomy
We review the current state of data mining and machine learning in astronomy. ‘Data Mining’ can have a somewhat mixed connotation from the point of view of a researcher in this field. If used correctly, it can be a powerful approach, holding the potential to fully exploit the exponentially increasing amount of available data, promising […]
Feb, 27
Data Buffering Optimization Methods toward a Uniform Programming Interface for GPU-based Applications
The massive computational power available in off-the shelf Graphics Processing Units (GPUs) can pave the way for its usage in general purpose applications. Current interfaces to program GPU operation are still oriented towards graphics processing. This paper is focused in disparities on those programming interfaces and proposes an extension to of the recently developed Caravela […]
Feb, 27
A Method for Accelerating Bronchoscope Tracking Based on Image Registration by GPGPU
This paper presents an acceleration method for tracking a bronchoscope based on image registration. This method tracks a bronchoscope by image registration between real bronchoscopic images and virtual ones derived from CT images. However, since the computation cost of image registration, especially generating virtual bronchoscopic (VB) images, is quite expensive, it is dificult to track […]
Feb, 27
CUDAlign: using GPU to accelerate the comparison of megabase genomic sequences
Biological sequence comparison is a very important operation in Bioinformatics. Even though there do exist exact methods to compare biological sequences, these methods are often neglected due to their quadratic time and space complexity. In order to accelerate these methods, many GPU algorithms were proposed in the literature. Nevertheless, all of them restrict the size […]
Feb, 27
Size Matters: Space/Time Tradeoffs to Improve GPGPU Applications Performance
GPUs offer drastically different performance characteristics compared to traditional multicore architectures. To explore the tradeoffs exposed by this difference, we refactor MUMmer, a widely-used, highly-engineered bioinformatics application which has both CPU- and GPU-based implementations. We synthesize our experience as three high-level guidelines to design efficient GPU-based applications. First, minimizing the communication overheads is as important […]
Feb, 27
Accelerating CUDA Graph Algorithms at Maximum Warp
Graphs are powerful data representations favored in many computational domains. Modern GPUs have recently shown promising results in accelerating computationally challenging graph problems but their performance suffers heavily when the graph structure is highly irregular, as most real-world graphs tend to be. In this study, we first observe that the poor performance is caused by […]
Feb, 27
A Domain-Specific Approach To Heterogeneous Parallelism
Exploiting heterogeneous parallel hardware currently requires mapping application code to multiple disparate programming models. Unfortunately, general-purpose programming models available today can yield high performance but are too low-level to be accessible to the average programmer. We propose leveraging domainspecific languages (DSLs) to map high-level application code to heterogeneous devices. To demonstrate the potential of this […]
Feb, 26
StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines
Multicore machines equipped with accelerators are becoming increasingly popular. The TOP500-leading RoadRunner machine is probably the most famous example of a parallel computer mixing IBM Cell Broadband Engines and AMD opteron processors. Other architectures, featuring GPU accelerators, are expected to appear in the near future. To fully tap into the potential of these hybrid machines, […]
Feb, 26
A Tuning Framework for Software-Managed Memory Hierarchies
New architectures are emerging at a rapid pace, architectures with multiple processing units on a chip and with deep memory hierarchies have become pervasive; while architectures with software-managed memory hierarchies (such as the Sony/Toshiba/IBM Cell processor) have gained popularity. Due to the increased complexity of architectures, re-targeting a legacy application to a new architecture requires […]
Feb, 26
Real-Time Approaches to Computer Vision
Perhaps the extensive reliance on our visual sensory inputs, makes the use of artificial visual sensors seem like an intuitive choice. Thus, Machine Vision or Computer Vision has become an exciting field of research, finding its way into many industrial applications. The results from Computer Vision research can be incorporated in autonomous machine navigation, industrial […]