Posts
Oct, 23
Overdetermined Shooting Methods for Computing Standing Water Waves with Spectral Accuracy
A high-performance shooting algorithm is developed to compute time-periodic solutions of the free-surface Euler equations with spectral accuracy in double and quadruple precision. The method is used to study resonance and its effect on standing water waves. We identify new nucleation mechanisms in which isolated large-amplitude solutions, and closed loops of such solutions, suddenly exist […]
Oct, 23
An Approach for Traffic Forecast with GPU Computing & Cellular Automata Model
Traffic forecast has been of practical interest for modern society, mainly in minimizing of jammed traffic effects due to the saturation of roads, as well as predictable the impact of road interventions. In this way, a family of computational methods that represent basic traffic characteristics is based on Cellular Automata (CA). Moreover, the simulation of […]
Oct, 22
A (ir)regularity-aware task scheduler for heterogeneous platforms
This paper addresses the design, implementation and validation of an effective scheduling scheme for both regular and irregular applications on heterogeneous platforms. The scheduler uses an empirical performance model to dynamically schedule the workload, organized into a given number of chunks, and follows the Heterogeneous Earliest Finish Time (HEFT) scheduling algorithm, which ranks the tasks […]
Oct, 22
Hybrid MPI/GPU Interpolation for Grid DEM Construction
The proliferation of lidar technology in remote sensing has resulted in extremely large, high resolution point clouds covering a wide variety of terrain. Constructing a grid digital elevation model (DEM) from these large data sets requires extensive computational resources and ample disk space. We propose a framework for leveraging modern computing resources including multi-core distributed […]
Oct, 22
Streaming Dynamic Coarse-Grained CPU/GPU Workloads with Heterogeneous Pipelines in FastFlow
Software pipelines permit the decomposition of a repetitive sequential process into a succession of distinguishable sub-processes called stages, each of which can be concurrently executed on a distinct processing element. This paper presents a heterogeneous streaming pipeline implementation using the FastFlow skeletal library for a numerical linear algebra code. By introducing minimal memory management, we […]
Oct, 22
MR-API: A Comprehensive API Framework for Heterogeneous Multi-core Systems using Map Reduce Programming Model
In this paper we propose MR-API framework, a comprehensive framework Application Programming Interface for Heterogeneous Multi-core systems in a cloud environment. Recent trends have shown the significant importance of adopting Heterogeneity in Multi-cored processing environment due to their potential high performance and energy efficiency. In order to extract the maximum potential of underlying hardware, the […]
Oct, 22
Stereo Matching using Multi-Resolution Images on CUDA
Stereo matching technique is used to estimate the depth of objects in an image acquired from real time scenes. The basic algorithm is not very complex but is computationally exhaustive and hinders its usage for real time applications. However, this algorithm is highly data parallel and it highly suitable for execution on GPGPU (General-purpose graphical […]
Oct, 20
Ray Tracing of Volumetric Data in Real Time
Graphics processors of today are highly efficient, parallel processors, capable of rendering complex scenes consisting of millions of polygons on the screen each and every second. They are highly specialized towards game graphics and similar, polygon based graphics. In the past, however, they have not been very efficient at rendering volumetric data, and especially not […]
Oct, 20
The impact of GPU/Multicore in Signal Processing: a quantitative approach
This paper presents a meaningful practical performance comparison between the last generation of Graphics Processing Units (GPUs) and the last generation multi-core CPUs when they are used to solve given Signal Processing algorithms. Two kinds of tests were considered: when GPU pre-designed computational libraries were available, and when the GPU code was developed by the […]
Oct, 20
Hierarchical Exploration of Volumes Using Multilevel Segmentation of the Intensity-Gradient Histograms
Visual exploration of volumetric datasets to discover the embedded features and spatial structures is a challenging and tedious task. In this paper we present a semi-automatic approach to this problem that works by visually segmenting the intensitygradient 2D histogram of a volumetric dataset into an exploration hierarchy. Our approach mimics user exploration behavior by analyzing […]
Oct, 20
Empirical analysis of a parallel data mining algorithm on a graphic processor
In this thesis, we analyze in an empirical way a different approach of the algorithm SPAM (Sequential PAttern Mining using A Bitmap Representation) made by J. Ayres, J. Gehrke, T. Yiu and J. Flannick from Cornell University, exploiting GPUs. SPAM is a novel approach for FSM (Frequent Sequence Mining) where the algorithm is not looking […]
Oct, 20
Research on DSP-GPU Heterogeneous Computing System
In this paper, DSP-GPU heterogeneous computing system is studied and the system architecture is designed. The task scheduling model is analyzed and the discrete particle swarm optimization algorithm is used for the DSP-GPU heterogeneous computing. The communication framework between DSP and GPU is designed for the heterogeneous computing.