Posts
Oct, 23
Task Parallelism and Data Distribution: An Overview of Explicit Parallel Programming Languages
Programming parallel machines as effectively as sequential ones would ideally require a language that provides high-level programming constructs to avoid the programming errors frequent when expressing parallelism. Since task parallelism is considered more error-prone than data parallelism, we survey six popular and efficient parallel language designs that tackle this difficult issue: Cilk, Chapel, X10, Habanero-Java, […]
Oct, 23
Techniques for designing GPGPU games
The increasing level of realism in digital games depends not only on the enhancement of modeling and rendering effects, but also on the improvement of other aspects such as animation, character artificial intelligence and physics simulation. Normally, games process most of their tasks on the CPU, using the GPU only for graphics processing. Several games […]
Oct, 23
GPUSync: A Framework for Real-Time GPU Management
The integration of graphics processing units (GPUs) into real-time systems has recently become an active area of research. However, prior research on this topic has failed to produce real-time GPU allocation methods that fully exploit the available parallelism in GPU-enabled systems. In this paper, a GPU management framework called GPUSync is described that enables increased […]
Oct, 23
Overdetermined Shooting Methods for Computing Standing Water Waves with Spectral Accuracy
A high-performance shooting algorithm is developed to compute time-periodic solutions of the free-surface Euler equations with spectral accuracy in double and quadruple precision. The method is used to study resonance and its effect on standing water waves. We identify new nucleation mechanisms in which isolated large-amplitude solutions, and closed loops of such solutions, suddenly exist […]
Oct, 23
An Approach for Traffic Forecast with GPU Computing & Cellular Automata Model
Traffic forecasting has been of practical interest to modern society, mainly for minimizing the effects of traffic jams caused by road saturation and for predicting the impact of road interventions. In this context, a family of computational methods that represent basic traffic characteristics is based on Cellular Automata (CA). Moreover, the simulation of […]
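As a rough illustration of how a cellular automaton can represent basic traffic behaviour, the sketch below implements Rule 184, a well-known elementary CA often used as a minimal traffic model. This is not the model from the paper, whose details are not shown in this excerpt. The function name `step_rule184` and the circular single-lane road are assumptions made for the example.

```python
def step_rule184(road):
    """Advance a circular single-lane road by one time step.

    `road` is a list of cells: 1 = car, 0 = empty.
    Every car moves one cell forward if the cell ahead is empty,
    otherwise it stays put. All cars are updated synchronously.
    """
    n = len(road)
    new_road = [0] * n
    for i, cell in enumerate(road):
        if cell == 1:
            ahead = (i + 1) % n  # wrap around: the road is a ring (assumption)
            if road[ahead] == 0:
                new_road[ahead] = 1  # free cell ahead: the car advances
            else:
                new_road[i] = 1      # blocked: the car waits
    return new_road


if __name__ == "__main__":
    road = [1, 1, 0, 0, 1, 0, 0, 0]
    for _ in range(4):
        print("".join("#" if c else "." for c in road))
        road = step_rule184(road)
```

Each cell's next state depends only on itself and its neighbours, so all cells can be updated independently. That independence is what makes this kind of model a natural fit for GPU execution.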
Oct, 22
A (ir)regularity-aware task scheduler for heterogeneous platforms
This paper addresses the design, implementation and validation of an effective scheduling scheme for both regular and irregular applications on heterogeneous platforms. The scheduler uses an empirical performance model to dynamically schedule the workload, organized into a given number of chunks, and follows the Heterogeneous Earliest Finish Time (HEFT) scheduling algorithm, which ranks the tasks […]
Oct, 22
Hybrid MPI/GPU Interpolation for Grid DEM Construction
The proliferation of lidar technology in remote sensing has resulted in extremely large, high-resolution point clouds covering a wide variety of terrain. Constructing a grid digital elevation model (DEM) from these large data sets requires extensive computational resources and ample disk space. We propose a framework for leveraging modern computing resources including multi-core distributed […]
Oct, 22
Streaming Dynamic Coarse-Grained CPU/GPU Workloads with Heterogeneous Pipelines in FastFlow
Software pipelines permit the decomposition of a repetitive sequential process into a succession of distinguishable sub-processes called stages, each of which can be concurrently executed on a distinct processing element. This paper presents a heterogeneous streaming pipeline implementation using the FastFlow skeletal library for a numerical linear algebra code. By introducing minimal memory management, we […]
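The pipeline idea described above can be sketched in a few lines. The example below is a minimal, generic illustration using Python threads and queues. It is not FastFlow (a C++ library) and not the paper's implementation. The names `run_stage` and `run_pipeline` are hypothetical, and each stage is assumed to be a plain one-argument function.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the stream


def run_stage(func, inbox, outbox):
    """Apply `func` to every item from `inbox` and forward results to `outbox`."""
    while True:
        item = inbox.get()
        if item is _DONE:
            outbox.put(_DONE)  # propagate shutdown to the next stage
            return
        outbox.put(func(item))


def run_pipeline(stages, items):
    """Run each stage in its own thread and return results in input order."""
    # One queue between each pair of neighbouring stages, plus input and output.
    queues = [queue.Queue() for _ in range(len(stages) + 1)]
    threads = [
        threading.Thread(target=run_stage, args=(func, queues[i], queues[i + 1]))
        for i, func in enumerate(stages)
    ]
    for t in threads:
        t.start()

    for item in items:
        queues[0].put(item)
    queues[0].put(_DONE)

    results = []
    while True:
        out = queues[-1].get()
        if out is _DONE:
            break
        results.append(out)

    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    print(run_pipeline([lambda x: x + 1, lambda x: x * 2], [1, 2, 3]))
```

While one stage works on item k, the previous stage can already process item k+1, which is where the concurrency comes from. In CPython, threads do not run CPU-bound code in parallel, so this sketch shows the structure rather than the speedup; a heterogeneous pipeline would instead bind stages to distinct CPU cores or GPUs.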
Oct, 22
MR-API: A Comprehensive API Framework for Heterogeneous Multi-core Systems using Map Reduce Programming Model
In this paper we propose the MR-API framework, a comprehensive Application Programming Interface framework for heterogeneous multi-core systems in a cloud environment. Recent trends have shown the importance of adopting heterogeneity in multi-core processing environments because of its potential for high performance and energy efficiency. In order to extract the maximum potential of the underlying hardware, the […]
Oct, 22
Stereo Matching using Multi-Resolution Images on CUDA
Stereo matching is used to estimate the depth of objects in images acquired from real-time scenes. The basic algorithm is not very complex, but it is computationally expensive, which hinders its use in real-time applications. However, the algorithm is highly data-parallel and is well suited to execution on GPGPU (General-purpose graphical […]
Oct, 20
Ray Tracing of Volumetric Data in Real Time
Today's graphics processors are highly efficient parallel processors, capable of rendering complex scenes consisting of millions of polygons on the screen every second. They are highly specialized towards game graphics and similar polygon-based graphics. In the past, however, they have not been very efficient at rendering volumetric data, and especially not […]
Oct, 20
The impact of GPU/Multicore in Signal Processing: a quantitative approach
This paper presents a practical performance comparison between the latest generation of Graphics Processing Units (GPUs) and the latest generation of multi-core CPUs when they are used to run given signal processing algorithms. Two kinds of tests were considered: when pre-designed GPU computational libraries were available, and when the GPU code was developed by the […]