Posts
May, 18
StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators
GPUs have largely entered HPC clusters, as shown by the top entries of the latest TOP500 list. Exploiting such machines is, however, very challenging, not only because it requires combining two separate paradigms, MPI and CUDA or OpenCL, but also because nodes are heterogeneous and thus require careful load balancing within the nodes themselves. The current paradigms […]
May, 18
Relativistic hydrodynamics on graphics processing units
Hydrodynamics calculations have been successfully used in studies of the bulk properties of the Quark-Gluon Plasma, particularly of elliptic flow and shear viscosity. However, there are areas (for instance, event-by-event simulations for flow fluctuations and higher-order flow harmonics studies) where further advancement is hampered by the lack of an efficient and precise 3+1D program. This problem can […]
May, 18
Parallelizing AwSpPCA for robust facial recognition using CUDA
This study analyzes the performance benefits of parallelizing the Adaptive Weighted Sub-patterned Principal Component Analysis (AwSpPCA) algorithm, provided that the parallel implementation retains the accuracy of the serial version. The serial execution of this algorithm is analyzed first and then compared against its parallel implementation, both […]
May, 18
Parallel Optical Flow Detection Using CUDA
The aim of this thesis is to deploy a parallel implementation of the optical flow detection algorithm known as the Lucas-Kanade algorithm. As an important algorithm in the field of computer vision, it is believed to show much potential for benefiting from techniques used to enhance performance through parallel […]
May, 17
Evolutionary Simulation of Life Using CUDA
The idea behind this project was to create a simulation of the evolution of life in CUDA. In this simulation the creatures are individual entities that can interact with the world. Each has its own set of state information and DNA representing it. Through this DNA the creatures evolve via division and mating. The evolution […]
May, 17
Investigating the Impact of Data Parallelism and GPU Technology on Computer Gaming
According to current design trends, multithreaded multicore processors will be ubiquitous in every device. In computer gaming, chip-makers are adding more cores to fulfill next-generation performance requirements. A game engine has many ‘tasks’, and data parallelism is an important technique for concurrent execution of these tasks. However, effective implementation of multithreaded computer […]
May, 17
Fine-Grained Parallel Incomplete LU Factorization
This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros in the incomplete factors can be computed in parallel and asynchronously, using one or more sweeps that iteratively improve the accuracy of the factorization. Unlike existing parallel algorithms, the new algorithm does not depend on reordering the matrix. Numerical […]
May, 17
Hierarchical Transparent Programming for Heterogeneous Computing
Parallel computing and the development of parallel programs are a way to reduce program execution time. For many years, sequential optimization was designed without thinking about parallel tasks. Now that multi-core devices have arrived, code parallelization has become more important. Parallel computing is closely related to both the hardware and software points of view, […]
May, 17
Heterogeneity-aware Fault Tolerance using a Self-Organizing Runtime System
Due to the diversity and implicit redundancy in terms of processing units and compute kernels, off-the-shelf heterogeneous systems offer the opportunity to detect and tolerate faults during task execution in hardware as well as in software. To automatically leverage this diversity, we introduce an extension of an online-learning runtime system that combines the benefits of […]
May, 17
Using NVIDIA GPUs for Real-time Data Processing in a Holographic Radar System, webinar
In this webinar, Peter Wurmsdobler, Lead Software Architect, Aveillant, will give a short introduction to Aveillant’s Holographic Radar systems, the principles of holographic radars as opposed to scanning radar systems, and their computational requirements. Peter will go on to explore the technical challenges faced in the implementation of the mathematical algorithms needed, how […]
May, 16
The Next Steps for Folding@home, webinar
Folding@home is a large-scale volunteer distributed computing project, started on October 1, 2000. For over a decade, new types of hardware (such as GPUs, multi-core CPUs, and PS3) and algorithms have been pioneered in order to make significant advances in our ability to simulate diseases at the molecular scale. Join Professor Vijay Pande from Stanford […]
May, 16
An Introduction to CUDA Programming, webinar
Join Chris Mason, Product Manager, Acceleware, for an informative introduction to CUDA programming. The webinar will begin with a brief overview of CUDA and data-parallelism before focusing on the GPU programming model. Chris will explore the fundamentals of GPU kernels, host and device responsibilities, CUDA syntax and thread hierarchy. A programming demonstration of a simple […]