Posts
Aug, 30
A Feedback Approach to Task Partitioning in Heterogeneous Architectures
Personal Computers of today are based on complex architectures often with multiple high performance computational units for various dedicated purposes. The General Purpose GPU is one such example where Graphic Processing Units are being used for more general purpose computing. In this paper, we target such architectures and focus on Load Balancing and Task Partitioning […]
Aug, 30
Real-Time GPU Path Tracing
In this paper, we present a simple, yet efficient implementation of the path tracing algorithm for GPUs. A reformulation of Russian Roulette is used to achieve high SIMT utilization, which leads to real-time performance in Kajiya’s classic scene, using a single GPU. We apply our scheme to larger scenes in the Brigade system, an experimental […]
Aug, 30
Evolutionary Algorithm for Optimizing Parameters of GPGPU-based Image Segmentation
The use of digital microscopy allows diagnosis through automated quantitative and qualitative analysis of the digital images. Often to evaluate the samples, the first step is determining the number and location of cell nuclei. For this purpose, we have developed a GPGPU based data-parallel region growing algorithm that is equally as accurate as the already […]
Aug, 30
Performance Portability Strategies for Computational Fluid Dynamics (CFD) Applications on HPC Systems
Achieving high computational performance on large-scale high performance computing (HPC) system demands optimizations to exploit hardware characteristics. Various optimizations and research strategies are implemented to improve performance with emphasis on single or multiple hardware characteristics. Among these approaches, the domain-specific approach involving domain expertise shows its high potential in achieving high performance and maintaining performance […]
Aug, 30
Swendsen-Wang Multi-Cluster Algorithm for the 2D/3D Ising Model on Xeon Phi and GPU
Simulations of the critical Ising model by means of local update algorithms suffer from critical slowing down. One way to partially compensate for the influence of this phenomenon on the runtime of simulations is using increasingly faster and parallel computer hardware. Another approach is using algorithms that do not suffer from critical slowing down, such […]
Aug, 30
8th International Symposium on Intelligent Distributed Computing, IDC’2014
The emergent field of Intelligent Distributed Computing focuses on the development of a new generation of intelligent distributed systems. It faces the challenges of adapting and combining research in the fields of Intelligent Computing and Distributed Computing. Intelligent Computing develops methods and technology ranging from classical artificial intelligence, computational intelligence and multi-agent systems to game […]
Aug, 30
Fifth International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, HEART 2014
The HEART symposium is an international forum on state-of-the-art research in high-performance and power-efficient computing using accelerator technologies such as FPGAs, GPGPUs, and/or specialized accelerators. The scope of the meeting includes, but is not limited to: Architectures and systems: Novel systems/platforms for efficient acceleration based on FPGA, GPU, and other devices Heterogeneous processor architectures and […]
Aug, 30
24th International Conference on Field Programmable Logic and Applications, FPL 2014
The International Conference on Field Programmable Logic and Applications (FPL) is the first and largest conference covering the rapidly growing area of field-programmable logic. During the past 23 years, many of the advances achieved in reconfigurable system architectures, applications, embedded processors, design automation methods (EDA) and tools have been first published in the proceedings of […]
Aug, 29
High Performance Algorithms to Improve the Runtime Computation of Spacecraft Trajectories
Increasing space mission complexity coupled with challenging science requirements are driving the need for fast and robust space trajectory design and simulation tools. Current state-of-the art methods and techniques are often found to be lacking, particularly when problems are scaled to the future demands of mission design. This challenging problem is addressed in this thesis […]
Aug, 29
Fast network communities visualization on massively parallel GPU architecture
Modeling phenomena with networks has a wide application in many disciplines including biology, economics, sociology, and computer science. In network analysis modularity is an important measure for automatically extracting communities of closely connected nodes. Another important aspect of the network analysis is network visualization. Different techniques for network layout generation exist and the force-driven layout […]
Aug, 29
Efficient Sparse Matrix-Vector Multiplication on x86-Based Many-Core Processors
Sparse matrix-vector multiplication (SpMV) is an important kernel in many scientific applications and is known to be memory bandwidth limited. On modern processors with wide SIMD and large numbers of cores, we identify and address several bottlenecks which may limit performance even before memory bandwidth: (a) low SIMD efficiency due to sparsity, (b) overhead due […]
Aug, 29
Numerical simulations of acoustic waves with the graphic acceleration GAMER code
We present results of numerical simulations of acoustic waves with the use of the Graphics Processing Unit (GPU) acceleration GAMER code which implements a second-order Godunov-type numerical scheme and adaptive mesh refinement (AMR). The AMR implementation is based on constructing a hierarchy of grid patches with an octree data structure. In this code a hybrid […]