Posts
May, 3
Feasibility Analysis of Low Cost Graphical Processing Units for Electromagnetic Field Simulations by Finite Difference Time Domain Method
Among several techniques available for solving Computational Electromagnetics (CEM) problems, the Finite Difference Time Domain (FDTD) method is one of the best suited approaches when a parallelized hardware platform is used. In this paper we investigate the feasibility of implementing the FDTD method using the NVIDIA GT 520, a low cost Graphical Processing Unit (GPU), […]
May, 3
AMD Developer Summit 2013, APU13
AMD is excited to bring together technology influencers from all over the world to share their vision and strategy for an open-standard heterogeneous computing ecosystem. Last year’s AMD Developer Summit saw the formation of the Heterogeneous System Architecture (HSA) Foundation. In 2011, Microsoft announced c++AMP, and AMD first revealed its Graphics Core Next Architecture. This […]
May, 2
Efficient implementation for QUAD stream cipher with GPUs
QUAD stream cipher uses multivariate polynomial systems. It has provable security based on the computational hardness assumption. More specifically, the security of QUAD depends on hardness of solving non-linear multivariate systems over a finite field, and it is known as an NP-complete problem. However, QUAD is slower than other stream ciphers, and an efficient implementation, […]
May, 2
GPU accelerated Trotter-Suzuki solver for quantum spin dynamics
The resolution of dynamics in out of equilibrium quantum spin systems relies at the heart of fundamental questions among Quantum Information Processing, Statistical Mechanics and Nano-Technologies. Efficient computational simulations of interacting many-spin systems are extremely valuable tools for tackling such questions. Here, we use the Trotter-Suzuki (TS) algorithm, a well-known strategy that provides the evolution […]
May, 2
A framework for data-access strategies in GPGPU programs
In recent years, graphics processing units (GPUs) became more and more popular as high performance processing units. Due to the availability of hundreds of cores, code fragments speed up significantly when they are transformed from CPU functions to GPU kernels. The transformation process is non-trivial and therefore error prone. Developing correct and efficient GPU accelerated […]
May, 2
Adding GPU Computing to Computer Organization Courses
How can parallel computing topics be incorporated into core courses that are taken by the majority of undergraduate students? This paper reports our experiences adding GPU computing with CUDA into the core undergraduate computer organization course at two different colleges. We have found that even though programming in CUDA is not necessarily easy, programmer control […]
May, 2
OpenMP performance analysis for many-core platforms with non-uniform memory access
One of the first steps in embedded-system design flow is to choose the most efficient implementation of the embedded software application. However, this is difficult to do at the earliest design stages because particular details of the final manycore HW platform are usually unknown and many possible mappings of the software tasks/threads have to be […]
May, 1
GPU Acceleration of the Variational Monte Carlo Method for Many Body Physics
High-Performance computing is one of the major areas making inroads into the future for large-scale simulation. Applications such as 3D nuclear test, Molecular Dynamics, and Quantum Monte Carlo simulations are now developed on supercomputers using the latest computing technologies. As per the TOP500 supercomputers rating, most of today’s supercomputers are now heterogeneous: with massively parallel […]
May, 1
Parallel For Loops on Heterogeneous Resources
In recent years, Graphics Processing Units (GPUs) have piqued the interest of researchers in scientific computing. Their immense floating point throughput and massive parallelism make them ideal for not just graphical applications, but many general algorithms as well. Load balancing applications and taking advantage of all computational resources in a machine is a difficult challenge, […]
May, 1
GPU-based Steady-State Solution of the Chemical Master Equation
The Chemical Master Equation (CME) is a stochastic and discrete-state continuous-time model for macromolecular reaction networks inside the cell. Under this theoretical framework, the solution of a sparse linear system provides the steady-state probability landscape over the molecular microstates. The CME framework can in fact reveal important insights into basic principles on how biological networks […]
May, 1
ACL2 Meets the GPU: Formalizing a CUDA-based Parallelizable All-Pairs Shortest Path Algorithm in ACL2
As Graphics Processing Units (GPUs) have gained in capability and GPU development environments have matured, developers are increasingly turning to the GPU to off-load the main host CPU of numerically-intensive, parallelizable computations. Modern GPUs feature hundreds of cores, and offer programming niceties such as double-precision floating point, and even limited recursion. This shift from CPU […]
May, 1
Graphics Programming on the Web WebCL Course Notes
This document introduces WebCL [1], a new standard under development by the Khronos Group, for highperformance computing in web browsers. Since WebCL wraps OpenCL, the course starts by reviewing important OpenCL [2] concepts. Next, we detail how to program with WebCL in the browser and on devices such as GPUs. Finally, we discuss WebCL – […]