Posts
Nov, 8
A Game Architecture Based on Multiple GPUs With Energy Management
The availability of multicore CPUs and programmable GPUs have risen the provision of processing power for applications. In case of games, this means increased scene realism and more sophisticated artificial intelligence and physics simulations, for example. However, using more power raises energy consumption and system temperature. Therefore, energy consumption and thermal management are research fields […]
Nov, 6
Many-core applications to online track reconstruction in HEP experiments
Interest in parallel architectures applied to real time selections is growing in High Energy Physics (HEP) experiments. In this paper we describe performance measurements of Graphic Processing Units (GPUs) and Intel Many Integrated Core architecture (MIC) when applied to a typical HEP online task: the selection of events based on the trajectories of charged particles. […]
Nov, 6
NaNet:a low-latency NIC enabling GPU-based, real-time low level trigger systems
We implemented the NaNet FPGA-based PCI2 Gen2 GbE/APElink NIC, featuring GPUDirect RDMA capabilities and UDP protocol management offloading. NaNet is able to receive a UDP input data stream from its GbE interface and redirect it, without any intermediate buffering or CPU intervention, to the memory of a Fermi/Kepler GPU hosted on the same PCIe bus, […]
Nov, 6
A new GPU-accelerated hydrodynamical code for numerical simulation of interacting galaxies
In this paper a new scalable hydrodynamic code GPUPEGAS (GPU-accelerated PErformance Gas Astrophysic Simulation) for simulation of interacting galaxies is proposed. The code is based on combination of Godunov method as well as on the original implementation of FlIC method, specially adapted for GPU-implementation. Fast Fourier Transform is used for Poisson equation solution in GPUPEGAS. […]
Nov, 6
Dynamic autotuning of adaptive fast multipole methods on hybrid multicore CPU & GPU systems
We discuss an implementation of adaptive fast multipole methods targeting hybrid multicore CPU- and GPU-systems. From previous experiences with the computational profile of our version of the fast multipole algorithm, suitable parts are off-loaded to the GPU, while the remaining parts are threaded and executed concurrently by the CPU. The parameters defining the algorithm affects […]
Nov, 6
Comparative Performance Analysis of Intel Xeon Phi, GPU, and CPU
We investigate and characterize the performance of an important class of operations on GPUs and Many Integrated Core (MIC) architectures. Our work is motivated by applications that analyze low-dimensional spatial datasets captured by high resolution sensors, such as image datasets obtained from whole slide tissue specimens using microscopy image scanners. We identify the data access […]
Nov, 6
Computer Graphics: From Pixels to Programmable Graphics Hardware
Computer Graphics: From Pixels to Programmable Graphics Hardware explores all major areas of modern computer graphics, starting from basic mathematics and algorithms and concluding with OpenGL and real-time graphics. It gives students a firm foundation in today’s high-performance graphics. UP-TO-DATE TECHNIQUES, ALGORITHMS, AND API: The book includes mathematical background on vectors and matrices as well […]
Nov, 6
Parallelizing Alternating Direction Implicit Solver on GPUs
We present a parallel Alternating Direction Implicit (ADI) solver on GPUs. Our implementation significantly improves ex- isting implementations in two aspects. First, we address the scalability issue of existing Parallel Cyclic Reduction (PCR) implementations by eliminating their hardware resource constraints. As a result, our parallel ADI, which is based on PCR, no longer has the […]
Nov, 6
Development of Generic Scheduling Concepts for OpenGL ES 2.0
The ability of a Graphics Processing Unit (GPU) to do efficient and massively parallel computations makes it the choice for 3D graphic applications. It is been extensively used as a hardware accelerator to boost the performance of a single application like 3D games. However, due to increasing number of 3D rendering applications and the limiting […]
Nov, 6
Accelerating Dissipative Particle Dynamics Simulations on GPUs: Algorithms, Numerics and Applications
We present a scalable dissipative particle dynamics simulation code, fully implemented on the Graphics Processing Units (GPUs) using a hybrid CUDA/MPI programming model, which achieves 10-30 times speedup on a single GPU over 16 CPU cores and almost linear weak scaling across a thousand nodes. A unified framework is developed within which the efficient generation […]
Nov, 6
Performance of Kepler GTX Titan GPUs and Xeon Phi System
NVIDIA’s new architecture, Kepler improves GPU’s performance significantly with the new streaming multiprocessor SMX. Along with the performance, NVIDIA has also introduced many new technologies such as direct parallelism, hyper-Q and GPU Direct with RDMA. Apart from other usual GPUs, NVIDIA also released another Kepler ‘GeForce’ GPU named GTX Titan. GeForce GTX Titan is not […]
Nov, 4
Batch Method for Efficient Resource Sharing in Real-time Multi-GPU Systems
The performance of many GPU-based systems depends heavily on the effective bandwidth for transferring data between the processors. For realtime systems, the importance of data transfer rates may be even higher due to non-deterministic transfer times that limit the ability to satisfy response time requirements. We present a new method that allows real-time applications to […]