Posts

Aug, 25

Parallelizing compiler framework and API for power reduction and software productivity of real-time heterogeneous multicores

Heterogeneous multicores have been attracting much attention as a way to attain high performance while keeping power consumption low in a wide range of areas. However, heterogeneous multicores make programming very difficult for programmers. The long application program development period lowers product competitiveness. In order to overcome such a situation, this paper proposes a compilation framework which bridges a gap between […]

Aug, 25

MobiRT: an implementation of OpenGL ES-based CPU-GPU hybrid ray tracer for mobile devices

Three-dimensional user interfaces on mobile devices are increasingly important. For more realistic three-dimensional visualization on mobile devices, we present the implementation of an OpenGL ES-based CPU-GPU hybrid ray tracer. This ray tracer exploits the availability of CPU and GPU architectures to fully support reflection, refraction, hard shadows, and dynamic scenes. To the best of our […]

Aug, 25

Considering GPGPU for HPC Centers: Is It Worth the Effort?

In contrast to just a few years ago, the answer to the question "What system should we buy next to best assist our users" has become a lot more complicated for the operators of an HPC center today. In addition to multicore architectures, powerful accelerator systems have emerged, and the future looks heterogeneous. In this […]

Aug, 25

Towards a GPU-Based Simulation Framework for Deformable Surface Meshes

Realism and real-time visual and haptic interactions with anatomical structures are key challenges in simulation software for surgeries. Overcoming these challenges is made difficult by the need to run the software on consumer-grade computing platforms. This paper presents preliminary work towards a framework for fast, realistic and stable simulation of deformable anatomical structures. The approach […]

Aug, 25

A PC-based fully-programmable medical ultrasound imaging system using a graphics processing unit

In this paper, a PC-based fully-programmable medical ultrasound imaging system is presented where a high performance graphics processing unit (GPU) is utilized to perform the entire ultrasound processing. In the proposed architecture, ultrasound signal and image processing algorithms were divided into four modules and efficiently implemented on NVIDIA’s Compute Unified Device Architecture (CUDA) platform (GeForce […]

Aug, 25

Fine-grain Parallelism using Multi-core, Cell/BE, and GPU Systems

Currently, we are facing a situation where applications exhibit increasing computational demands and where a large variety of parallel processor systems are available. In this paper we focus on exploiting fine-grain parallelism for three applications with distinct characteristics: a Bioinformatics application (MrBayes), a Molecular Dynamics application (NAMD), and a Database application (TPC-H). We assess, side-by-side, […]

Aug, 25

An efficient GPU-based time domain solver for the acoustic wave equation

An efficient algorithm for time-domain solution of the acoustic wave equation for the purpose of room acoustics is presented. It is based on adaptive rectangular decomposition of the scene and uses analytical solutions within the partitions that rely on spatially invariant speed of sound. This technique is suitable for auralizations and sound field visualizations, even […]

Aug, 25

GPU-acceleration for Moving Particle Semi-implicit Method

The MPS (Moving Particle Semi-implicit) method has been proven useful in computing free-surface hydrodynamic flows. Despite its applicability, one of its drawbacks in practical application is the high computational load. On the other hand, the Graphics Processing Unit (GPU), which was originally developed for acceleration of computer graphics, now provides unprecedented capability for scientific computations. The […]

Aug, 24

Parallel computation of spherical parameterizations for mesh analysis

Mesh parameterization is central to a broad spectrum of applications. In this paper, we present a novel approach to spherical mesh parameterization based on an iterative quadratic solver that is efficiently parallelizable on modern massively parallel architectures. We present an extensive analysis of performance results on both GPU and multicore architectures. We introduce a number […]

Aug, 24

A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction

Programmers for GPGPU face a rapidly changing substrate of programming abstractions, execution models, and hardware implementations. It has been established, through numerous demonstrations for particular conjunctions of application kernel, programming languages, and GPU hardware instance, that it is possible to achieve significant improvements in the price/performance and energy/performance over general purpose processors. But these demonstrations are […]

Aug, 24

CnC-CUDA: declarative programming for GPUs

The computer industry is at a major inflection point in its hardware roadmap due to the end of a decades-long trend of exponentially increasing clock frequencies. Instead, future computer systems are expected to be built using homogeneous and heterogeneous many-core processors with 10’s to 100’s of cores per chip, and complex hardware designs to address […]

Aug, 24

WAYPOINT: scaling coherence to thousand-core architectures

In this paper, we evaluate a set of coherence architectures in the context of a 1024-core chip multiprocessor (CMP) tailored to throughput-oriented parallel workloads. Based on our analysis, we develop and evaluate two techniques for scaling coherence to thousand-core CMPs. We find that a broadcast-based probe filtering scheme provides reasonable performance up to 128 cores […]

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org