
Posts

Nov, 5

Kite: Braided Parallelism for Heterogeneous Systems

Modern processors are evolving into hybrid, heterogeneous processors with both CPU and GPU cores used for general purpose computation. Several languages, such as BrookGPU, CUDA, and more recently OpenCL, have been developed to harness the potential of these processors. These languages typically involve control code running on a host CPU, while performance-critical, massively data-parallel kernel […]
Nov, 1

DL: A data layout transformation system for heterogeneous computing

For many-core architectures like GPUs, efficient off-chip memory access is crucial to high performance; the applications are often limited by off-chip memory bandwidth. Transforming data layout is an effective way to reshape the access patterns to improve off-chip memory access behavior, but several challenges have limited the use of automated data layout transformation systems […]
Oct, 28

Investigating Host-Device communication in a GPU-based H.264 encoder

Modern graphical processing units (GPU) are powerful parallel processors, capable of running thousands of concurrent threads. While originally limited to graphics processing, newer generations can be used for general computing (GPGPU). Through frameworks such as nVidia Compute Unified Device Architecture (CUDA) and OpenCL, GPU programs can be written using established programming languages (with minor extensions) […]
Oct, 26

An Environment to Support GPU and Multicore Programming for Rapid, High Performance, Application Deployment

Homogeneous multicore processors, heterogeneous multicore processors, high performance accelerators, and other heterogeneous architectures have significant computing potential over traditional single core processors. Computer systems comprised of these specialized processing elements are increasingly common. Due to the increased complexity of these architectures, programming for them has become increasingly complex and error prone. Each of these architectures […]
Oct, 23

Task Parallelism and Data Distribution: An Overview of Explicit Parallel Programming Languages

Programming parallel machines as effectively as sequential ones would ideally require a language that provides high-level programming constructs to avoid the programming errors frequent when expressing parallelism. Since task parallelism is considered more error-prone than data parallelism, we survey six popular and efficient parallel language designs that tackle this difficult issue: Cilk, Chapel, X10, Habanero-Java, […]
Oct, 20

Ray Tracing of Volumetric Data in Real Time

Graphics processors of today are highly efficient, parallel processors, capable of rendering complex scenes consisting of millions of polygons on the screen each and every second. They are highly specialized towards game graphics and similar, polygon based graphics. In the past, however, they have not been very efficient at rendering volumetric data, and especially not […]
Oct, 16

High-Performance Computing Algorithms for Constructing Inverted Files on Emerging Multicore Processors

Current trends in processor architectures increasingly include more cores on a single chip and more complex memory hierarchies, and such a trend is likely to continue in the foreseeable future. These processors offer unprecedented opportunities for speeding up demanding computations if the available resources can be effectively utilized. Simultaneously, parallel programming languages such as OpenMP […]
Oct, 13

.NET High Performance Computing

Graphics Processing Units (GPUs) have been extensively applied in the High Performance Computing (HPC) community. HPC applications require additional special programming environments to improve the utilization of GPUs, for example, NVIDIA’s CUDA and Khronos group’s OpenCL. This thesis will introduce a preprocessor framework called HPC.NET, which is deployed on the Microsoft .NET platform to meet […]
Oct, 8

GPU Accelerated NIDS Search

Network Intrusion Detection System (NIDS) analyzes network traffic for malicious activities and reports findings from events that intend to compromise the security of the computers and other equipment. NIDS looks into both headers and payloads of the network packets to identify possible intrusions. NIDS models that only use Central Processing Units (CPU) such as the […]
Oct, 5

Accelerated protein structure comparison using TM-score-GPU

MOTIVATION: Accurate comparisons of different protein structures play important roles in structural biology, structure prediction and functional annotation. The root-mean-square-deviation (RMSD) after optimal superposition is the predominant measure of similarity due to the ease and speed of computation. However, global RMSD is dependent on the length of the protein and can be dominated by divergent […]
Oct, 3

Performance Analysis of an Ultrasound Reconstruction Algorithm for Non Destructive Testing

The CIVA software platform developed by CEA-LIST offers various simulation and data processing modules dedicated to non-destructive testing (NDT). In particular, ultrasonic imaging and reconstruction tools are proposed for the purpose of localizing echoes and identifying and sizing the detected defects. Because of the complexity of data processed, computation time is now a limitation for […]
Oct, 2

Multi2Sim: a simulation framework for CPU-GPU computing

Accurate simulation is essential for the proper design and evaluation of any computing platform. Upon the current move toward the CPU-GPU heterogeneous computing era, researchers need a simulation framework that can model both kinds of computing devices and their interaction. In this paper, we present Multi2Sim, an open-source, modular, and fully configurable toolset that enables […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
