Posts

Jan, 17

Power Profiling of GeMTC Many Task Computing

GeMTC allows Many Task Computing (MTC) workloads to run on hardware accelerators, gaining the advantages of many-core architectures. However, GeMTC is presently written to take advantage of NVIDIA GPUs only. Another such hardware accelerator, the Intel Xeon Phi, is also an excellent candidate for MTC workloads. Therefore, the first goal of […]
Jan, 17

GPU Accelerated Vessel Segmentation Using Laplacian Eigenmaps

The Laplacian eigenmap is one of the most widely used techniques to improve cluster-based segmentation of multivariate images. However, one problem with this approach is its excessive computational requirements, especially when processing large image datasets. In this paper, we aim to employ emerging commodity graphics hardware for eigenmap-based segmentation. In particular, we present a highly […]
Jan, 17

Prefiltered Single Scattering

Volumetric light scattering is a complex phenomenon that is difficult to simulate in real time as light can be scattered towards the camera from everywhere in space. By assuming a single-scattering model, we can transform the usually-employed ray-marching into an efficient ray-independent texture filtering process. Our algorithm builds upon a rectified shadow map as input […]
Jan, 17

Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation

By reorganizing the execution order and optimizing the data structure, we propose an efficient parallel framework for an H.264/AVC encoder on a massively parallel architecture. We implemented the proposed framework with CUDA on an NVIDIA GPU. Not only are the compute-intensive components of the H.264 encoder parallelized, but the control-intensive components are also realized effectively, […]
Jan, 16

MRPB: Memory Request Prioritization for Massively Parallel Processors

Massively parallel, throughput-oriented systems such as graphics processing units (GPUs) offer high performance for a broad range of programs. They are, however, complex to program, especially because of their intricate memory hierarchies with multiple address spaces. In response, modern GPUs have widely adopted caches, hoping to provide smoother reductions in memory access traffic and latency. […]
Jan, 16

VertexAPI2 – A Vertex-Program API for Large Graph Computations on the GPU

VertexAPI2 uses state-of-the-art GPU algorithms to implement the Gather-Apply-Scatter (GAS) abstraction for graph computations. VertexAPI2 provides up to an order of magnitude greater performance over the previous implementation and performance comparable to speed-of-light hand-coded algorithms in some cases, while retaining the simplicity of development of the GAS model. The current code also has a preliminary […]
Jan, 16

Improving Student Learning in Computer Science Courses by Using Virtual OpenCL Laboratory

Laboratory experience is an essential factor in engineering and science education. Virtual laboratories are widely used by universities and research institutions across many academic sectors. However, general-purpose virtual laboratories are poorly suited to computer graphics, whose experiments need to run on high-performance computers. In the assessment of a graduate […]
Jan, 15

3rd Workshop on Scalable Computing in Distributed Systems and 8th Workshop on Large Scale Computations on Grids, SCoDiS-LaSCoG’14

The Large Scale Computing in Grids (LaSCoG) workshop originated in 2005, and when it was created we stated in its preamble that: “The emerging paradigm for execution of large-scale computations, whether they originate as scientific or engineering applications, or for supporting large data-intensive calculations, is to utilize multiple computers at sites distributed across the […]
Jan, 14

Adaptation of an acoustic propagation model to the parallel architecture of a graphics processor

High-performance underwater acoustic models are of great importance for enabling real-time acoustic source tracking, geoacoustic inversion, environmental monitoring and high-frequency underwater communications. Given the parallelizable nature of ray tracing, in general, and of the ray superposition algorithm in particular, use of multiple computing units for the development of real-time efficient applications based on ray tracing […]
Jan, 14

High Performance Code Generation for Stencil Computation on Heterogeneous Multi-device Architectures

Heterogeneous architectures have been widely used in the domain of high performance computing. On one hand, they allow a designer to use multiple types of computing units, each executing the tasks it is best suited for, to increase performance; on the other hand, they bring many challenges in programming for novice […]
Jan, 14

A Pervasive Parallel Framework for Visualization

We are on the threshold of a transformative change in the basic architecture of high-performance computing. The use of accelerator processors, characterized by large core counts, shared but asymmetrical memory, and heavy thread loading, is quickly becoming the norm in high performance computing. These accelerators represent significant challenges in updating our existing base of software. […]
Jan, 14

Optimal Alignment of Three Sequences On A GPU

We develop two algorithms, layered and sloped, to align three sequences on a GPU. Our algorithms can be used to determine the alignment score as well as the actual alignment. Experiments conducted using an NVIDIA C2050 GPU show that our sloped algorithm is 3 times as fast as the layered one. Further, the sloped algorithm delivers a […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
