high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes

Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes

Oluwapelumi Adenikinju, Julian Gilyard, Joshua Massey, Thomas Stitt

Department of Computer Science and Electrical Engineering, UMBC

SIAM Undergraduate Research Online (SIURO), vol. 8, 2015

@article{adenikinju2015concurrent,

title={Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes},

author={Adenikinju, Oluwapelumi and Gilyard, Julian and Massey, Joshua and Stitt, Thomas},

year={2015}

}

Download (PDF)

View

Source

2275

views

We investigate the parallel solutions to linear systems with the application focus as the global illumination problem in computer graphics. An existing CPU serial implementation using the radiosity method is given as the performance baseline where a scene and corresponding form-factor coefficients are provided. The initial computational radiosity solver uses the basic Jacobi method with a fixed iteration count as an iterative approach to solving the radiosity linear system. We add the option of using the modern BiCG-STAB method with the aim of reduced runtime for complex problems. It is found that for the test scenes used, the problem complexity was not great enough to take advantage of mathematical reformulation through BiCG-STAB. Single-node parallelization techniques are implemented through OpenMP-based multithreading, GPU-offloading using CUDA, and hybrid multi-threading/GPU offloading. It is seen that in general OpenMP is optimal by requiring no expensive memory transfers. Finally, we investigate two storage schemes of the system to determine whether storage through arrays of structures or structures of arrays results in better performance. We find that the usage of arrays of structures in conjunction with OpenMP results in the best performance except for small scene sizes, where CUDA shows the minimal runtime.

Tags: Computer science, CUDA, Linear Algebra, nVidia, OpenGL, OpenMP

June 26, 2015 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)