high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede

Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede

Qingyu Meng, Alan Humphrey, John Schmidt, Martin Berzins

Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, UT 84112 USA

Conference of the Extreme Science and Engineering Discovery Environment (XSEDE), 2013

@article{meng2013preliminary,

title={Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede},

author={Meng, Qingyu and Humphrey, Alan and Schmidt, John and Berzins, Martin},

year={2013}

}

Download (PDF)

View

Source

2530

views

In this work, we describe our preliminary experiences on the Stampede system in the context of the Uintah Computational Framework. Uintah was developed to provide an environment for solving a broad class of fluid-structure interaction problems on structured adaptive grids. Uintah uses a combination of fluid-flow solvers and particle-based methods, together with a novel asynchronous taskbased approach and fully automated load balancing. While we have designed scalable Uintah runtime systems for large CPU core counts, the emergence of heterogeneous systems presents considerable challenges in terms of effectively utilizing additional on-node accelerators and co-processors, deep memory hierarchies, as well as managing multiple levels of parallelism. Our recent work has addressed the emergence of heterogeneous CPU/GPU systems with the design of a Unified heterogeneous runtime system, enabling Uintah to fully exploit these architectures with support for asynchronous, out-of-order scheduling of both CPU and GPU computational tasks. Using this design, Uintah has run at full scale on the Keeneland System and TitanDev. With the release of the Intel Xeon Phi co-processor and the recent availability of the Stampede system, we show that Uintah may be modified to utilize such a coprocessor based system. We also explore the different usage models provided by the Xeon Phi with the aim of understanding portability of a general purpose framework like Uintah to this architecture. These usage models range from the pragma based offload model to the more complex symmetric model, utilizing all co-processor and host CPU cores simultaneously. We provide preliminary results of the various usage models for a challenging adaptive mesh refinement problem, as well as a detailed account of our experience adapting Uintah to run on the Stampede system. Our conclusion is that while the Stampede system is easy to use, obtaining high performance from the Xeon Phi co-processors requires a substantial but different investment to that needed for GPU-based systems.

Tags: Computer science, Heterogeneous systems, Intel, Intel Phi, Performance

July 2, 2013 by hgpu

Rating: 0.5/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)