high performance computing on graphics processing units: hgpu.org

RapidMind: Portability across Architectures and its Limitations

Iris Christadler, Volker Weinberg

Leibniz-Rechenzentrum der Bayerischen Akademie der Wissenschaften, D-85748 Garching bei München, Germany

arXiv:1001.1902 [cs.PF] (12 Jan 2010)

@article{christadler2010rapidmind,
  title={RapidMind: Portability across Architectures and its Limitations},
  author={Christadler, I. and Weinberg, V.},
  journal={arXiv preprint arXiv:1001.1902},
  year={2010}
}

Recently, hybrid architectures using accelerators like GPGPUs or the Cell processor have gained much interest in the HPC community. The RapidMind Multi-Core Development Platform is a programming environment that generates code able to run seamlessly on hardware accelerators such as GPUs or the Cell processor, as well as on multicore CPUs from both AMD and Intel. This paper describes the ports of three mathematical kernels to RapidMind, chosen as synthetic benchmarks and as representatives of scientific codes. The performance of these kernels was measured on various RapidMind backends (cuda, cell and x86) and compared with hardware-specific implementations (using CUDA, the Cell SDK and Intel MKL). The results give insight into the degree of portability of RapidMind code and into its performance across different architectures.

Tags: Cell processor, Computer science, CUDA, nVidia, Performance, Programming Languages, Tesla C1060, Tesla S1070

November 12, 2010 by hgpu
