high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Parallel computing with CUDA

Parallel computing with CUDA

M. Garland

IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010

DOI:10.1109/IPDPS.2010.5470378

@poster{garland2010parallel,

title={Parallel computing with CUDA},

author={Garland, M.},

booktitle={IEEE International Symposium on Parallel and Distributed Processing (IPDPS)},

year={2010}

}

Source

1461

views

Summary form only given. NVIDIA’s CUDA architecture provides a powerful platform for writing highly parallel programs. By providing simple abstractions for hierarchical thread organization, memories, and synchronization, the CUDA programming model allows programmers to write scalable programs without the burden of learning a multitude of new programming constructs. The CUDA architecture can support many languages and programming environments, including C, Fortran, OpenCL, and DirectX Compute. In this tutorial, I will provide an overview of modern GPU processor design and its implications for successful parallel programming models. I will present the programming model adopted by the CUDA architecture, and demonstrate how this is exposed in the C/C++ language. Finally, I will sketch some techniques for implementing common data-parallel algorithms in the CUDA model.

Tags: Computer science, CUDA, nVidia, Programming techniques

April 2, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Parallel computing with CUDA

Your response

Recent source codes

NVIDIA Nemotron Parse 1.1

ThunderKittens: Tile primitives for speedy kernels

Iris: AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming

HipKittens: Fast and Furious AMD Kernels

Fortran xDSL dialects

mt4g: Memory Topology 4 GPUs

Falcon: GPU-Based Floating-point Adaptive Lossless Compression

CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization

pplx-garden: Perplexity open source garden for inference technology

LC Framework

Most viewed papers (last 30 days)

Parallel computing with CUDA

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)