high performance computing on graphics processing units: hgpu.org


Scalable and Parallel Implementation of a Financial Application on a GPU: With Focus on Out-of-Core Case

Myungho Lee, Jin-hong Jeon, Joonsuk Kim, Joonhyun Song

Dept. of Comput. Sci. & Eng., Myong Ji Univ., Yong In, South Korea

IEEE 10th International Conference on Computer and Information Technology (CIT), 2010

DOI:10.1109/CIT.2010.238

@conference{lee2010scalable,
  title={Scalable and Parallel Implementation of a Financial Application on a GPU: with focus on out-of-core case},
  author={Lee, M. and Jeon, J. and Kim, J. and Song, J.},
  booktitle={2010 10th IEEE International Conference on Computer and Information Technology (CIT 2010)},
  pages={1323--1327},
  year={2010},
  organization={IEEE}
}


The latest Graphics Processing Units (GPUs) integrate a number of uniform programmable units on a single chip, which facilitates general-purpose computing beyond graphics processing. With these programmable units executing in parallel, the latest GPUs show superior performance for many non-graphics applications. Furthermore, programmers can directly control the GPU pipeline using easy-to-use parallel programming environments. These advances in hardware and software have made General-Purpose GPU computing (GPGPU) widespread. In this paper, we parallelize a computationally demanding financial application and optimize its performance on a recent GPU. We also analyze the performance results against those obtained using the CPU only. Experimental results show that the GPU achieves a speedup of more than 190x over the CPU-only case when the data fits in the graphics memory. We also address performance in the out-of-core case, where the data cannot fit in the GPU's device memory. In that case, a streaming technique helps recover the performance lost to the overhead of transferring data from the CPU side to the GPU DRAM.
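The abstract does not describe how the streaming technique works. The sketch below is a toy timing model, not the authors' implementation. It assumes that "streaming" means splitting the data into equal-sized chunks and overlapping the host-to-device copy of one chunk with the computation on the previous one. The function names and the timing values in the usage note are hypothetical.

```python
def serial_time(n_chunks, t_copy, t_compute):
    """Total time when each chunk is copied and then processed, with no overlap."""
    return n_chunks * (t_copy + t_compute)


def streamed_time(n_chunks, t_copy, t_compute):
    """Total time when the copy of chunk i+1 overlaps the compute of chunk i.

    Assumptions (not taken from the paper): one copy engine, one compute
    engine, equal-sized chunks, and enough device buffers that neither
    stage stalls waiting for free memory.
    """
    copy_done = 0.0
    compute_done = 0.0
    for _ in range(n_chunks):
        # Copies run back to back on the copy engine.
        copy_done += t_copy
        # A chunk can be processed only after its copy has finished
        # and the previous chunk's computation has completed.
        compute_done = max(compute_done, copy_done) + t_compute
    return compute_done
```

With hypothetical values of 8 chunks, 3 ms per copy, and 2 ms per compute step, this model gives 40 ms without overlap and 26 ms with it. Under these assumptions the pipeline is limited by whichever stage is slower, so overlap hides the shorter stage but cannot remove the cost of the longer one.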

Tags: Finance, Monte Carlo simulation

March 27, 2011 by hgpu

