high performance computing on graphics processing units: hgpu.org


Realizing Accelerated Cost-Effective Distributed RAID

Aleksandr Khasymski, M. Mustafa Rafique, Ali R. Butt, Sudharshan S. Vazhkudai, Dimitrios S. Nikolopoulos

Virginia Tech

Chapter in "Handbook on Data Centers", edited by Albert Y. Zomaya and Samee U. Khan, Springer, 2014

@incollection{khasymski2014realizing,
  title={Realizing Accelerated Cost-Effective Distributed RAID},
  author={Khasymski, Aleksandr and Rafique, M. Mustafa and Butt, Ali R. and Vazhkudai, Sudharshan S. and Nikolopoulos, Dimitrios S.},
  booktitle={Handbook on Data Centers},
  editor={Zomaya, Albert Y. and Khan, Samee U.},
  publisher={Springer},
  year={2014}
}


The exponential growth in user and application data calls for new means of providing fault tolerance and protection against data loss. High Performance Computing (HPC) storage systems, which are at the forefront of handling the data deluge, typically employ hardware RAID at the backend. However, such solutions are costly, do not ensure end-to-end data integrity, and can become a bottleneck during data reconstruction. In this paper, we design a flexible, fault-tolerant, and high-performance RAID-6 solution for a parallel file system (PFS). Our system uses low-cost, strategically placed GPUs, on both the client and server sides, to accelerate parity computation. In contrast to hardware-based approaches, we provide full control over the size, length, and location of a RAID array on a per-file basis, end-to-end data integrity checking, and parallelization of RAID array reconstruction. We have deployed our system in conjunction with the widely used Lustre PFS, and show that our approach is feasible and imposes acceptable overhead.
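To make the parity computation mentioned in the abstract concrete, here is a minimal CPU-side sketch in Python of the conventional RAID-6 P+Q scheme over GF(2^8). The abstract does not say which coding scheme the authors use, so the field polynomial (0x11D), the generator (2), and all function names below are assumptions chosen for illustration, not the chapter's implementation.

```python
# Illustrative sketch only: conventional RAID-6 P+Q parity over GF(2^8).
# The polynomial 0x11D, generator 2, and all names here are assumptions,
# not taken from the chapter.

def gf_mul2(x):
    """Multiply a field element by the generator 2, reducing by 0x11D."""
    x <<= 1
    if x & 0x100:
        x ^= 0x11D
    return x

def gf_mul(a, b):
    """Multiply two GF(2^8) elements by shift-and-add."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a = gf_mul2(a)
        b >>= 1
    return r

def gf_pow2(i):
    """Return 2**i in GF(2^8)."""
    r = 1
    for _ in range(i):
        r = gf_mul2(r)
    return r

def gf_inv(a):
    """Multiplicative inverse, using a**254 == a**-1 for nonzero a."""
    r = 1
    for _ in range(254):
        r = gf_mul(r, a)
    return r

def compute_pq(blocks):
    """Return (P, Q) for equal-length data blocks D_0..D_{n-1}.

    P is the XOR of all blocks; Q is the sum of 2**i * D_i, evaluated
    with Horner's rule from the highest block index down to 0.
    """
    n = len(blocks[0])
    p, q = bytearray(n), bytearray(n)
    for block in reversed(blocks):
        for j, d in enumerate(block):
            p[j] ^= d
            q[j] = gf_mul2(q[j]) ^ d
    return bytes(p), bytes(q)

def recover_two(blocks, x, y, p, q):
    """Rebuild lost data blocks x and y (x != y) from survivors plus P, Q."""
    n = len(p)
    zero = bytes(n)
    survivors = [zero if i in (x, y) else b for i, b in enumerate(blocks)]
    p_xy, q_xy = compute_pq(survivors)  # parity of the surviving data only
    gx, gy = gf_pow2(x), gf_pow2(y)
    inv = gf_inv(gx ^ gy)
    dx, dy = bytearray(n), bytearray(n)
    for j in range(n):
        pd = p[j] ^ p_xy[j]             # equals D_x ^ D_y
        qd = q[j] ^ q_xy[j]             # equals 2**x * D_x ^ 2**y * D_y
        dx[j] = gf_mul(gf_mul(gy, pd) ^ qd, inv)
        dy[j] = pd ^ dx[j]
    return bytes(dx), bytes(dy)

data = [b"abcd", b"efgh", b"ijkl", b"mnop"]
p, q = compute_pq(data)
rebuilt = recover_two([data[0], None, data[2], None], 1, 3, p, q)
```

In this sketch each byte position is processed independently of every other, which is the kind of data parallelism that makes parity computation a plausible fit for GPU offload.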

Tags: Computer science, CUDA, nVidia, Storage system, Tesla C2070, Tesla X2090

June 13, 2014 by hgpu
