high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters

Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters

Langshi Chen, Bo Peng, Bingjing Zhang, Tony Liu, Yiming Zou, Lei Jiang, Robert Henschel, Craig Stewart, Zhang Zhang, Emily Mccallum, Zahniser Tom, Omer Jon, Judy Qiu

School of Informatics and Computing, Indiana University

IEEE Cloud 2017 Conference, 2017

BibTeX

Download (PDF)

View

Source

Source codes

Package:

Harp-DAAL: A collective communication library plugined into Hadoop from Indiana University

3830

views

Data analytics is undergoing a revolution in many scientific domains, demanding cost-effective parallel data analysis techniques. Traditional Java-based Big Data processing tools like Hadoop MapReduce are designed for commodity CPUs. In contrast, emerging manycore processors like Xeon Phi has an order of magnitude of computation power and memory bandwidth. To harness the computing capabilities, we propose a Harp-DAAL framework. We show that enhanced versions of MapReduce can be replaced by Harp, a Hadoop plug-in, that offers useful data abstractions for both of high-performance iterative computation and MPI-quality communication, and it can drive Intel’s native library DAAL. We select a subset of three machine learning algorithms and implement them within Harp-DAAL. Our scalability benchmarks run on Knights Landing (KNL) clusters and achieve up to 2.5 times speedup of performance to the HPC solution in NOMAD and 15 to 40 times faster than Java-based solutions in Spark. We further quantify the workloads on single node KNL with a performance breakdown at micro-architecture level.

Tags: Benchmarking, big data, Computer science, Intel Xeon Phi, Java, Machine learning, MapReduce, MPI, Package, Performance

September 3, 2017 by hgpu

Rating: 2.0/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters

Package:

Your response

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)

Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters

Package:

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)