Benchmarking and Optimization of Gradient Boosted Decision Tree Algorithms
IBM Research – Zurich, Rüschlikon, Switzerland
arXiv:1809.04559 [cs.LG], (12 Sep 2018)
@article{anghel2018benchmarking,
  title={Benchmarking and Optimization of Gradient Boosted Decision Tree Algorithms},
  author={Anghel, Andreea and Papandreou, Nikolaos and Parnell, Thomas and De Palma, Alessandro and Pozidis, Haralampos},
  year={2018},
  month={sep},
  eprint={1809.04559},
  archivePrefix={arXiv},
  primaryClass={cs.LG}
}
Gradient boosted decision trees (GBDTs) have seen widespread adoption in academia, industry and competitive data science due to their state-of-the-art performance in a wide variety of machine learning tasks. In this paper, we present an extensive empirical comparison of XGBoost, LightGBM and CatBoost, three popular GBDT algorithms, to aid the data science practitioner in choosing among the multitude of available implementations. Specifically, we evaluate their behavior on four large-scale datasets with varying shapes, sparsities and learning tasks, assessing the algorithms' generalization performance, training times (on both CPU and GPU) and sensitivity to hyper-parameter tuning. In our analysis, we first use a distributed grid search to benchmark the algorithms on fixed configurations, and then employ a state-of-the-art algorithm for Bayesian hyper-parameter optimization to fine-tune the models.
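As a rough illustration of the kind of head-to-head comparison the abstract describes, the sketch below trains XGBoost, LightGBM and CatBoost with roughly comparable hyper-parameters on a synthetic binary-classification dataset and times each fit. This is not the paper's benchmark harness: the dataset, parameter values and AUC metric are illustrative assumptions only.

# Hypothetical sketch of a head-to-head GBDT comparison (not the paper's code).
# The synthetic dataset, hyper-parameter values and AUC metric are assumptions.
import time

from sklearn.datasets import make_classification
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

import catboost as cb
import lightgbm as lgb
import xgboost as xgb

X, y = make_classification(n_samples=100_000, n_features=50, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

models = {
    "XGBoost":  xgb.XGBClassifier(n_estimators=200, max_depth=6,
                                  learning_rate=0.1, tree_method="hist"),
    "LightGBM": lgb.LGBMClassifier(n_estimators=200, num_leaves=63,
                                   learning_rate=0.1),
    "CatBoost": cb.CatBoostClassifier(iterations=200, depth=6,
                                      learning_rate=0.1, verbose=False),
}

for name, model in models.items():
    start = time.perf_counter()
    model.fit(X_tr, y_tr)                  # CPU training; each library also offers
    elapsed = time.perf_counter() - start  # a GPU back-end via its own parameters
    auc = roc_auc_score(y_te, model.predict_proba(X_te)[:, 1])
    print(f"{name}: fit {elapsed:.1f}s, test AUC {auc:.4f}")

In the paper, a distributed grid search and a Bayesian hyper-parameter optimizer are wrapped around this kind of training loop to tune each library; the exact datasets, search spaces and tooling are described in the full text.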
September 16, 2018 by hgpu