high performance computing on graphics processing units: hgpu.org

hgpu.org » Graph theory

Fast Knowledge Graph Completion using Graphics Processing Units

Chun-Hee Lee, Dong-oh Kang, Hwa Jeon Song

View

Download (PDF)

Tags: AI, Algorithms, Computer science, CUDA, Databases, Graph theory, nVidia, nVidia A100

July 30, 2023 by hgpu

Improving the scalability of modern applications by parallel multi-core and many-core programming

Alessandro Borione

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, Graph theory, nVidia, nVidia GeForce RTX 3070, OpenCL, Thesis

January 15, 2023 by hgpu

Experience of Migrating a Parallel Graph Coloring Program from CUDA to SYCL

Zheming Jin

View

Download (PDF)

Source codes

Tags: Benchmarking, Computer science, CUDA, Graph theory, nVidia, OpenCL, Package, Performance, SYCL

May 8, 2022 by hgpu

Learning Massive Graph Embeddings on a Single Machine

Jason Mohoney, Roger Waleffe, Yiheng Xu, Theodoros Rekatsinas, Shivaram Venkataraman

View

Download (PDF)

Tags: Computer science, CUDA, Databases, Graph theory, Machine learning, nVidia, Tesla V100

January 24, 2021 by hgpu

Fast Gunrock Subgraph Matching (GSM) on GPUs

Leyuan Wang, John D. Owens

View

Download (PDF)

Source codes

Tags: Computer science, Graph theory, nVidia, nVidia GeForce GTX Titan V, nVidia GeForce GTX Titan XP, Package

March 8, 2020 by hgpu

Pangolin: An Efficient and Flexible Graph Mining System on CPU and GPU

Xuhao Chen, Roshan Dathathri, Gurbinder Gill, Keshav Pingali

View

Download (PDF)

Tags: Computer science, CUDA, Graph theory, nVidia, nVidia GeForce GTX 1080 Ti, Tesla V100

November 24, 2019 by hgpu

On-The-Fly Parallel Data Shuffling for Graph Processing on OpenCL-based FPGAs

Xinyu Chen, Ronak Bajaj, Yao Chen, Jiong He, Bingsheng He, Weng-Fai Wong, Deming Chen

View

Download (PDF)

Tags: Computer science, FPGA, Graph theory, OpenCL

August 25, 2019 by hgpu

HashGraph – Scalable Hash Tables Using A Sparse Graph Data Structure

Oded Green

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, Databases, Graph theory, nVidia, nVidia Quadro GV100

July 14, 2019 by hgpu

GraphVite: A High-Performance CPU-GPU Hybrid System for Node Embedding

Zhaocheng Zhu, Shizhen Xu, Meng Qu, Jian Tang

View

Download (PDF)

Tags: Computer science, Graph theory, Machine learning, nVidia, nVidia GeForce GTX 1080, Tesla P100

March 10, 2019 by hgpu

SIMD-X: Programming and Processing of Graph Algorithms on GPUs

Hang Liu, H. Howie Huang

View

Download (PDF)

Tags: Computer science, CUDA, Graph theory, nVidia, Tesla K20, Tesla K40, Tesla P100

December 16, 2018 by hgpu

Towards Efficient Large-Scale Graph Neural Network Computing

Lingxiao Ma, Zhi Yang, Youshan Miao, Jilong Xue, Ming Wu, Lidong Zhou, Yafei Dai

View

Download (PDF)

Tags: Computer science, CUDA, Deep learning, Graph theory, Machine learning, Neural networks, nVidia, Tesla P100

October 28, 2018 by hgpu

Abelian: A Compiler for Graph Analytics on Distributed, Heterogeneous Platforms

Gurbinder Gill, Roshan Dathathri, Loc Hoang, Andrew Lenharth, Keshav Pingali

View

Download (PDF)

Tags: Computer science, CUDA, Graph theory, Intel Xeon Phi, nVidia, Tesla K80

July 21, 2018 by hgpu

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Fast Knowledge Graph Completion using Graphics Processing Units

Improving the scalability of modern applications by parallel multi-core and many-core programming

Experience of Migrating a Parallel Graph Coloring Program from CUDA to SYCL

Learning Massive Graph Embeddings on a Single Machine

Fast Gunrock Subgraph Matching (GSM) on GPUs

Pangolin: An Efficient and Flexible Graph Mining System on CPU and GPU

On-The-Fly Parallel Data Shuffling for Graph Processing on OpenCL-based FPGAs

HashGraph – Scalable Hash Tables Using A Sparse Graph Data Structure

GraphVite: A High-Performance CPU-GPU Hybrid System for Node Embedding

SIMD-X: Programming and Processing of Graph Algorithms on GPUs

Towards Efficient Large-Scale Graph Neural Network Computing

Abelian: A Compiler for Graph Analytics on Distributed, Heterogeneous Platforms

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)