high performance computing on graphics processing units: hgpu.org

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

Peter Thoman, Fabian Knorr, Luigi Crisci

Tags: Computer science, HPC, Package, Performance, SYCL

April 21, 2024 by hgpu

Python-Based Quantum Chemistry Calculations with GPU Acceleration

Xiaojie Wu, Qiming Sun, Zhichen Pu, Tianze Zheng, Wenzhi Ma, Wen Yan, Xia Yu, Zhengxiao Wu, Mian Huo, Xiang Li, Weiluo Ren, Sheng Gong, Yumin Zhang, Weihao Gao

Tags: Chemical Physics, Chemistry, Computational Physics, CUDA, nVidia, nVidia A100, Package, Python, Quantum Physics

April 21, 2024 by hgpu

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach

Peter Thoman, Philip Salzmann

Tags: Benchmarking, Computer science, GPU cluster, HPC, nVidia, nVidia V100, Package, SYCL

April 14, 2024 by hgpu

QArray: a GPU-accelerated constant capacitance model simulator for large quantum dot arrays

Barnaby van Straaten, Joseph Hickie, Lucas Schorling, Jonas Schuff, Federico Fedele, Natalia Ares

Tags: Condensed matter, Machine learning, Mesoscale and Nanoscale Physics, nVidia, nVidia GeForce GTX 1080 Ti, Package, Physics, Python, Rust

April 14, 2024 by hgpu

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

Erik D. Huckvale, Hunter N.B. Moseley

Tags: Computer science, CUDA, nVidia, Package, Performance, Profiling, Python

April 7, 2024 by hgpu

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

Massinissa Merouani, Khaled Afif Boudaoud, Iheb Nassim Aouadj, Nassim Tchoulak, Islam Kara Bernou, Hamza Benyamina, Fatima Benbouzid-Si Tayeb, Karima Benatchba, Hugh Leather, Riyadh Baghdadi

Tags: Compilers, Computer science, Deep learning, Machine learning, OpenCL, Package, Programming Languages

March 24, 2024 by hgpu

Retargeting and Respecializing GPU Workloads for Performance Portability

Ivan R. Ivanov, Oleksandr Zinenko, Jens Domke, Toshio Endo, William S. Moses

Tags: AMD Radeon Instinct MI210, AMD Radeon RX 6800, ATI, Computer science, CUDA, HIP, HPC, nVidia, nVidia A100, nVidia RTX A4000, Package, performance portability

March 24, 2024 by hgpu

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

John Tramm, Paul Romano, Patrick Shriwise, Amanda Lund, Johannes Doerfert, Patrick Steinbrecher, Andrew Siegel, Gavin Ridley

Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, Intel, Intel Data Center GPU Max 1550, Intel Ponte Vecchio Max 1100, nVidia, nVidia A100, OpenMP, Package, performance portability

March 24, 2024 by hgpu

Parallel Gaussian process with kernel approximation in CUDA

Davide Carminati

Tags: Benchmarking, Computer science, CUDA, Linear Algebra, nVidia, nVidia GeForce GTX 1050, nVidia GeForce RTX 2080, Package

March 24, 2024 by hgpu

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

Youssef Faqir-Rhazoui, Carlos García

Tags: Computer science, CUDA, nVidia, nVidia Jetson Orin Nano, Optical flow, Package, Performance, SYCL

March 18, 2024 by hgpu

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

Yehonatan Fridman, Guy Tamir, Uri Steinitz, Gal Oren

Tags: Benchmarking, Intel, Intel Ponte Vecchio Max 1100, Monte Carlo simulation, OpenMP, Package, Physics

March 10, 2024 by hgpu

Hybrid quantum programming with PennyLane Lightning on HPC platforms

Ali Asadi, Amintor Dusko, Chae-Yeun Park, Vincent Michaud-Rioux, Isidor Schoch, Shuli Shu, Trevor Vincent, Lee James O'Riordan

Tags: AMD Radeon Instinct MI250X, ATI, HPC, nVidia, nVidia A100, OpenMP, Package, Physics, Quantum Physics

March 10, 2024 by hgpu

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

GPU plugin for PySCF

Python-Based Quantum Chemistry Calculations with GPU Acceleration

QArray

QArray: a GPU-accelerated constant capacitance model simulator for large quantum dot arrays

Celerity: High-level C++ for Accelerator Clusters

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 seconds

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
