high performance computing on graphics processing units: hgpu.org

hgpu.org » N-body simulation

Comparing Parallel Functional Array Languages: Programming and Performance

David van Balen, Tiziano De Matteis, Clemens Grelck, Troels Henriksen, Aaron W. Hsu, Gabriele K. Keller, Thomas Koopman, Trevor L. McDonell, Cosmin Oancea, Sven-Bodo Scholz, Artjoms Sinkarovs, Tom Smeding, Phil Trinder, Ivo Gabe de Wolff, Alexandros Nikolaos Ziogas

View

Tags: Benchmarking, Computer science, CUDA, HIP, N-body simulation, nVidia, nVidia A30, OpenCL, Package, Performance, performance portability, Programming Languages

May 18, 2025 by hgpu

Unified schemes for directive-based GPU offloading

Yohei Miki, Toshihiro Hanawa

View

Tags: AMD Radeon Instinct MI210, ATI, Computer science, Diffusion equation, Intel, Intel Ponte Vecchio Max 1100, N-body simulation, nVidia, nVidia GH200, nVidia H100, OpenACC, OpenMP, Package

December 8, 2024 by hgpu

ExaNBody: a HPC framework for N-Body applications

Thierry Carrard, Raphaël Prat, Guillaume Latu, Killian Babilotte, Paul Lafourcade, Lhassan Amarsid, Laurent Soulard

View

Tags: Astrophysics, Computer science, CUDA, MPI, N-body simulation, nVidia, nVidia A100, OpenMP, performance portability

November 19, 2023 by hgpu

Comparison of different n-body algorithms on various hardware platforms using SYCL

Tim Thüring

View

Tags: AMD Radeon Pro VII, Astrophysics, ATI, Computer science, N-body simulation, nVidia, nVidia A100, nVidia GeForce RTX 3090, nVidia Quadro GP100, Package, Physics, SYCL, Thesis

October 15, 2023 by hgpu

FlowPM: Distributed TensorFlow Implementation of the FastPM Cosmological N-body Solver

Chirag Modi, Francois Lanusse, Uros Seljak

View

Tags: Astrophysics, Cosmology, CUDA, N-body simulation, nVidia, Package, Physics, TensorFlow, Tesla V100

October 25, 2020 by hgpu

Performance and energy footprint assessment of FPGAs and GPUs on HPC systems using Astrophysics application

David Goz, Georgios Ieronymakis, Vassilis Papaefstathiou, Nikolaos Dimou, Sara Bertocco, Giuliano Taffoni, Francesco Simula, Antonio Ragagnin, Luca Tornatore, Igor Coretti

View

Tags: ARM, Astrophysics, CUDA, FPGA, Heterogeneous systems, HPC, HSL, N-body simulation, nVidia, OpenCL, Performance, Physics, SoC, Tesla V100

March 15, 2020 by hgpu

Direct N-body code on low-power embedded ARM GPUs

David Goz, Sara Bertocco, Luca Tornatore, Giuliano Taffoni

View

Tags: ARM, Astrophysics, FPGA, Heterogeneous systems, Instrumentation and Methods for Astrophysics, N-body simulation, OpenCL, OpenMPI, SoC

January 27, 2019 by hgpu

StePS: A Multi-GPU Cosmological N-body Code for Compactified Simulations

Gabor Racz, Istvan Szapudi, Laszlo Dobos, Istvan Csabai, Alexander S. Szalay

View

Tags: Astrophysics, Cosmology, CUDA, MPI, N-body simulation, nVidia, OpenMP, Package, Tesla K80

November 18, 2018 by hgpu

A Qualitative Comparison Study Between Common GPGPU Frameworks

Adam Soderstrom

View

Tags: Computer science, CUDA, DirectCompute, DirectX, N-body simulation, nVidia, nVidia GeForce GTX 1050, OpenCL, Thesis

August 26, 2018 by hgpu

Scalable Streaming Tools for Analyzing N-body Simulations: Finding Halos and Investigating Excursion Sets in One Pass

Nikita Ivkin, Zaoxing Liu, Lin F. Yang, Srinivas Suresh Kumar, Gerard Lemson, Mark Neyrinck, Alexander S. Szalay, Vladimir Braverman, Tamas Budavari

View

Tags: Algorithms, Astrophysics, CUDA, Instrumentation and Methods for Astrophysics, N-body simulation, nVidia, nVidia GeForce GTX 1080, Tesla C2050, Tesla C2070

November 7, 2017 by hgpu

Accelerating Workloads on FPGAs via OpenCL: A Case Study with OpenDwarfs

Anshuman Verma, Ahmed E. Helal, Konstantinos Krommydas, Wu-Chun Feng

View

Tags: Computer science, FPGA, Intel Xeon Phi, N-body simulation, nVidia, OpenCL, performance portability, Tesla C2070, Tesla K20

January 26, 2017 by hgpu

Massively Parallel Computation of Accurate Densities for N-body Dark Matter Simulations using the Phase-Space-Element Method

Ralf Kaehler

View

Tags: Astrophysics, Computational Physics, CUDA, Instrumentation and Methods for Astrophysics, N-body simulation, nVidia, Package, Physics, Tesla K80, Tessellation

January 4, 2017 by hgpu

NVIDIA Nemotron Parse 1.1

NVIDIA Nemotron Parse 1.1

ThunderKittens: Tile primitives for speedy kernels

ParallelKittens: Systematic and Practical Simplification of Multi-GPU AI Kernels

Iris: AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming

Iris: First-Class Multi-GPU Programming Experience in Triton

HipKittens: Fast and Furious AMD Kernels

HipKittens: Fast and Furious AMD Kernels

Fortran xDSL dialects

An MLIR pipeline for offloading Fortran to FPGAs via OpenMP

mt4g: Memory Topology 4 GPUs

MT4G: A Tool for Reliable Auto-Discovery of NVIDIA and AMD GPU Compute and Memory Topologies

Falcon: GPU-Based Floating-point Adaptive Lossless Compression

A High-Throughput GPU Framework for Adaptive Lossless Compression of Floating-Point Data

CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization

CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization

LC Framework

Characterizing the Performance of Parallel Data-Compression Algorithms across Compilers and GPUs

pplx-garden: Perplexity open source garden for inference technology

RDMA Point-to-Point Communication for LLM Systems

See all packages

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Login | Sitemap | Feedback | Policy

Contact us: