hgpu.org » software optimization
Suejb Memeti, Sabri Pllana, Alecio Binotto, Joanna Kolodziej, and Ivona Brandic
February 10, 2018 by suejb.memeti