hgpu.org » Tesla P4
Xueying Wang, Guangli Li, Xiao Dong, Jiansong Li, Lei Liu, Xiaobing Feng
Tags: Computer science, CUDA, Deep learning, Neural networks, nVidia, nVidia GeForce GTX Titan XP, Tesla P4
July 19, 2020 by hgpu
Yao Chen, Xin Long, Jiong He, Yuhang Chen, Hongshi Tan, Zhenxiang Zhang, Marianne Winslett, Deming Chen
Tags: Computer science, Deep learning, FPGA, Heterogeneous systems, Machine learning, nVidia, OpenCL, Tesla P4
May 24, 2020 by hgpu