hgpu.org » nVidia Quadro P 620
Lazaros Papadopoulos, Dimitris John Soudris, Christoph Kessler, August Ernstsson, Johan Ahlqvist, Nikos Vasilas, Athanasios Papadopoulos, Panos Seferlis, Charles Prouveur, Matthieu Haefele, Samuel Paul Thibault, Athanasios Salamanis, Theodoros Ioakimidis, Dionisis D. Kehagias
Tags: Computer science, CUDA, FPGA, Heterogeneous systems, MPI, nVidia, nVidia Quadro P 620, OpenCL, OpenMP, Tesla P100, Tesla V100
August 22, 2021 by hgpu



