hgpu.org » pyCUDA
Richard Schoonhoven, Ben van Werkhoven, Kees Joost Batenburg
Tags: AMD Radeon Instinct Mi50, ATI, Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce GTX 1080 Ti, nVidia GeForce GTX Titan X, nVidia Titan RTX, OpenCL, Performance, pyCUDA, PyOpenCL, Tesla K20, Tesla P100, Tesla V100
October 9, 2022 by hgpu
Florencio Balboa Usabiaga, Blaise Delmotte, Aleksandar Donev
Tags: Condensed matter, CUDA, nVidia, Package, Physics, pyCUDA, Soft Condensed Matter
December 6, 2016 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
- Accurate Models of NVIDIA Tensor Cores
- TritonForge: Profiling-Guided Framework for Automated Triton Kernel Optimization
- PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
- cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution
- Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation
- Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation
- Hybrid Learning and Optimization-Based Dynamic Scheduling for DL Workloads on Heterogeneous GPU Clusters
- BoltzGen:Toward Universal Binder Design
- AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization
* * *




