hgpu.org » pyCUDA
Richard Schoonhoven, Ben van Werkhoven, Kees Joost Batenburg
Tags: AMD Radeon Instinct Mi50, ATI, Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce GTX 1080 Ti, nVidia GeForce GTX Titan X, nVidia Titan RTX, OpenCL, Performance, pyCUDA, PyOpenCL, Tesla K20, Tesla P100, Tesla V100
October 9, 2022 by hgpu
Florencio Balboa Usabiaga, Blaise Delmotte, Aleksandar Donev
Tags: Condensed matter, CUDA, nVidia, Package, Physics, pyCUDA, Soft Condensed Matter
December 6, 2016 by hgpu