hgpu.org » Exa.TrkX
Xiangyang Ju, Daniel Murnane, Paolo Calafiura, Nicholas Choma, Sean Conlon, Steve Farrell, Yaoyuan Xu, Maria Spiropulu, Jean-Roch Vlimant, Adam Aurisano, Jeremy Hewes, Giuseppe Cerati, Lindsey Gray, Thomas Klijnsma, Jim Kowalkowski, Markus Atkinson, Mark Neubauer, Gage DeZoort, Savannah Thais, Aditi Chauhan, Alex Schuy, Shih-Chieh Hsu, Alex Ballow, Alina Lazar
Tags: Algorithms, CUDA, Deep learning, Exa.TrkX, HEP, Neural networks, nVidia, Package, Physics, Tesla A100, Tesla V100
March 21, 2021 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Revealing NVIDIA Closed-Source Driver Command Streams for CPU-GPU Runtime Behavior Insight
- Evaluating CUDA Tile for AI Workloads on Hopper and Blackwell GPUs
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- A Human–Machine Collaborative Tuning Framework for Triton Kernel Optimization on SIMD Platforms
* * *




