hgpu.org » Tesla T40
Xiaojue Zhu, Everett Phillips, Vamsi Spandan, John Donners, Gregory Ruetsch, Josh Romero, Rodolfo Ostilla-Monico, Yantao Yang, Detlef Lohse, Roberto Verzicco, Massimiliano Fatica, Richard J.A.M. Stevens
Tags: cfd, CUDA, Fluid dynamics, Fortran, GPU cluster, MPI, Navier-Stokes equations, NSEs, nVidia, Package, Tesla K20, Tesla P100, Tesla T40
May 6, 2017 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators
- Analyzing the Impact of Kernel Fusion on GPU Tensor Operation Performance: A Systematic Performance Study
- CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging
- Source-to-Source Transformations for GPU Code Generation
- Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
- CodegenBench: Can LLMs Write Efficient Code Across Architectures?
* * *




