hgpu.org » Tesla S2075
Roberto Ammendola, Massimo Bernaschi, Andrea Biagioni, Mauro Bisson, Massimiliano Fatica, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Enrico Mastrostefano, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini
Tags: Computational Physics, CUDA, FPGA, MPI, nVidia, Physics, Tesla S2075
August 1, 2013 by hgpu



