hgpu.org » Tesla S2075
Roberto Ammendola, Massimo Bernaschi, Andrea Biagioni, Mauro Bisson, Massimiliano Fatica, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Enrico Mastrostefano, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini
Tags: Computational Physics, CUDA, FPGA, MPI, nVidia, Physics, Tesla S2075
August 1, 2013 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
- Hardware Acceleration for Neural Networks: A Comprehensive Survey
- Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation
- The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers
- AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization
- SeedFold: Scaling Biomolecular Structure Prediction
- Memory-Efficient Acceleration of Block Low-Rank Foundation Models on Resource Constrained GPUs
- KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta
- GPU Kernel Optimization Beyond Full Builds: An LLM Framework with Minimal Executable Programs
- Optimal Software Pipelining and Warp Specialization for Tensor Core GPUs
* * *



