hgpu.org » CPU cluster	 
David Clarke, Aleksandar Ilic, Alexey Lastovetsky, Leonel Sousa
Tags: Computer science, CPU cluster, GPU cluster, Heterogeneous systems, Matrix multiplication, nVidia, Tesla C2050, Tesla T10
June 5, 2012  by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Towards Robust Agentic CUDA Kernel Benchmarking, Verification, and Optimization
- Dato: A Task-Based Programming Model for Dataflow Accelerators
- TRUST: the HPC open-source CFD platform – from CPU to GPU
- Mojo: MLIR-Based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem
- Towards GPU Parallelism Abstractions in Rust: A Case Study with Linear Pipelines
- High-Performance Computing: from Optimization to Automation
- exa-AMD: An Exascale-Ready Framework for Accelerating the Discovery and Design of Functional Materials
- VibeCodeHPC: An Agent-Based Iterative Prompting Auto-Tuner for HPC Code Generation Using LLMs
- Evolution of Kernels: Automated RISC-V Kernel Optimization with Large Language Models
- Robust LLM Training Infrastructure at ByteDance
* * *



