hgpu.org » Apple M2 Pro
Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeFroce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
February 3, 2025 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- High-Performance Computing: from Optimization to Automation
- exa-AMD: An Exascale-Ready Framework for Accelerating the Discovery and Design of Functional Materials
- VibeCodeHPC: An Agent-Based Iterative Prompting Auto-Tuner for HPC Code Generation Using LLMs
- Compile-Time Resource Safety for GPU APIs: A Low-Overhead Typestate Framework
- Accelerating cosmological simulations on GPUs: a portable approach using OpenMP
- Compiler and Runtime Systems for Generative AI Models
- EvoEngineer: Mastering Automated CUDA Kernel Code Evolution with Large Language Models
- ConCuR: Conciseness Makes State-of-the-Art Kernel Generation
- STARK: Strategic Team of Agents for Refining Kernels
- Adaptivity in AdaptiveCpp: Optimizing Performance by Leveraging Runtime Information During JIT-Compilation
* * *



