hgpu.org » Apple M2 Max
Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeFroce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
February 3, 2025 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Comparing Parallel Functional Array Languages: Programming and Performance
- Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration
- Can Large Language Models Predict Parallel Code Performance?
- Acceleration as a Service (XaaS) Source Containers
- Performance of Confidential Computing GPUs
- CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark
- Low-cost edge computing using upcycled smartphones
- Exploration of Cryptocurrency Mining-Specific GPUs in AI Applications: A Case Study of CMP 170HX
- Exploring SYCL for batched kernels with memory allocations
- GPU Performance Portability needs Autotuning
* * *