hgpu.org » Apple M2 Pro
Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeFroce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
February 3, 2025 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Hardware Acceleration for Neural Networks: A Comprehensive Survey
- The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers
- SeedFold: Scaling Biomolecular Structure Prediction
- KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta
- GPU Kernel Optimization Beyond Full Builds: An LLM Framework with Minimal Executable Programs
- ParaCodex: A Profiling-Guided Autonomous Coding Agent for Reliable Parallel Code Generation and Translation
- DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation
- Equivalence Checking of ML GPU Kernels
- Generative Video Compression: Towards 0.01% Compression Rate for Video Transmission
- AKG kernel Agent: A Multi-Agent Framework for Cross-Platform Kernel Synthesis
* * *



