hgpu.org » Intel UHD 770
Cristian Campos, Rafael Asenjo, Angeles Navarro
Tags: Computer science, Data parallelism, Heterogeneous systems, Intel, Intel UHD 770, oneAPI, Package, SYCL
January 27, 2025 by hgpu
Manuel Costanzo, Enzo Rucci, Carlos García-Sánchez, Marcelo Naiouf, Manuel Prieto-Matías
Tags: AMD Radeon RX 6700 XT, AMD Radeon RX Vega 6, ATI, Bioinformatics, Biology, Computer science, CUDA, Databases, Heterogeneous systems, HPC, Intel, Intel Arc A770, Intel UHD 630, Intel UHD 770, nVidia, nVidia GeForce GTX 1080, nVidia GeForce GTX 980, nVidia GeForce RTX 2070, nVidia GeForce RTX 3070, nVidia GeForce RTX 3090, oneAPI, Package, performance portability, SYCL, Tesla V100
December 15, 2024 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study
- LLMQ: Efficient Lower-Precision LLM Training for Consumer GPUs
- AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
- An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU
- DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
- KernelFoundry: Hardware-aware evolutionary GPU kernel optimization
- MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- Mixed-precision numerics in scientific applications: survey and perspectives
- True 4-Bit Quantized Convolutional Neural Network Training on CPU: Achieving Full-Precision Parity
* * *




