hgpu.org » nVidia GeForce GXT 980
Viktor Rosenfeld, Sebastian Bress, Steffen Zeuch, Tilmann Rabl, Volker Markl
Tags: Algorithms, AMD Radeon R9 Fury, ATI, Computer science, Hashing, nVidia, nVidia GeForce GXT 1080, nVidia GeForce GXT 980, OpenCL, Performance, Tesla K40, Tesla V100
June 16, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures
- KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators
* * *



