Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 480, Operating systems, Package
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 480, Operating systems, Package, Task scheduling
Tags: Algorithms, Benchmarking, Computer science, nVidia, nVidia Quadro FX 3800, OpenCL, OpenMP, Operating systems, Performance
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, Operating systems, Software Engineering, Tesla C2070
Tags: ATI, ATI Radeon HD 6970, Computer science, Heterogeneous systems, MPI, nVidia, nVidia GeForce GTX 480, OpenCL, Operating systems, Package
Tags: AES, ATI, ATI Radeon HD 6750 M, Computer science, nVidia, nVidia GeForce GTX 580, nVidia GeForce GTX 590, OpenCL, Operating systems, Security, Tesla M2070
Tags: Computer science, CUDA, Energy-efficient computing, nVidia, nVidia GeForce GTX 210, nVidia GeForce GTX 280, nVidia GeForce GTX 570, Operating systems, Task scheduling
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 480, Operating systems, Package
Tags: Cloud, Computer science, GPU cluster, nVidia, nVidia GeForce GTX 480, OpenCL, Operating systems, Package
Tags: ATI, ATI Radeon HD 5870, Computer science, Heterogeneous systems, OpenCL, Operating systems, Package