Tags: Algorithms, ATI, ATI Radeon HD 6550, Computer science, Heterogeneous systems, List ranking, OpenCL
Tags: Algorithms, Cell processor, Computer science, CUDA, Distributed computing, List ranking, nVidia, nVidia GeForce GTX 280, nVidia GeForce GTX 580, OpenCL, Sorting, Thesis
Tags: Algorithms, Computer science, CUDA, Heterogeneous systems, Information Retrieval, List ranking, nVidia, nVidia GeForce GTX 480, Tesla C1060, Tesla C2050, Thesis
Tags: Algorithms, Computer science, CUDA, List ranking, Monte Carlo simulation, nVidia, Pseudo-random number generators, Tesla C1060
Tags: Algorithms, Computer science, CUBLAS, CUDA, List ranking, nVidia, Sparse matrix, Tesla T20
Tags: Algorithms, Computer science, CUDA, List ranking, nVidia, Sorting, Tesla C1060
Tags: ATI, ATI CAL, ATI IL, ATI Radeon HD 5870, ATI Stream, Computer science, List ranking, OpenCL, Sparse matrix
Tags: Computer science, CUDA, List ranking, nVidia, Tesla C1060
Most viewed papers (last 30 days)
- Performance portability evaluation of blocked stencil computations on GPUs
- Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs
- OpenRAND: A Performance Portable, Reproducible Random Number Generation Library for Parallel Computations
- Solving MaxSAT with Matrix Multiplication
- On the Three P's of Parallel Programming for Heterogeneous Computing: Performance, Productivity, and Portability
- Performance Optimization of Deep Learning Sparse Matrix Kernels on Intel Max Series GPU
- Performance Tuning for GPU-Embedded Systems: Machine-Learning-based and Analytical Model-driven Tuning Methodologies
- A Comparison of the Performance of the Molecular Dynamics Simulation Package GROMACS Implemented in the SYCL and CUDA Programming Models
- CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types
- A Performance-Portable SYCL Implementation of CRK-HACC for Exascale