Tags: Algorithms, ATI, ATI Radeon HD 6550, Computer science, Heterogeneous systems, List ranking, OpenCL
Tags: Algorithms, Cell processor, Computer science, CUDA, Distributed computing, List ranking, nVidia, nVidia GeForce GTX 280, nVidia GeForce GTX 580, OpenCL, Sorting, Thesis
Tags: Algorithms, Computer science, CUDA, Heterogeneous systems, Information Retrieval, List ranking, nVidia, nVidia GeForce GTX 480, Tesla C1060, Tesla C2050, Thesis
Tags: Algorithms, Computer science, CUDA, List ranking, Monte Carlo simulation, nVidia, Pseudo-random number generators, Tesla C1060
Tags: Algorithms, Computer science, CUBLAS, CUDA, List ranking, nVidia, Sparse matrix, Tesla T20
Tags: Algorithms, Computer science, CUDA, List ranking, nVidia, Sorting, Tesla C1060
Tags: ATI, ATI CAL, ATI IL, ATI Radeon HD 5870, ATI Stream, Computer science, List ranking, OpenCL, Sparse matrix
Tags: Computer science, CUDA, List ranking, nVidia, Tesla C1060