Tags: Algorithms, ATI, ATI Radeon HD 6550, Computer science, Heterogeneous systems, List ranking, OpenCL
Tags: Algorithms, Cell processor, Computer science, CUDA, Distributed computing, List ranking, nVidia, nVidia GeForce GTX 280, nVidia GeForce GTX 580, OpenCL, Sorting, Thesis
Tags: Algorithms, Computer science, CUDA, Heterogeneous systems, Information Retrieval, List ranking, nVidia, nVidia GeForce GTX 480, Tesla C1060, Tesla C2050, Thesis
Tags: Algorithms, Computer science, CUDA, List ranking, Monte Carlo simulation, nVidia, Pseudo-random number generators, Tesla C1060
Tags: Algorithms, Computer science, CUBLAS, CUDA, List ranking, nVidia, Sparse matrix, Tesla T20
Tags: Algorithms, Computer science, CUDA, List ranking, nVidia, Sorting, Tesla C1060
Tags: ATI, ATI CAL, ATI IL, ATI Radeon HD 5870, ATI Stream, Computer science, List ranking, OpenCL, Sparse matrix
Tags: Computer science, CUDA, List ranking, nVidia, Tesla C1060