Programming
Neil G. Dickson, Kamran Karimi, Firas Hamze
October 27, 2010 by hgpu
Nikolaj Leischner, Vitaly Osipov, Peter Sanders
Tags: Computer science, CUDA, Data Structures and Algorithms, nVidia, nVidia GeForce GTX 285, Sorting, Tesla C1060
October 27, 2010 by hgpu