Tags: Benchmarking, Computer science, CUDA, MPI, nVidia, nVidia GeForce 8400 GS, nVidia GeForce 9400 GT, Operating systems, Performance, Tesla C1060, Tesla C2050, Tesla T10
Tags: APU, Computer science, GPU cluster, Heterogeneous systems, MPI, nVidia, nVidia GeForce GTX 480, OpenCL, Operating systems, Package
Tags: Algorithms, Benchmarking, Computer science, CUDA, Data Structures and Algorithms, nVidia, nVidia GeForce GTX 295, nVidia GeForce GTX 580, Operating systems
Tags: Computer science, CUDA, HLSL, nVidia, nVidia GeForce GT 230, nVidia GeForce GTX 470, nVidia GeForce GTX 580, OpenCL, Operating systems, Performance, Programming techniques
Tags: Algorithms, Cloud, Computer science, CUDA, nVidia, Operating systems, Performance, Tesla C2050, Virtualization
Tags: Computer science, CUDA, nVidia, Operating systems, Performance, Review, Software Engineering, Tutorial
Tags: Computer science, Heterogeneous systems, Memory, Operating systems, Performance, Programming Languages




