Tags: Algorithms, ATI, ATI Radeon HD 6550, Computer science, Heterogeneous systems, List ranking, OpenCL
Tags: Algorithms, Cell processor, Computer science, CUDA, Distributed computing, List ranking, nVidia, nVidia GeForce GTX 280, nVidia GeForce GTX 580, OpenCL, Sorting, Thesis
Tags: Algorithms, Computer science, CUDA, Heterogeneous systems, Information Retrieval, List ranking, nVidia, nVidia GeForce GTX 480, Tesla C1060, Tesla C2050, Thesis
Tags: Algorithms, Computer science, CUDA, List ranking, Monte Carlo simulation, nVidia, Pseudo-random number generators, Tesla C1060
Tags: Algorithms, Computer science, CUBLAS, CUDA, List ranking, nVidia, Sparse matrix, Tesla T20
Tags: Algorithms, Computer science, CUDA, List ranking, nVidia, Sorting, Tesla C1060
Tags: ATI, ATI CAL, ATI IL, ATI Radeon HD 5870, ATI Stream, Computer science, List ranking, OpenCL, Sparse matrix
Tags: Computer science, CUDA, List ranking, nVidia, Tesla C1060
Recent source codes
Most viewed papers (last 30 days)
- Optimizing CUDA like a Human: Micro-Profiling Tools as Expert Surrogates for LLM-Based GPU Kernel Optimization
- MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU
- KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators
- Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
- CodegenBench: Can LLMs Write Efficient Code Across Architectures?
- daVinci-kernel: Co-Evolving Skill Selection, Summarization, and Utilization via RL for GPU Kernel Optimization
- Leveraging AI Ecosystem for Portable and Sustainable GPU Kernels in HPC
- AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning
- Autonomous heterogeneous catalyst discovery with a self-evolving multi-agent digital twin
- Tangram: Hiding GPU Heterogeneity for Efficient LLM Parallelization



