Tags: Computer science, CUDA, Memory, nVidia, Operating systems, Performance, Tesla V100
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, OpenCL, Operating systems, PTX, SYCL, Thesis
Tags: Computer science, CUDA, Distributed computing, Heterogeneous systems, nVidia, nVidia Quadro FX 5800, OpenCL, Operating systems, StarPU, Task scheduling, Tesla C2050, Tesla K20, Tesla M2075, Thesis
Tags: Computer science, CUDA, nVidia, nVidia Jetson TK1, Operating systems, Performance, Security, SoC
Performance Evaluation of Container-based Virtualization for High Performance Computing Environments
Tags: Benchmarking, Computer science, CUDA, MPI, nVidia, Operating systems, Package, Performance, Tesla K20, Virtualization
Tags: Computer science, CUDA, Genetic programming, nVidia, nVidia GRID K520, Operating systems, Package
Tags: ATI, C++ AMP, Computer science, Operating systems
Tags: AMD Radeon R7 250, ATI, Cloud, Computer science, nVidia, nVidia GeForce GTX 750, OpenCL, Operating systems, Security
Recent source codes
Most viewed papers (last 30 days)
- Diagnosing FP4 inference: a layer-wise and block-wise sensitivity analysis of NVFP4 and MXFP4
- Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study
- EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery
- LLMQ: Efficient Lower-Precision LLM Training for Consumer GPUs
- KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization
- AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
- An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU
- MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
- DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
- KernelFoundry: Hardware-aware evolutionary GPU kernel optimization




