hgpu.org » nVidia GeForce GTX 780 M
Raphael Hiesgen, Dominik Charousset, Thomas C. Schmidt
Tags: Computer science, Data parallelism, Heterogeneous systems, Intel Xeon Phi, nVidia, nVidia GeForce GTX 780 M, OpenCL, Package, Tesla C2075
September 28, 2017 by hgpu
David S. Lawrie
March 10, 2016 by hgpu