hgpu.org » AI
Nicolas Weber, Florian Schmidt, Mathias Niepert, Felipe Huici
Tags: AI, Artificial intelligence, cache, CNN, Computer science, cpu, CUDA, Deep learning, GPU, Machine learning, Neural and Evolutionary Computing, nVidia, nVidia GeForce GTX 1080 Ti
April 25, 2018 by hgpu
Craig McMillan, Emma Hart, Kevin Chalmers
April 15, 2015 by craigmcmillan01
Recent source codes
* * *
Most viewed papers (last 30 days)
- Acceleration as a Service (XaaS) Source Containers
- Omniwise: Predicting GPU Kernels Performance with LLMs
- Exploring SYCL as a Portability Layer for High-Performance Computing on CPUs
- All You Need Is Binary Search! A Practical View on Lightweight Database Indexing on GPUs
- CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
- Engineering Supercomputing Platforms for Biomolecular Applications
- GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis
- P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code
- chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations
- A First Look at Bugs in LLM Inference Engines
* * *