hgpu.org » Computer science
Kai Zhu, Wenyi Zhao, Zhen Zheng, Tianyou Guo, Pengzhan Zhao, Junjie Bai, Jun Yang, Xiaoyong Liu, Lansong Diao, Wei Lin
Tags: Compilers, Computer science, CUDA, Machine learning, nVidia, Tesla T4
March 14, 2021 by hgpu
Tiago Augusto Engel, Andrea Schwertner Charao, Manuele Kirsch-Pinheiro, Luiz-Angelo Steffenel
Tags: Computer science, CUDA, Data mining, Java, Matrix multiplication, nVidia, nVidia Quadro K 2000, Package, Tesla K20, Tesla M2050
June 13, 2014 by hgpu