hgpu.org » nVidia GeForce GTX 3060
Tao Lu, Chengkun Wei, Ruijing Yu, Yi Chen, Li Wang, Chaochao Chen, Zeke Wang, and Wenzhi Chen
Tags: Algorithms, Benchmarking, Computer science, CUDA, Elliptic curves, Machine learning, nVidia, nVidia GeForce GTX 3060, Security, Tesla V100
October 9, 2022 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Profiling Apple Silicon Performance for ML Training
- Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis
- Column-Oriented Datalog on the GPU
- GSParLib: A multi-level programming interface unifying OpenCL and CUDA for expressing stream and data parallelism
- Boosting Performance of Iterative Applications on GPUs: Kernel Batching with CUDA Graphs
- Compiler Support for Speculation in Decoupled Access/Execute Architectures
- Exploring data flow design and vectorization with oneAPI for streaming applications on CPU+GPU
- A User's Guide to KSig: GPU-Accelerated Computation of the Signature Kernel
- Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs
- Towards autonomous resource management: Deep learning prediction of CPU-GPU load balancing
* * *