hgpu.org » nVidia GeForce GTX 3060
Tao Lu, Chengkun Wei, Ruijing Yu, Yi Chen, Li Wang, Chaochao Chen, Zeke Wang, and Wenzhi Chen
Tags: Algorithms, Benchmarking, Computer science, CUDA, Elliptic curves, Machine learning, nVidia, nVidia GeForce GTX 3060, Security, Tesla V100
October 9, 2022 by hgpu



