hgpu.org » nVidia GeForce RTX 3090 Ti
Wentao Chen, Jiace Zhu, Qi Fan, Yehan Ma, An Zou
Tags: Artificial intelligence, Code generation, Computer sceince, CUDA, LLM, nVidia, nVidia GeForce GTX 1660, nVidia GeForce RTX 3090 Ti
June 15, 2025 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Omniwise: Predicting GPU Kernels Performance with LLMs
- P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code
- Engineering Supercomputing Platforms for Biomolecular Applications
- GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis
- A First Look at Bugs in LLM Inference Engines
- Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing
- Efficient GPU Implementation of Multi-Precision Integer Division
- ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks
- No More Shading Languages: Compiling C++ to Vulkan Shaders
- WiLLM: An Open Wireless LLM Communication System
* * *