hgpu.org » nVidia GTX Titan X
Erfan Bank Tavakoli, Michael Riera, Masudul Hassan Quraishi, Fengbo Ren
Tags: Algorithms, Computer science, FPGA, HPC, Linear Algebra, Matrix multiplication, nVidia, nVidia GTX Titan X, OpenCL, Sparse matrix
December 26, 2021 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- An HPC Benchmark Survey and Taxonomy for Characterization
- Home-made Diffusion Model from Scratch to Hatch
- High Performance Matrix Multiplication
- Towards Robust Agentic CUDA Kernel Benchmarking, Verification, and Optimization
- Dato: A Task-Based Programming Model for Dataflow Accelerators
- TRUST: the HPC open-source CFD platform – from CPU to GPU
- Mojo: MLIR-Based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem
- Towards Calculating HPC CUDA Kernel Performance on Nvidia GPUs
- Combining Performance and Productivity: Accelerating the Network Sensing Graph Challenge with GPUs and Commodity Data Science Software
- Towards GPU Parallelism Abstractions in Rust: A Case Study with Linear Pipelines
* * *