Tags: Code generation, Computer science, Embedded high-performance computing, nVidia, nVidia Jetson AGX Xavier, nVidia Jetson Nano, nVidia Jetson TX2, OpenCL, Tesla T4, Tesla V100, Thesis
Tags: Android, Computer science, Computer vision, Embedded high-performance computing, nVidia, nVidia GeForce GTX 660, OpenCL, Package, Thesis
Tags: Embedded high-performance computing, Energy-efficient computing, FPGA, GPU, Power-efficient computing
Tags: Computer science, CUDA, Embedded high-performance computing, GPGPU-sim, Memory, nVidia, Performance
Tags: Algorithms, ARM, Computer science, Embedded high-performance computing, OpenCL, Pattern Search
Tags: Algorithms, Computer science, CUDA, Embedded high-performance computing, nVidia, nVidia GeForce 8800 GTX, OpenMP, Performance, Ultrasound