Tags: Code generation, Computer science, Embedded high-performance computing, nVidia, nVidia Jetson AGX Xavier, nVidia Jetson Nano, nVidia Jetson TX2, OpenCL, Tesla T4, Tesla V100, Thesis
Tags: Android, Computer science, Computer vision, Embedded high-performance computing, nVidia, nVidia GeForce GTX 660, OpenCL, Package, Thesis
Tags: Embedded high-performance computing, Energy-efficient computing, FPGA, GPU, Power-efficient computing
Tags: Computer science, CUDA, Embedded high-performance computing, GPGPU-sim, Memory, nVidia, Performance
Tags: Algorithms, ARM, Computer science, Embedded high-performance computing, OpenCL, Pattern Search
Tags: Algorithms, Computer science, CUDA, Embedded high-performance computing, nVidia, nVidia GeForce 8800 GTX, OpenMP, Performance, Ultrasound
Recent source codes
Most viewed papers (last 30 days)
- Automatic Generation of OpenCL Code through Polyhedral Compilation with LLM
- Deep Learning and Machine Learning with GPGPU and CUDA: Unlocking the Power of Parallel Computing
- Testing GPU Numerics: Finding Numerical Differences Between NVIDIA and AMD GPUs
- miniLB: A Performance Portability Study of Lattice-Boltzmann Simulations
- Accelerating Drug Discovery in AutoDock-GPU with Tensor Cores
- Intel(R) SHMEM: GPU-initiated OpenSHMEM using SYCL
- OpenACC offloading of the MFC compressible multiphase flow solver on AMD and NVIDIA GPUs
- Bitstream Database-Driven FPGA Programming Flow Based on Standard OpenCL
- Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models
- Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores