hgpu.org » Embedded high-performance computing
Seungtae Hong, Hyunwoo Cho, Jeong-Si Kim
December 12, 2021 by hgpu
Dongrui She, Yifan He, Luc Waeijen, Henk Corporaal
September 24, 2015 by hgpu
Max Danielsson, Thomas Sievert
Tags: Android, Computer science, Computer vision, Embedded high-performance computing, nVidia, nVidia GeForce GTX 660, OpenCL, Package, Thesis
August 24, 2015 by hgpu
Luna Backes, Alejandro Rico, Bjorn Franke
July 28, 2015 by hgpu
Sparsh Mittal
Tags: Embedded high-performance computing, Energy-efficient computing, FPGA, GPU, Power-efficient computing
January 7, 2015 by sparsh0mittal
Sudipta Chattopadhyay, Petru Eles, Zebo Peng
Tags: Computer science, CUDA, Embedded high-performance computing, GPGPU-sim, Memory, nVidia, Performance
September 19, 2014 by hgpu
Elena Aragon, Juan M. Jimenez, Arian Maghazeh, Jim Rasmusson, Unmesh D. Bordoloi
Tags: Algorithms, ARM, Computer science, Embedded high-performance computing, OpenCL, Pattern Search
September 11, 2014 by hgpu
Li Tian, Fugen Zhou, Cai Meng
May 18, 2014 by hgpu
Iype P. Joseph
March 15, 2014 by hgpu
Arslan Munir, Sanjay Ranka, Ann Gordon-Ross
April 6, 2012 by hgpu
Siddharth Nilakantan, Srikanth Annangi, Nikhil Gulati, Karthik Sangaiah, Mark Hempstead
Tags: Algorithms, Computer science, CUDA, Embedded high-performance computing, nVidia, nVidia GeForce 8800 GTX, OpenMP, Performance, Ultrasound
November 12, 2011 by hgpu
Kulin V. Seth
Tags: Benchmarking, Computer science, DSP, Embedded high-performance computing, Heterogeneous systems, OpenCL, Optimization, Thesis
September 23, 2011 by hgpu