high performance computing on graphics processing units: hgpu.org

hgpu.org » Design space exploration

Compiler-centric across-stack deep learning acceleration

Perry Gibson

View

Tags: Compilers, Computer science, Deep learning, Design space exploration, nVidia, nVidia Jetson AGX Xavier, nVidia Titan RTX, Thesis

December 10, 2023 by hgpu

A Survey on Design Methodologies for Accelerating Deep Learning on Heterogeneous Architectures

Fabrizio Ferrandi, Serena Curzel, Leandro Fiorin, Daniele Ielmini, Cristina Silvano, Francesco Conti, Alessio Burrello, Francesco Barchi, Luca Benini, Luciano Lavagno, Teodoro Urso, Enrico Calore, Sebastiano Fabio Schifano, Cristian Zambelli, Maurizio Palesi, Giuseppe Ascia, Enrico Russo, Nicola Petra, Davide De Caro, Gennaro Di Meo, Valeria Cardellini, Salvatore Filippone, Francesco Lo Presti, Francesco Silvestri, Paolo Palazzari, Stefania Perri

View

Tags: AI, Artificial intelligence, Computer science, CUDA, Deep learning, Design space exploration, Hardware Architecture, Heterogeneous systems, Machine learning, Neural networks, nVidia, nVidia H100, OpenCL, survey

December 3, 2023 by hgpu

Compilation and Design Space Exploration of Dataflow Programs for Heterogeneous CPU-GPU Platforms

Aurélien François Gilbert Bloch

View

Tags: Compilers, Computer science, CUDA, Design space exploration, DSP, FPGA, Heterogeneous systems, nVidia, nVidia GeForce GTX 1660, nVidia GeForce RTX 3080 Ti, Package, Thesis

June 25, 2023 by hgpu

DeepAxe: A Framework for Exploration of Approximation and Reliability Trade-offs in DNN Accelerators

Mahdi Taheri, Mohamad Riazati, Mohammad Hasan Ahmadilivani, Maksim Jenihhin, Masoud Daneshtalab, Jaan Raik, Mikael Sjodin, Bjorn Lisper

View

Tags: Computer science, Deep learning, Design space exploration, FPGA, HLS, Neural networks, Package

March 5, 2023 by hgpu

Pulsar search acceleration using FPGAs and OpenCL templates

Julian Oppermann, Mitchell B. Mickaliger, Oliver Sinnen

View

Tags: Astrophysics, Design space exploration, FPGA, OpenCL, Package, Physics, Signal processing

January 29, 2023 by hgpu

Design Space Exploration of Concurrency Mapping to FPGAs in Weather and Climate Applications with Xilinx SDSoC OpenCL, SDSoC C++ and Vivad

Moteb Salem Alghamdi

View

Tags: Computer science, Design space exploration, FPGA, HLS, HPC, OpenCL, Thesis

November 27, 2022 by hgpu

An OpenCL-Based FPGA Accelerator for Faster R-CNN

Jianjing An, Dezheng Zhang, Ke Xu, Dong Wang

View

Tags: Computational Complexity, Computer science, Deep learning, Design space exploration, FPGA, Neural networks, nVidia, OpenCL, Package, RNN, Tesla K40

October 2, 2022 by hgpu

Lina: a fast design optimisation tool for software-based FPGA programming

Andre Bannwart Perina

View

Tags: Computer science, Design space exploration, FPGA, nVidia, nVidia Quadro K620, OpenCL, Package, Thesis

September 4, 2022 by hgpu

FPGA Acceleration of Structured-Mesh-Based Explicit and Implicit Numerical Solvers using SYCL

K. Kamalakkannan, G.R. Mudalige, I.Z. Reguly, S.A. Fahmy

View

Tags: Computer science, Design space exploration, FPGA, nVidia, SYCL, Tesla V100

May 8, 2022 by hgpu

Simulation Methodologies for Mobile GPUs

Kuba Kaszyk

View

Tags: Computer science, Design space exploration, OpenCL, simulation, Thesis

March 27, 2022 by hgpu

Studying the Potential of Automatic Optimizations in the Intel FPGA SDK for OpenCL

Adel Ejjeh, Vikram Adve, Rob Rutenbar

View

Tags: Computer science, Design space exploration, FPGA, HLS, OpenCL

January 16, 2022 by hgpu

CFU Playground: Full-Stack Open-Source Framework for Tiny Machine Learning (tinyML) Acceleration on FPGAs

Shvetank Prakash, Tim Callahan, Joseph Bushagour, Colby Banbury, Alan V. Green, Pete Warden, Tim Ansell, Vijay Janapa Reddi

View

Tags: Computer science, Design space exploration, FPGA, Machine learning, Package

January 9, 2022 by hgpu

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

GPU plugin for PySCF

Python-Based Quantum Chemistry Calculations with GPU Acceleration

QArray

QArray: a GPU-accelerated constant capacitance model simulator for large quantum dot arrays

Celerity: High-level C++ for Accelerator Clusters

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Parallel Gaussian process with kernel approximation in CUDA

See all packages

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Login | Sitemap | Feedback | Policy

Contact us: