User homepage
You must sign in before.
* * *
Recent source codes
* * *
Most viewed papers (last 30 days)
- Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml
- Reducing Synchronous GPU Memory Transfers: Design and implementation of a Futhark compiler optimisation
- Design and Implementation of CNN-FPGA accelerator based on Open Computing Language
- High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs
- DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware
- Demystifying Dependency Bugs in Deep Learning Stack
- Theseus: A Library for Differentiable Nonlinear Optimization
- Heterogeneous Energy-aware Load Balancing for Industry 4.0 and IoT Environments
- The OpenMP Cluster Programming Model
- CPU-GPU Layer-Switched Low Latency CNN Inference
* * *