Papers on hgpu.org (.txt-file)

BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPU Download

Barnes-Hut treecode on GPU

Barra, a Modular Functional GPU Simulator for GPGPU Download Package

Barra: A Parallel Functional Simulator for GPGPU Download Package

BarraCUDA – a fast short read sequence aligner using graphics processing units Download Package

Barrier Invariants: A Shared State Abstraction for the Analysis of Data-Dependent GPU Kernels Download Package

Barycentric coordinates computation in homogeneous coordinates Download

BASEMENT v3: a modular freeware for river process modelling over multiple computational backends Download Package

Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts Download

BAT: A Benchmark suite for AutoTuners Download Package

Batch Method for Efficient Resource Sharing in Real-time Multi-GPU Systems Download

Batch Records Insertion into Multidimensional Linear Dynamic Hashing Table on GPU Download

Batched Kronecker product for 2-D matrices and 3-D arrays on NVIDIA GPUs Download

Batched Linear Algebra Problems on GPU Accelerators Download

Batched Matrix Computations on Hardware Accelerators Download

Batched Matrix Computations on Hardware Accelerators Based on GPUs Download

Batched Multi Triangulation Download

Batched QR and SVD Algorithms on GPUs with Applications in Hierarchical Matrix Compression Download

Batched Shift Reduce Parsing with Lists of Vectors on CUDA Download Package

Bayesian Image Restoration Using A Large-scale Total Patch Variation Prior Download

Bayesian inference for artificial perception using OpenCL on FPGAs and GPUs Download

Bayesian model comparison via sequential Monte Carlo Download Package

Bayesian neural networks for detecting epistasis in genetic association studies Download Package

Bayesian Neural Networks for Genetic Association Studies of Complex Disease Download Package

Bayesian Neural Networks in Data-Intensive High Energy Physics Applications Download

Bayesian Optimization for auto-tuning GPU kernels Download Package

Bayesian real-time perception algorithms on GPU Download

Bayesian Sparse Unsupervised Learning for Probit Models of Binary Data Download

Bayesian Sparsity-Path-Analysis of Genetic Association Signal using Generalized t Priors Download

Bayesian State-Space Modelling on High-Performance Hardware Using LibBi Download Package

BbmTTP: Beat-based Parallel Simulated Annealing Algorithm on GPGPUs for the Mirrored Traveling Tournament Problem Download Package

BEAGLE: an Application Programming Interface and High-Performance Computing Library for Statistical Phylogenetics Download Package

Beam Dynamics Simulations Using GPUs Download

Beam Dynamics Simulations with a GPU-accelerated Version of ELEGANT Download

Beauty And The Beast: Exploiting GPUs In Haskell Download Package

Beehive SPIR-V Toolkit: A Composable and Functional API for Runtime SPIR-V Code Generation Download Package

Behavioral graph fraud detection in E-commerce Download

Behavioral Non-portability in Scientific Numeric Computing Download

Behavioral Spherical Harmonics for Long-Range Agents’ Interaction Download

Belief Propagation by Message Passing in Junction Trees: Computing Each Message Faster Using GPU Parallelization Download

Belief Propagation on the GPU for Stereo Vision Download

Believe it or Not! Multi-core CPUs Can Match GPU Performance for FLOP-intensive Application! Download

Bempp-cl: A fast Python based just-in-time compiling boundary element library Download Package

BenchDirect: A Directed Language Model for Compiler Benchmarks Download Package

BenchFriend: Correlating the Performance of GPU Benchmarks Download

BENCHIP: Benchmarking Intelligence Processors Download

Benchmarking a Proof-of-Concept Performance Portable SYCL-based Fast Fourier Transformation Library Download

Benchmarking Across Platforms: European Option Pricing Download

Benchmarking and Dissecting the Nvidia Hopper GPU Architecture Download

Benchmarking and Implementation of Probability-Based Simulations on Programmable Graphics Cards Download

Benchmarking and modelling of POWER7, Westmere, BG/P, and GPUs: an industry case study Download

Benchmarking and Optimization of Gradient Boosted Decision Tree Algorithms Download

Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor Download

Benchmarking Deep Learning Models on Jetson TX2 Download Package

Benchmarking GPU and CPU codes for Heisenberg spin glass overrelaxation

Benchmarking GPU and TPU Performance with Graph Neural Networks Download

Benchmarking GPU Devices with N-Body Simulations Download

Benchmarking GPUs to tune dense linear algebra Download

Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters Download Package

Benchmarking Intel Xeon Phi to Guide Kernel Design Download

Benchmarking Modern Edge Devices for AI Applications Download

Benchmarking Next Generation Hardware Platforms: An Experimental Approach Download

Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: programming productivity, performance, and energy consumption Download

Benchmarking optimization algorithms for auto-tuning GPU kernels Download

Benchmarking Parallel Performance on Many-Core Processors Download

Benchmarking performance of a hybrid Xeon/Xeon Phi system for parallel computation of similarity measures between large vectors Download

Benchmarking State-of-the-Art Deep Learning Software Tools Download Package

Benchmarking the cost of thread divergence in CUDA Download

Benchmarking the Intel Xeon Phi Coprocessor Download

Benchmarking the Memory Hierarchy of Modern GPUs Download Package

Benchmarking the Nvidia GPU Lineage: From Early K80 to Modern A100 with Asynchronous Memory Transfers Download

Benchmarking Thread Block Cluster Download

Benchmarking TPU, GPU, and CPU Platforms for Deep Learning Download

Benchmarks Based on Anti-Parallel Patterns for the Evaluation of GPUs Download

Benchmarks for Intel MIC Architecture Download

BenchPress: A Deep Active Benchmark Generator Download Package

Berkeley Dwarfs on CUDA Download

Best bang for your buck: GPU nodes for GROMACS biomolecular simulations Download Package

Best Practice Guide – GPGPU Download

Best Practice Guide – Intel Xeon Phi Download

Best Practice Guide Intel Xeon Phi v2.0 Download

Best-effort semantic document search on GPUs

Betatron tune measurement with the LHC damper using a GPU Download

Better GPU Hash Tables Download

Better speedups using simpler parallel programming for graph connectivity and biconnectivity Download

Betweenness Centrality on GPUs and Heterogeneous Architectures Download Package

Beyond 16GB: Out-of-Core Stencil Computations Download

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising Download Package

Beyond Amdahl’s Law: An Objective Function That Links Multiprocessor Performance Gains To Delay and Energy Download

Beyond Desktop Computation: Challenges in Scaling a GPU Infrastructure Download

Beyond programmable shading (parts I and II) Download

Beyond Straightforward Vectorization of Lightweight Data Compression Algorithms for Larger Vector Sizes Download

BFROST: Binary Features from Robust Orientation Segment Tests accelerated on the GPU Download

Bi-directional Path Tracing on GPU Download Package

Bidimensional Median Filter for Parallel Computing Architectures Download

BIDMach: Large-scale Learning with Zero Memory Allocation Download Package

Bifrost: a Python/C++ Framework for High-Throughput Stream Processing in Astronomy Download Package

Big Integer Multiplication with CUDA FFT (cuFFT) Library Download

Bigger Buffer k-d Trees on Multi-Many-Core Systems Download Package

BigKernel — High Performance CPU-GPU Communication Pipelining for Big Data-style Applications Download

 

Brief statistics for this page

Titles: 100

Open PDFs available for download: 97

Packages: 30

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hgpu.org