1173

Papers on hgpu.org (.txt-file)

Batched Matrix Computations on Hardware Accelerators Based on GPUs Download

Batched Multi Triangulation Download

Batched QR and SVD Algorithms on GPUs with Applications in Hierarchical Matrix Compression Download

Batched Shift Reduce Parsing with Lists of Vectors on CUDA Download Package

Bayesian Image Restoration Using A Large-scale Total Patch Variation Prior Download

Bayesian inference for artificial perception using OpenCL on FPGAs and GPUs Download

Bayesian model comparison via sequential Monte Carlo Download Package

Bayesian neural networks for detecting epistasis in genetic association studies Download Package

Bayesian Neural Networks for Genetic Association Studies of Complex Disease Download Package

Bayesian Neural Networks in Data-Intensive High Energy Physics Applications Download

Bayesian Optimization for auto-tuning GPU kernels Download Package

Bayesian real-time perception algorithms on GPU Download

Bayesian Sparse Unsupervised Learning for Probit Models of Binary Data Download

Bayesian Sparsity-Path-Analysis of Genetic Association Signal using Generalized t Priors Download

Bayesian State-Space Modelling on High-Performance Hardware Using LibBi Download Package

BbmTTP: Beat-based Parallel Simulated Annealing Algorithm on GPGPUs for the Mirrored Traveling Tournament Problem Download Package

BEAGLE: an Application Programming Interface and High-Performance Computing Library for Statistical Phylogenetics Download Package

Beam Dynamics Simulations Using GPUs Download

Beam Dynamics Simulations with a GPU-accelerated Version of ELEGANT Download

Beauty And The Beast: Exploiting GPUs In Haskell Download Package

Beehive SPIR-V Toolkit: A Composable and Functional API for Runtime SPIR-V Code Generation Download Package

Behavioral graph fraud detection in E-commerce Download

Behavioral Non-portability in Scientific Numeric Computing Download

Behavioral Spherical Harmonics for Long-Range Agents’ Interaction Download

Belief Propagation by Message Passing in Junction Trees: Computing Each Message Faster Using GPU Parallelization Download

Belief Propagation on the GPU for Stereo Vision Download

Believe it or Not! Multi-core CPUs Can Match GPU Performance for FLOP-intensive Application! Download

Bempp-cl: A fast Python based just-in-time compiling boundary element library Download Package

BenchDirect: A Directed Language Model for Compiler Benchmarks Download Package

BenchFriend: Correlating the Performance of GPU Benchmarks Download

BENCHIP: Benchmarking Intelligence Processors Download

Benchmarking a Proof-of-Concept Performance Portable SYCL-based Fast Fourier Transformation Library Download

Benchmarking Across Platforms: European Option Pricing Download

Benchmarking and Dissecting the Nvidia Hopper GPU Architecture Download

Benchmarking and Implementation of Probability-Based Simulations on Programmable Graphics Cards Download

Benchmarking and modelling of POWER7, Westmere, BG/P, and GPUs: an industry case study Download

Benchmarking and Optimization of Gradient Boosted Decision Tree Algorithms Download

Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor Download

Benchmarking Deep Learning Models on Jetson TX2 Download Package

Benchmarking GPU and CPU codes for Heisenberg spin glass overrelaxation

Benchmarking GPU and TPU Performance with Graph Neural Networks Download

Benchmarking GPU Devices with N-Body Simulations Download

Benchmarking GPUs to tune dense linear algebra Download

Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters Download Package

Benchmarking Intel Xeon Phi to Guide Kernel Design Download

Benchmarking Modern Edge Devices for AI Applications Download

Benchmarking Next Generation Hardware Platforms: An Experimental Approach Download

Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: programming productivity, performance, and energy consumption Download

Benchmarking optimization algorithms for auto-tuning GPU kernels Download

Benchmarking Parallel Performance on Many-Core Processors Download

Benchmarking performance of a hybrid Xeon/Xeon Phi system for parallel computation of similarity measures between large vectors Download

Benchmarking State-of-the-Art Deep Learning Software Tools Download Package

Benchmarking the cost of thread divergence in CUDA Download

Benchmarking the Intel Xeon Phi Coprocessor Download

Benchmarking the Memory Hierarchy of Modern GPUs Download Package

Benchmarking the Nvidia GPU Lineage: From Early K80 to Modern A100 with Asynchronous Memory Transfers Download

Benchmarking Thread Block Cluster Download

Benchmarking TPU, GPU, and CPU Platforms for Deep Learning Download

Benchmarks Based on Anti-Parallel Patterns for the Evaluation of GPUs Download

Benchmarks for Intel MIC Architecture Download

BenchPress: A Deep Active Benchmark Generator Download Package

Berkeley Dwarfs on CUDA Download

Best bang for your buck: GPU nodes for GROMACS biomolecular simulations Download Package

Best Practice Guide – GPGPU Download

Best Practice Guide – Intel Xeon Phi Download

Best Practice Guide Intel Xeon Phi v2.0 Download

Best-effort semantic document search on GPUs

Betatron tune measurement with the LHC damper using a GPU Download

Better GPU Hash Tables Download

Better speedups using simpler parallel programming for graph connectivity and biconnectivity Download

Betweenness Centrality on GPUs and Heterogeneous Architectures Download Package

Beyond 16GB: Out-of-Core Stencil Computations Download

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising Download Package

Beyond Amdahl’s Law: An Objective Function That Links Multiprocessor Performance Gains To Delay and Energy Download

Beyond Desktop Computation: Challenges in Scaling a GPU Infrastructure Download

Beyond programmable shading (parts I and II) Download

Beyond Straightforward Vectorization of Lightweight Data Compression Algorithms for Larger Vector Sizes Download

BFROST: Binary Features from Robust Orientation Segment Tests accelerated on the GPU Download

Bi-directional Path Tracing on GPU Download Package

Bidimensional Median Filter for Parallel Computing Architectures Download

BIDMach: Large-scale Learning with Zero Memory Allocation Download Package

Bifrost: a Python/C++ Framework for High-Throughput Stream Processing in Astronomy Download Package

Big Integer Multiplication with CUDA FFT (cuFFT) Library Download

Bigger Buffer k-d Trees on Multi-Many-Core Systems Download Package

BigKernel — High Performance CPU-GPU Communication Pipelining for Big Data-style Applications Download

Bilateral Filtering with CUDA Download Package

Billion-scale similarity search with GPUs Download Package

Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models Download

Binary Interval Search (BITS): A Scalable Algorithm for Counting Interval Intersections Download Package

Binary Interval Search: a scalable algorithm for counting interval intersections Download Package

Binary Mesh Partitioning for Cache-Efficient Visualization Download

Binary Segmentation of Video Sequences in Real Time

BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 Download Package

Binaural Simulations Using Audio Rate FDTD Schemes and CUDA Download

Binomial American Option Pricing on CPU-GPU Hetergenous System Download

Bio-inspired computer visual system using GPU and Visual Pattern Assessment Language (ViPAL): Application on breast cancer prognosis Download

Bio-Inspired Optimization of Ultra-Wideband Patch Antennas Using Graphics Processing Unit Acceleration Download

Bio-sequence database scanning on a GPU Download

BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images Download Package

Bioinformatics Sequence Comparisons on Manycore Processors Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 97

Package packages: 30

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: