Papers on hgpu.org (.txt-file)

Block-Relaxation Methods for 3D Constant-Coefficient Stencils on GPUs and Multicore CPUs Download

Block-Size Independence for GPU Programs Download

Blockchain Goes Green? Part II: Characterizing the Performance and Cost of Blockchains on the Cloud and at the Edge Download Package

Blocked All-Pairs Shortest Paths Algorithm on Intel Xeon Phi KNL Processor: A Case Study Download

Blocking Self-avoiding Walks Stops Cyber-epidemics: A Scalable GPU-based Approach Download

Blocks and Fuel: Frameworks for deep learning Download Package

Blum Blum Shub on the GPU Download

Boda-RTC: Productive Generation of Portable, Efficient Code for Convolutional Neural Networks on Mobile Computing Platforms Download

Bohrium: Unmodified NumPy Code on CPU, GPU, and Cluster Download

Boids that see: Using self-occlusion for simulating large groups on GPUs

Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance Download

Bone structure analysis on multiple GPGPUs Download

Bone Structure Analysis with GPGPUs Download

Bonsai: A GPU Tree-Code Download Package

Boosted Algorithms for Visual Object Detection on Graphics Processing Units Download

Boosting GPU Virtualization Performance with Hybrid Shadow Page Tables Download

Boosting Java Performance using GPGPUs Download

Boosting quantum evolutions using Trotter-Suzuki algorithms on GPUs Download

Boosting sphere decoding speed through Graphic Processing Units Download Package

BootCMatchG: An adaptive Algebraic MultiGrid linear solver for GPUs Download Package

BOPM implemented on a GPU-architecture Download

Bothnia: a dual-personality extension to the Intel integrated graphics driver

Bottleneck Analysis of Dynamic Graph Neural Network Inference on CPU and GPU Download Package

Bouncing Behavior of Microscopic Dust Aggregates Download

Bound the Peak Performance of SGEMM on GPU with software-controlled fast memory Download

Bounding the effect of partition camping in GPU kernels Download

Bounds Checking on GPU Download Package

Bounds on the Energy Consumption of Computational Kernels Download Package

Brain perfusion imaging: performance and accuracy Download

BrainCove: A Tool for Voxel-wise fMRI Brain Connectivity Visualization Download Package

BrainFrame: A heterogeneous accelerator platform for neuron simulations Download

BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism Download

Branch and Data Herding: Reducing Control and Memory Divergence for Error-tolerant GPU Applications Download

Breadth First Search Vectorization on the Intel Xeon Phi Download

Breadth-First Search using Dynamic Parallelism on the GPU Download Package

Breaking DVB-CSA Download

Breaking ECC2K-130 Download

Breaking the GPU programming barrier with the auto-parallelising SAC compiler

Bridging Control-Centric and Data-Centric Optimization Download Package

Bridging OpenCL and CUDA: A Comparative Analysis and Translation Download

Bridging parallel and reconfigurable computing with multilevel PGAS and SHMEM+ Download

Bridging the Gap between FPGAs and Multi-Processor Architectures: A Video Processing Perspective Download

Bridging the GPGPU-FPGA efficiency gap

Bridging the Performance-Programmability Gap for FPGAs via OpenCL: A Case Study with OpenDwarfs Download

Bridging the Semantic Gaps of GPU Acceleration for Scaleout CNN-based Big Data Processing: Think Big, See Small Download

Brief announcement: better speedups for parallel max-flow Download

Brief Announcement: On the Limits of Parallelizing Convolutional Neural Networks on GPUs Download

Bringing OpenCL to Commodity RISC-V CPUs Download Package

Bringing Parallel Performance to Python with Domain-Specific Selective Embedded Just-in-Time Specialization Download

Brook for GPUs: Stream Computing on Graphics Hardware Download

Brownian Dynamics of Active Sphere Suspensions Confined Near a No-Slip Boundary Download Package

Brownian dynamics simulations on CPU and GPU with BD_BOX Package

Browsing a Large Collection of Community Photos Based on Similarity on GPU

Browsing Large Image Datasets through Voronoi Diagrams Download

Brute force de-shredding algorithm using the GPU Download Package

Brute-Force k-Nearest Neighbors Search on the GPU Download

BSGP: bulk-synchronous GPU programming Download

Buffer k-d Trees: Processing Massive Nearest Neighbor Queries on GPUs Download

Buffer overflow vulnerabilities in CUDA: a preliminary analysis Download

Bufferless NOC Simulation of Large Multicore System on GPU Hardware Download

Build and Travel KD-Tree with CUDA Download Package

Building a Performance Model for Deep Learning Recommendation Model Training on GPUs Download Package

Building a Personal High Performance Computer with Heterogeneous Processors

Building a Real-Time Multi-GPU Platform: Robust Real-Time Interrupt Handling Despite Closed-Source Drivers Download Package

Building Correlators with Many-Core Hardware Download

Building Human Brain Network in 3D Coefficient Map Determined by X-ray Microtomography Download Package

Building Multiclass Nonlinear Classifiers with GPUs Download

Building Source-to-Source Compilers for Heterogeneous Targets Download

Building-Blocks for Performance Oriented DSLs Download

Bulk Execution of Oblivious Algorithms on the Unified Memory Machine, with GPU Implementation Download

Bulk GCD Computation Using a GPU to Break Weak RSA Keys Download

Bump Mapping Unparametrized Surfaces on the GPU Download

Bundled depth-map merging for multi-view stereo Download

Burrows-Wheeler Aligner: A Parallel Approach Download

BVH for efficient raytracing of dynamic metaballs on GPU Download

C and CUDA Implementation for SIRT and SART Reconstruction Algorithms Download

C Language Extensions for Hybrid CPU/GPU Programming with StarPU Download Package

C to Cellular Automata and Execution on CPU, GPU and FPGA Download

C-DAC’s Efforts – Application Kernels on HPC Cluster with GPU Accelerators Download

C-for-Metal: High Performance SIMD Programming on Intel GPUs Download Package

C++ AMP: Accelerated Massive Parallelism with Microsoft Visual C++ Download Package

Cache and bandwidth aware matrix multiplication on the GPU Download

Cache Miss Analysis for GPU Programs Based on Stack Distance Profile

Cache-efficient numerical algorithms using graphics hardware Download

CADDIES: A New Framework for Rapid Development of Parallel Cellular Automata Algorithms for Flood Simulation Download

Caffe con Troll: Shallow Ideas to Speed Up Deep Learning Download

Caffe: Convolutional Architecture for Fast Feature Embedding Download Package

Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks Download Package

Caffeine: Towards Uniformed Representation and Acceleration for Deep Convolutional Neural Networks Download

CaffeLink: Mathematica binding for Caffe Deep Learning Framework Download Package

CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms Download Package

Calamari – A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition Download Package

Calculation by artificial compressibility method and virtual flux method on GPU

Calculation of fermion loops for eta-prime and nucleon scalar and electromagnetic form factors Download

Calculation of Force Field Grids for Molecular Docking Using Graphics Processing Unit Download

Calculation of HELAS amplitudes for QCD processes using graphics processing unit (GPU) Download

Calculation of Stochastic Heating and Emissivity of Cosmic Dust Grains with Optimization for the Intel Many Integrated Core Architecture Download

Calculation of weight vectors for wideband beamforming using Graphics Processing Units Download

CAMPAIGN: An open-source Library of GPU-accelerated Data Clustering Algorithms Package

Can CUDA be exposed through web services? Download

 

Brief statistics for this page

Titles: 100

Open PDFs available for download: 90

Packages: 28

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
