Papers on hgpu.org (.txt-file)
Monte Carlo randomization tests for large-scale abundance datasets on the GPU
Monte Carlo simulation of photon migration in 3D turbid media accelerated by graphics processing units

Monte Carlo simulations on Graphics Processing Units

Monte-Carlo Black-Scholes Implementation using OpenCL Standard

More Bang For Your Buck(et): Fast and Space-efficient Hardware-accelerated Coarse-granular Indexing on GPUs

Morphological Proximity Priors: Spatial Relationships for Semantic Segmentation

Motion Compensation and Reconstruction of H.264/AVC Video Bitstreams using the GPU

Motion Estimation for H.264/AVC using Programmable Graphics Hardware

Motion Estimation with Non-Local Total Variation Regularization

Motion planning for autonomous driving with a conformal spatiotemporal lattice

Movement Tracking in Terrain Conditions Accelerated with CUDA

Moving Least-Squares Reconstruction of Large Models with GPUs

Mpache: Interaction Aware Multi-level Cache Bypassing on GPUs

MPC Toolbox with GPU Accelerated Optimization Algorithms

MPC: A Massively Parallel Compression Algorithm for Scientific Data

MPI Derived Datatypes Processing on Noncontiguous GPU-resident Data

MPI Parallelization of GPU-based Lattice Boltzmann Simulations

MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-Based Systems

MPI-GIS: New Parallel Overlay Algorithm and System Prototype

MPI-GPU parallelism in iterative eigensolvers for block-tridiagonal matrices

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

MR-API: A Comprehensive API Framework for Heterogeneous Multi-core Systems using Map Reduce Programming Model

Mr. Scan: Extreme Scale Density-Based Clustering using a Tree-Based Network of GPGPU Nodes

MrBayes on a Graphics Processing Unit

MrBayes tgMC3: A Tight GPU Implementation of MrBayes

MRCUDA: MapReduce Acceleration Framework Based on GPU

MRPB: Memory Request Prioritization for Massively Parallel Processors

MSA-CUDA: Multiple Sequence Alignment on Graphics Processing Units with CUDA

MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications

MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems

MSTg: Cryptographically strong pseudorandom number generator and its realization

MT4G: A Tool for Reliable Auto-Discovery of NVIDIA and AMD GPU Compute and Memory Topologies

mu-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batching

mu-grind: A Framework for Dynamically Instrumenting HLS-Generated RTL

Multi Agent Navigation on the GPU

Multi GPU Implementation of Iterative Tomographic Reconstruction Algorithms

Multi GPU Implementation of the Simplex Algorithm

Multi GPU Performance of Conjugate Gradient Algorithm with Staggered Fermions

Multi GPU Performance of Conjugate Gradient Solver with Staggered Fermions in Mixed Precision

Multi scale block histogram of template feature for pedestrian detection
Multi- and many-core data mining with adaptive sparse grids

Multi-Agent Systems and General-Purpose Computing on Graphics Processing Units: A Survey

Multi-agent traffic simulation with CUDA

Multi-camera real-time depth estimation with discontinuity handling on PC graphics hardware
Multi-Centroid PSO Classification Learning on the GPU

Multi-core CPU or GPU-accelerated Multiscale Modeling for Biomolecular Complexes

Multi-core CUDA Architecture for Parallelization of Hierarchical Text Clustering

Multi-core parallelism in a column-store

Multi-Core Programming Design Patterns: Stream Processing Algorithms for Dynamic Scene Perceptions

Multi-core programming with OpenCL: performance and portability: OpenCL in a memory bound scenario

Multi-dimensional characterization of electrostatic surface potential computation on graphics processors

Multi-dimensional characterization of temporal data mining on graphics processors

Multi-dimensional Functional Principal Component Analysis

Multi-Directional Optimisation on the GPU

Multi-domain, Higher Order Level Set Scheme for 3D Image Segmentation on the GPU

Multi-Elimination ILU Preconditioners on GPUs

Multi-fragment effects on the GPU using the k-buffer

Multi-GPGPU Cellular Automata Simulations using OpenACC

Multi-GPU accelerated multi-spin Monte Carlo simulations of the 2D Ising model

Multi-GPU Accelerated Parallel Algorithm of Wallis Transformation for Image Enhancement

Multi-GPU Acceleration of Black-Scholes Equation based Option Pricing

Multi-GPU and Multi-CPU Parallelization for Interactive Physics Simulations

Multi-GPU Based Lattice Boltzmann Method for Hemodynamic Simulation in Patient-Specific Cerebral Aneurysm

Multi-GPU based on multicriteria optimization for motion estimation system

Multi-GPU cluster wave propagation and OpenGL visualization

Multi-GPU Computing for Achieving Speedup in Real-time Aggregate Risk Analysis

Multi-GPU Distributed Parallel Bayesian Differential Topic Modelling

Multi-GPU Implementation for Iterative MR Image Reconstruction with Field Correction

Multi-GPU Implementation of a Hybrid Thermal Lattice Boltzmann Solver using the TheLMA Framework

Multi-GPU implementation of a VMAT treatment plan optimization algorithm

Multi-GPU Implementation of Machine Learning Algorithm using CUDA and OpenCL

Multi-GPU Implementation of the Minimum Volume Simplex Analysis Algorithm for Hyperspectral Unmixing

Multi-GPU implementation of the NICAM atmospheric model

Multi-GPU Implementation of the Uniformization Method for Solving Markov Models

Multi-GPU Island-Based Genetic Algorithm

Multi-GPU Island-Based Genetic Algorithm for Solving the Knapsack Problem

Multi-GPU Load Balancing for In-Situ Simulation and Visualization

Multi-GPU Load Balancing for In-situ Visualization

Multi-GPU numerical simulation of electromagnetic waves

Multi-GPU Parallel Computing and Task Scheduling under Virtualization

Multi-GPU parallel memetic algorithm for capacitated vehicle routing problem

Multi-GPU parallelization of a 3D Bayesian CT algorithm and its application on real foam reconstruction with incomplete data set

Multi-GPU Performance of Incompressible Flow Computation by Lattice Boltzmann Method on GPU Cluster
Multi-GPU Performance Optimization of a CFD Code using OpenACC on Different Platforms

Multi-GPU performance optimization of a computational fluid dynamics code using OpenACC

Multi-GPU Rendering with Vulkan API

Multi-GPU Support on Shared Memory System using Directive-based Programming Model

Multi-GPU Support on Single Node Using Directive-Based Programming Model

Multi-GPU Support on the Marrow Algorithmic Skeleton Framework

Multi-GPU thermal lattice Boltzmann simulations using OpenACC and MPI

Multi-GPU volume rendering using MapReduce

Multi-GPU-based Swendsen-Wang multi-cluster algorithm for the simulation of two-dimensional q-state Potts model

Multi-grain Parallel Processing of Data-Clustering on Programmable Graphics Hardware

Multi-hetero Acceleration by GPU and FPGA for Astrophysics Simulation on oneAPI Environment

Multi-Kepler GPU vs. Multi-Intel MIC for spin systems simulations

Multi-kernel Data Partitioning with Channel on OpenCL-based FPGAs

Titles: 100
open PDFs: 94
packages: 11
