Papers on hgpu.org (.txt-file)
Molecular Dynamics Simulations Using Graphics Processing Units

Molecular dynamics simulations with many-body potentials on multiple GPUs – the implementation, package and performance

Molecular Simulation of ab Initio Protein Folding for a Millisecond Folder NTL9(1-39)

Molecular Simulations using CUDA

Molecular structural mechanics approach to carbon nanotubes on graphics processing units

Monitoring Collective Communication Among GPUs

Monitoring Large-scale Microblog on GPUs

Monitoring Multiple Streams with Dynamic Time Warping using Graphic Processors

Montage: A Neural Network Language Model-Guided JavaScript Engine Fuzzer

Montblanc: GPU accelerated Radio Interferometer Measurement Equations in support of Bayesian Inference for Radio Observations

Monte Carlo integration on GPU

Monte Carlo methods for massively parallel computers

Monte Carlo Modeling of Electron Transport Using CUDA Technology

Monte Carlo Path Tracing with OpenCL

Monte Carlo Radiative Transport on the GPU

Monte Carlo randomization tests for large-scale abundance datasets on the GPU

Monte Carlo simulation of photon migration in 3D turbid media accelerated by graphics processing units

Monte Carlo simulations on Graphics Processing Units

Monte-Carlo Black-Scholes Implementation using OpenCL Standard

More Bang For Your Buck(et): Fast and Space-efficient Hardware-accelerated Coarse-granular Indexing on GPUs

Morphological Proximity Priors: Spatial Relationships for Semantic Segmentation

Motion Compensation and Reconstruction of H.264/AVC Video Bitstreams using the GPU

Motion Estimation for H.264/AVC using Programmable Graphics Hardware

Motion Estimation with Non-Local Total Variation Regularization

Motion planning for autonomous driving with a conformal spatiotemporal lattice

Movement Tracking in Terrain Conditions Accelerated with CUDA

Moving Least-Squares Reconstruction of Large Models with GPUs

Mpache: Interaction Aware Multi-level Cache Bypassing on GPUs

MPC Toolbox with GPU Accelerated Optimization Algorithms

MPC: A Massively Parallel Compression Algorithm for Scientific Data

MPI Derived Datatypes Processing on Noncontiguous GPU-resident Data

MPI Parallelization of GPU-based Lattice Boltzmann Simulations

MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-Based Systems

MPI-GIS: New Parallel Overlay Algorithm and System Prototype

MPI-GPU parallelism in iterative eigensolvers for block-tridiagonal matrices

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

MR-API: A Comprehensive API Framework for Heterogeneous Multi-core Systems using Map Reduce Programming Model

Mr. Scan: Extreme Scale Density-Based Clustering using a Tree-Based Network of GPGPU Nodes

MrBayes on a Graphics Processing Unit

MrBayes tgMC3: A Tight GPU Implementation of MrBayes

MRCUDA: MapReduce Acceleration Framework Based on GPU

MRPB: Memory Request Prioritization for Massively Parallel Processors

MSA-CUDA: Multiple Sequence Alignment on Graphics Processing Units with CUDA

MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications

MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems

MSTg: Cryptographically strong pseudorandom number generator and its realization

MT4G: A Tool for Reliable Auto-Discovery of NVIDIA and AMD GPU Compute and Memory Topologies

mu-cuDNN: Accelerating Deep Learning Frameworks with Micro-Batching

mu-grind: A Framework for Dynamically Instrumenting HLS-Generated RTL

Multi Agent Navigation on the GPU

Multi GPU Implementation of Iterative Tomographic Reconstruction Algorithms

Multi GPU Implementation of the Simplex Algorithm

Multi GPU Performance of Conjugate Gradient Algorithm with Staggered Fermions

Multi GPU Performance of Conjugate Gradient Solver with Staggered Fermions in Mixed Precision

Multi scale block histogram of template feature for pedestrian detection

Multi- and many-core data mining with adaptive sparse grids

Multi-Agent Systems and General-Purpose Computing on Graphics Processing Units: A Survey

Multi-agent traffic simulation with CUDA

Multi-camera real-time depth estimation with discontinuity handling on PC graphics hardware

Multi-Centroid PSO Classification Learning on the GPU

Multi-core CPU or GPU-accelerated Multiscale Modeling for Biomolecular Complexes

Multi-core CUDA Architecture for Parallelization of Hierarchical Text Clustering

Multi-core parallelism in a column-store

Multi-Core Programming Design Patterns: Stream Processing Algorithms for Dynamic Scene Perceptions

Multi-core programming with OpenCL: performance and portability: OpenCL in a memory bound scenario

Multi-dimensional characterization of electrostatic surface potential computation on graphics processors

Multi-dimensional characterization of temporal data mining on graphics processors

Multi-dimensional Functional Principal Component Analysis

Multi-Directional Optimisation on the GPU

Multi-domain, Higher Order Level Set Scheme for 3D Image Segmentation on the GPU

Multi-Elimination ILU Preconditioners on GPUs

Multi-fragment effects on the GPU using the k-buffer

Multi-GPGPU Cellular Automata Simulations using OpenACC

Multi-GPU accelerated multi-spin Monte Carlo simulations of the 2D Ising model

Multi-GPU Accelerated Parallel Algorithm of Wallis Transformation for Image Enhancement

Multi-GPU Acceleration of Black-Scholes Equation based Option Pricing

Multi-GPU and Multi-CPU Parallelization for Interactive Physics Simulations

Multi-GPU Based Lattice Boltzmann Method for Hemodynamic Simulation in Patient-Specific Cerebral Aneurysm

Multi-GPU based on multicriteria optimization for motion estimation system

Multi-GPU cluster wave propagation and OpenGL visualization

Multi-GPU Computing for Achieving Speedup in Real-time Aggregate Risk Analysis

Multi-GPU Distributed Parallel Bayesian Differential Topic Modelling

Multi-GPU Implementation for Iterative MR Image Reconstruction with Field Correction

Multi-GPU Implementation of a Hybrid Thermal Lattice Boltzmann Solver using the TheLMA Framework

Multi-GPU implementation of a VMAT treatment plan optimization algorithm

Multi-GPU Implementation of Machine Learning Algorithm using CUDA and OpenCL

Multi-GPU Implementation of the Minimum Volume Simplex Analysis Algorithm for Hyperspectral Unmixing

Multi-GPU implementation of the NICAM atmospheric model

Multi-GPU Implementation of the Uniformization Method for Solving Markov Models

Multi-GPU Island-Based Genetic Algorithm

Multi-GPU Island-Based Genetic Algorithm for Solving the Knapsack Problem

Multi-GPU Load Balancing for In-Situ Simulation and Visualization

Multi-GPU Load Balancing for In-situ Visualization

Multi-GPU numerical simulation of electromagnetic waves

Multi-GPU Parallel Computing and Task Scheduling under Virtualization

Titles: 100
open PDFs: 92
packages: 14
