Papers on hgpu.org (.txt-file)
Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures

Modeling Deep Learning Accelerator Enabled GPUs

Modeling GPU Dynamic Parallelism for Self Similar Density Workloads

Modeling GPU-CPU Workloads and Systems

Modeling Image Patches with a Generic Dictionary of Mini-Epitomes

Modeling of Heat Diffusion Through Isotropic Media Using Graphical Processing Units

Modeling of Heterogeneous Architecture with GPU to Exascale System

Modeling of High Performance Programs to Support Heterogeneous Computing

Modeling of the behavior of 222 Rn progeny in diffusion chamber using CUDA

Modeling of tsunami waves and atmospheric swirling flows with graphics processing unit

Modeling Parallel Programs for Heterogeneous Computing

Modeling Parallel Programs using Large Language Models

Modeling Rotor Wakes with a Hybrid OVERFLOW-Vortex Method on a GPU Cluster

Modeling system for GPU parallel tasks performance simulation

Modeling the propagation of elastic waves using spectral elements on a cluster of 192 GPUs

Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices

Modeling the spatio-temporal evolution of fracture networks and fluid-rock interactions in GPU: Applications to lithospheric geodynamics

Modelling sea water intrusion in coastal aquifers using heterogeneous computing

Modelling the Formation of Ordered Acentrosomal Microtubule Arrays

Modelling, simulating and visualising the Cahn-Hilliard-Cook field equation

Modern GPGPU Frameworks and their Application to the Physical Core of the ASUCA Weather Prediction Model

Modern GPU-Based Forward-Projection Algorithm with a New Sampling Method
Modern Gyrokinetic Particle-In-Cell Simulation of Fusion Plasmas on Top Supercomputers

Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi

Modernization and Optimization of MPI Codes

Modernizing the core quantum chemistry algorithms

MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures

Modification of self-organizing migration algorithm for OpenCL framework

Modified Bloom filter for high performance hybrid NoSQL systems

Modified Levels of Parallel Odd-Even Transposition Sorting Network (OETSN) with GPU Computing using CUDA

Modular & Scalable Ultrasound Platform with GPU Processing

Modular Arithmetic for Solving Linear Equations on the GPU

Modular FPGA Systems with Support for Dynamic Workloads and Virtualisation

Modular Resultant Algorithm for Graphics Processors
Modular Technology in the Modelling of Large Virtual Environments in Driving Simulators
Moim: A Multi-GPU MapReduce Framework

Mojo: MLIR-Based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem

Molecular Activity Prediction using Deep Learning Software Library

Molecular Distance Geometry Optimization Using Geometric Build-up and Evolutionary Techniques on GPU

Molecular Docking on FPGA and GPU Platforms

Molecular dynamics for long-range interacting systems on Graphic Processing Units

Molecular Dynamics on a Grand Scale

Molecular dynamics recipes for genome research

Molecular Dynamics Simulation Based on Hadoop MapReduce

Molecular dynamics simulation of complex multiphase flow on a computer cluster with GPUs

Molecular Dynamics Simulation of Macromolecules Using Graphics Processing Unit

Molecular Dynamics Simulation of Multi-Scale Flows on GPUs

Molecular dynamics simulation of the supercooled Al melt on GPUs

Molecular dynamics simulation of UO2 nanocrystals melting

Molecular dynamics simulations of the relaxation processes in the condensed matter on GPUs
Molecular Dynamics Simulations on Commodity GPUs with CUDA

Molecular dynamics simulations through GPU video games technologies

Molecular Dynamics Simulations Using Graphics Processing Units
Molecular dynamics simulations with many-body potentials on multiple GPUs – the implementation, package and performance

Molecular Simulation of ab Initio Protein Folding for a Millisecond Folder NTL9(1-39)
Molecular Simulations using CUDA

Molecular structural mechanics approach to carbon nanotubes on graphics processing units
Monitoring Collective Communication Among GPUs

Monitoring Large-scale Microblog on GPUs

Monitoring Multiple Streams with Dynamic Time Warping using Graphic Processors

Montage: A Neural Network Language Model-Guided JavaScript Engine Fuzzer

Montblanc: GPU accelerated Radio Interferometer Measurement Equations in support of Bayesian Inference for Radio Observations

Monte Carlo integration on GPU

Monte Carlo methods for massively parallel computers

Monte Carlo Modeling of Electron Transport Using CUDA Technology

Monte Carlo Path Tracing with OpenCL

Monte Carlo Radiative Transport on the GPU

Monte Carlo randomization tests for large-scale abundance datasets on the GPU
Monte Carlo simulation of photon migration in 3D turbid media accelerated by graphics processing units

Monte Carlo simulations on Graphics Processing Units

Monte-Carlo Black-Scholes Implementation using OpenCL Standard

More Bang For Your Buck(et): Fast and Space-efficient Hardware-accelerated Coarse-granular Indexing on GPUs

Morphological Proximity Priors: Spatial Relationships for Semantic Segmentation

Motion Compensation and Reconstruction of H.264/AVC Video Bitstreams using the GPU

Motion Estimation for H.264/AVC using Programmable Graphics Hardware

Motion Estimation with Non-Local Total Variation Regularization

Motion planning for autonomous driving with a conformal spatiotemporal lattice

Movement Tracking in Terrain Conditions Accelerated with CUDA

Moving Least-Squares Reconstruction of Large Models with GPUs

Mpache: Interaction Aware Multi-level Cache Bypassing on GPUs

MPC Toolbox with GPU Accelerated Optimization Algorithms

MPC: A Massively Parallel Compression Algorithm for Scientific Data

MPI Derived Datatypes Processing on Noncontiguous GPU-resident Data

MPI Parallelization of GPU-based Lattice Boltzmann Simulations

MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-Based Systems

MPI-GIS: New Parallel Overlay Algorithm and System Prototype

MPI-GPU parallelism in iterative eigensolvers for block-tridiagonal matrices

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

MR-API: A Comprehensive API Framework for Heterogeneous Multi-core Systems using Map Reduce Programming Model

Mr. Scan: Extreme Scale Density-Based Clustering using a Tree-Based Network of GPGPU Nodes

MrBayes on a Graphics Processing Unit

MrBayes tgMC3: A Tight GPU Implementation of MrBayes

MRCUDA: MapReduce Acceleration Framework Based on GPU

MRPB: Memory Request Prioritization for Massively Parallel Processors

MSA-CUDA: Multiple Sequence Alignment on Graphics Processing Units with CUDA

MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications

Titles: 100
open PDFs: 90
packages: 14
