Papers on hgpu.org (.txt-file)
MinGPU: a minimum GPU library for computer vision

miniLB: A Performance Portability Study of Lattice-Boltzmann Simulations

Minimal models for finite particles in fluctuating hydrodynamics

minimap2-fpga: Integrating hardware-accelerated chaining for efficient end-to-end long-read sequence mapping

Minimising Testing in Genetic Programming

Mining Rare Features in Fingerprints Using Core Points and Triplet-based Features

Mint: realizing CUDA performance in 3D stencil methods with annotated C

Minuet: Accelerating 3D Sparse Convolutions on GPUs

MIOpen: An Open Source Library For Deep Learning Primitives

Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU

Mirovia: A Benchmarking Suite for Modern Heterogeneous Computing

MITHRA: Multiple data independent tasks on a heterogeneous resource architecture

Mix-and-Match: A Model-driven Runtime Optimisation Strategy for BFS on GPUs

Mixed precision in Graphics Processing Unit

Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems

Mixed Precision Solver Scalable to 16000 MPI Processes for Lattice Quantum Chromodynamics Simulations on the Oakforest-PACS System

Mixed-Precision Embedding Using a Cache

Mixed-precision finite element kernels and assembly: Rounding error analysis and hardware acceleration

Mixed-Precision GPU-Multigrid Solvers with Strong Smoothers

Mixed-precision Orthogonalization Scheme and Adaptive Step Size for CA-GMRES on GPUs

Mixed-precision orthogonalization scheme and its case studies with CA-GMRES on a GPU

Mixed-Resolution Patch-Matching

Mixed-Tool Performance Analysis on Hybrid Multicore Architectures

Mixing Low-Precision Formats in Multiply-Accumulate Units for DNN Training

Mixing Multi-Core CPUs and GPUs for Scientific Simulation Software

MKPipe: A Compiler Framework for Optimizing Multi-Kernel Workloads in OpenCL for FPGA

ML-Triton, A Multi-Level Compilation and Language Extension to Triton GPU Programming

MLitB: Machine Learning in the Browser

MLS-based scalar fields over triangle meshes and their application in mesh processing
MNN: A Universal and Efficient Inference Engine

Mobile GPGPU Acceleration of Embodied Robot Simulation

Mobile GPU Computing Based Filter Bank Convolution for Three-dimensional Wavelet Transform

MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU

MobiRT: an implementation of OpenGL ES-based CPU-GPU hybrid ray tracer for mobile devices

Model Coupling between the Weather Research and Forecasting Model and the DPRI Large Eddy Simulator for Urban Flows on GPU-accelerated Multicore Systems

Model-Based 3D Object Tracking Using an Extended-Extended Kalman Filter and Graphics Rendered Measurements

Model-based optimization of MPDATA on Intel Xeon Phi through load imbalancing

Model-Based Warp-Level Tiling for Image Processing Programs on GPUs

Model-driven autotuning of sparse matrix-vector multiply on GPUs

Model-driven optimisation of memory hierarchy and multithreading on GPUs

Model-Driven Tile Size Selection for DOACROSS Loops on GPUs

Model-independent partial wave analysis using a massively-parallel fitting framework

Model-T: Rethinking the OS for terabit speeds

Modeling and Evaluation of Synchronous Stochastic Gradient Descent in Distributed Deep Learning on Multiple GPUs

Modeling and generating complex motion blur for real-time tracking

Modeling and Optimization of Parallel Matrix-based Computations on GPU

Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures

Modeling Deep Learning Accelerator Enabled GPUs

Modeling GPU Dynamic Parallelism for Self Similar Density Workloads

Modeling GPU-CPU Workloads and Systems

Modeling Image Patches with a Generic Dictionary of Mini-Epitomes

Modeling of Heat Diffusion Through Isotropic Media Using Graphical Processing Units

Modeling of Heterogeneous Architecture with GPU to Exascale System

Modeling of High Performance Programs to Support Heterogeneous Computing

Modeling of the behavior of 222 Rn progeny in diffusion chamber using CUDA

Modeling of tsunami waves and atmospheric swirling flows with graphics processing unit

Modeling Parallel Programs for Heterogeneous Computing

Modeling Parallel Programs using Large Language Models

Modeling Rotor Wakes with a Hybrid OVERFLOW-Vortex Method on a GPU Cluster

Modeling system for GPU parallel tasks performance simulation

Modeling the propagation of elastic waves using spectral elements on a cluster of 192 GPUs

Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices

Modeling the spatio-temporal evolution of fracture networks and fluid-rock interactions in GPU: Applications to lithospheric geodynamics

Modelling sea water intrusion in coastal aquifers using heterogeneous computing

Modelling the Formation of Ordered Acentrosomal Microtubule Arrays

Modelling, simulating and visualising the Cahn-Hilliard-Cook field equation

Modern GPGPU Frameworks and their Application to the Physical Core of the ASUCA Weather Prediction Model

Modern GPU-Based Forward-Projection Algorithm with a New Sampling Method
Modern Gyrokinetic Particle-In-Cell Simulation of Fusion Plasmas on Top Supercomputers

Modern Platform for Parallel Algorithms Testing: Java on Intel Xeon Phi

Modernization and Optimization of MPI Codes

Modernizing the core quantum chemistry algorithms

MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures

Modification of self-organizing migration algorithm for OpenCL framework

Modified Bloom filter for high performance hybrid NoSQL systems

Modified Levels of Parallel Odd-Even Transposition Sorting Network (OETSN) with GPU Computing using CUDA

Modular & Scalable Ultrasound Platform with GPU Processing

Modular Arithmetic for Solving Linear Equations on the GPU

Modular FPGA Systems with Support for Dynamic Workloads and Virtualisation

Modular Resultant Algorithm for Graphics Processors
Modular Technology in the Modelling of Large Virtual Environments in Driving Simulators
Moim: A Multi-GPU MapReduce Framework

Mojo: MLIR-Based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem

Molecular Activity Prediction using Deep Learning Software Library

Molecular Distance Geometry Optimization Using Geometric Build-up and Evolutionary Techniques on GPU

Molecular Docking on FPGA and GPU Platforms

Molecular dynamics for long-range interacting systems on Graphic Processing Units

Molecular Dynamics on a Grand Scale

Molecular dynamics recipes for genome research

Molecular Dynamics Simulation Based on Hadoop MapReduce

Molecular dynamics simulation of complex multiphase flow on a computer cluster with GPUs

Molecular Dynamics Simulation of Macromolecules Using Graphics Processing Unit

Molecular Dynamics Simulation of Multi-Scale Flows on GPUs

Molecular dynamics simulation of the supercooled Al melt on GPUs

Molecular dynamics simulation of UO2 nanocrystals melting

Molecular dynamics simulations of the relaxation processes in the condensed matter on GPUs
Molecular Dynamics Simulations on Commodity GPUs with CUDA

Molecular dynamics simulations through GPU video games technologies

Titles: 100
open PDFs: 95
packages: 22
