1173

Papers on hgpu.org (.txt-file)

maxDNN: An Efficient Convolution Kernel for Deep Learning with Maxwell GPUs Download Package

Maximal Information Coefficient Analysis Download

Maximize Performance on GPUs Using the Rake-based Optimization: A Case Study Download

Maximizing Parallelism and GPU Utilization For Direct GPU Compilation Through Ensemble Execution Download

Maximum likelihood event estimation and list-mode image reconstruction on GPU hardware Download

Maximum mipmaps for fast, accurate, and scalable dynamic height field rendering Download

MaxSSmap: A GPU program for short read mapping with the maximum scoring subsequence Download

MC-RANSAC: A Pre-processing Model for RANSAC using Monte Carlo method implemented on a GPU Download

MCBooster: a library for fast Monte Carlo generation of phase-space decays on massively parallel platforms Download Package

MCMini: Monte Carlo on GPGPU Download

MCS 572: Introduction to Supercomputing Download Package

MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs Download Package

MCUDA: An Efficient Implementation of CUDA Kernels on Multi-cores Download Package

md_poly: A Performance-Portable Polyhedral Compiler Based on Multi-Dimensional Homomorphisms Download Package

MDLab: A molecular dynamics simulation prototyping environment Download Package

MDR: performance model driven runtime for heterogeneous parallel platforms

Mean shift for graph bundling Download

Mean Shift Parallel Tracking on GPU

Measurement and Analysis of GPU-accelerated Applications with HPCToolkit Download Package

Measurements of performance of hardware and general purpose classical molecular dynamics simulation software Download

Measuring Bandwidth for Super Computer Workloads Download

Measuring the evolving Internet ecosystem with exchange points Download

Measuring the Impact of Configuration Parameters in CUDA Through Benchmarking Download

Measuring the Performance of Realtime DSP Using Pure Data and GPU Download

Mechanical Characterization and Performance Optimization for GPU Fan-Sink Cooling Module Assembly

Median Based Parallel Steering Kernel Regression for Image Reconstruction Download

Medical Image Registration using OpenCL Download

Medical imaging using CUDA Download

MEDINA: MECCA Development in Accelerators – KPP Fortran to CUDA source-to-source Preprocessor Download Package

Medium-Grained Functions Mapping using Modern GPUs Download

Medusa: A Parallel Graph Processing System on Graphics Processors Download Package

Medusa: Simplified Graph Processing on GPUs Download Package

Mega-KV: A Case for GPUs to Maximize the Throughput of In-Memory Key-Value Stores Download Package

MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph Download Package

Megakernels Considered Harmful: Wavefront Path Tracing on GPUs Download

Megapixel Topology Optimization on a Graphics Processing Unit

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism Download Package

Melia: A MapReduce Framework on OpenCL-based FPGAs Download Package

MELT-a Translated Domain Specific Language Embedded in the GCC Compiler Download

MemcachedGPU: Scaling-up Scale-out Key-value Stores Download Package

Memory Access Optimized Implementation of Cyclic and Quasi-Cyclic LDPC Codes on a GPGPU

Memory Bandwidth and Latency in HPC: System Requirements and Performance Impact Download

Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes Download

Memory Efficient Mixed-Precision Optimizers Download

Memory Interference and Performance Prediction in GPU-Accelerated Heterogeneous Systems Download

Memory layout in GPU implementation of lattice Boltzmann method for sparse 3D geometries Download

Memory Optimization for Deep Networks Download Package

Memory Saving Discrete Fourier Transform on GPUs Download

Memory transfer optimization for a lattice Boltzmann solver on Kepler architecture nVidia GPUs Download

Memory-efficient Adaptive Subdivision for Software Rendering on the GPU Download Package

Memory-efficient implementation of a graphics processor-based cluster detection algorithm for large spatial databases

Memory-Efficient Implementation of DenseNets Download Package

Memory-Efficient Object-Oriented Programming on GPUs Download Package

Memory-Efficient Single-Pass GPU Rendering of Multi-fragment Effects Download

Memory-level and Thread-level Parallelism Aware GPU Architecture Performance Analytical Model Download

Memory-Scalable GPU Spatial Hierarchy Construction Download

Merge or Separate? Multi-job Scheduling for OpenCL Kernels on CPU/GPU Platforms Download

Merge: a programming model for heterogeneous multi-core systems Download

Mersenne Twister Random Number Generation on FPGA, CPU and GPU Download

Mesh deformations in X3D via CUDA with freeform deformation lattices Download

Mesh Independent Loop Fusion for Unstructured Mesh Applications Download

Mesh mutation in programmable graphics hardware Download

Meshfree/GFEM in hardware-efficiency prospective Download

Message passing for GPGPU clusters: CudaMPI Download Package

Message Passing Interface support for the runtime adaptive multi-processor system-on-chip RAMPSoC

Message passing on data-parallel architectures Download

Meta Networks for Neural Style Transfer Download Package

Meta-Programming and Auto-Tuning in the Search for High Performance GPU Code Download

Meta-programming and Multi-stage Programming for GPGPUs Download

Meta-simulation of large WSN on multi-core computers Download

MetaBinG: Using GPUs to Accelerate Metagenomic Sequence Classification Download Package

MetaCL – A Model-Based Approach to Programming Heterogeneous Architectures Using OpenCL Download

MetaFork: A Compilation Framework for Concurrency Models Targeting Hardware Accelerators and Its Application to the Generation of Parametric CUDA Kernels Download Package

MetaMorph: A Library Framework for Interoperable Kernels on Multi- and Many-core Clusters Download

Metamorphic Testing for (Graphics) Compilers Download

Metaprogramming GPUs with Sh Download

Method for simulation of coastal terrain on GPU

Methodology of control and supervision of web connected mobile robots with CUDA technology application Download

Methods and Metrics for Fair Server Assessment under Real-Time Financial Workloads Download

Methods for Accelerating Machine Learning in High Performance Computing Download

Methods for GPU Acceleration of Big Data Applications Download

Methods for Optimizing OpenCL Applications on Heterogeneous Multicore Architectures Download

MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring Download Package

MGPUSim: Enabling Multi-GPU Performance Modeling and Optimization Download Package

MIC-SVM: Designing A Highly Efficient Support Vector Machine For Advanced Modern Multi-Core and Many-Core Architectures Download

MICA: A fast short-read aligner that takes full advantage of Intel Many Integrated Core Architecture (MIC) Download Package

Microarchitectural Performance Characterization of Irregular GPU Kernels Download

Microbenchmarks for GPU characteristics: the occupancy roofline and the pipeline model Download Package

Microbranching in mode-I fracture using large scale simulations of amorphous and perturbed lattice models Download

Microlensing Observations Rapid Search for Exoplanets: MORSE code for GPUs Download

Micropolygon ray tracing with defocus and motion blur

MIDeA: a multi-parallel intrusion detection architecture Download

Migrating CUDA to oneAPI: A Smith-Waterman Case Study Download

Migrating from OpenGL ES to Vulkan Download

Migrating real-time depth image-based rendering from traditional to next-gen GPGPU

MILC Code Performance on High End CPU and GPU Supercomputer Clusters Download

MILC on GPUs Download

MILC staggered conjugate gradient performance on Intel KNL Download

MILJS: Brand New JavaScript Libraries for Matrix Calculation and Machine Learning Download Package

MiMatrix: A Massively Distributed Deep Learning Framework on a Petascale High-density Heterogeneous Cluster Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 90

Package packages: 29

Recent source codes

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org