1173

Papers on hgpu.org (.txt-file)

Massively parallel Monte Carlo for many-particle simulations on GPUs Download

Massively Parallel Network Coding on GPUs Download

Massively Parallel Neural Encoding and Decoding of Visual Stimuli Download

Massively Parallel Ray Tracing Algorithm Using GPU Download

Massively parallel read mapping on GPUs with PEANUT Download Package

Massively parallel read mapping on GPUs with the q-group index and PEANUT Download

Massively Parallel Sequential Monte Carlo for Bayesian Inference Download

Massively parallel simulations of relativistic fluid dynamics on graphics processing units with CUDA Download

Massively Parallel Suffix Array Queries and On-Demand Phrase Extraction for Statistical Machine Translation Using GPUs Download

Massively parallel two-dimensional TLM algorithm on graphics processing units Download

Massively parallelizable list-mode reconstruction using a Monte Carlo-based elliptical Gaussian model Download

Massively Parallelized Monte Carlo Simulation and its Applications in Finance Download

Massively parallelized replica-exchange simulations of polymers on GPUs Download

Massively-Parallel Lossless Data Decompression Download

Mastering Atari with Discrete World Models Download Package

Mastering Software Variant Explosion for GPU Accelerators Download Package

Matched Filter Computation on FPGA, Cell and GPU

MatConvNet – Convolutional Neural Networks for MATLAB Download

Material Removal Simulation and Cutting Force Prediction of Multi-Axis Machining Processes on General-Purpose Graphics Processing Units Download

Mathematical limits of parallel computation for embedded systems Download

MATLAB and Python for GPU Computing Download

MATLAB graphical interface for GPU based FDTD method

MATLAB Medical Images Classification on Graphics Processors Download

MATLAB Parallelization through Scalarization Download

Matrix Computations and Optimization in Apache Spark Download Package

Matrix Convolution using Parallel Programming Download

Matrix Factorization on GPUs with Memory Optimization and Approximate Computing Download Package

Matrix inversion speed up with CUDA Download

Matrix Multiplication Beyond Auto-Tuning: Rewrite-based GPU Code Generation Download

Matrix Multiplication on GPUs with On-Line Fault Tolerance

Matrix Multiplication Using Only Addition Download

Matrix Multiplication with CUDA – A basic introduction to the CUDA programming model Download

Matrix-free GPU implementation of a preconditioned conjugate gradient solver for anisotropic elliptic PDEs Download

Matrix-Matrix Multiplications on GPUs for Accelerating a Parallel Fluid Dynamics Code Download

maxDNN: An Efficient Convolution Kernel for Deep Learning with Maxwell GPUs Download Package

Maximal Information Coefficient Analysis Download

Maximize Performance on GPUs Using the Rake-based Optimization: A Case Study Download

Maximizing Parallelism and GPU Utilization For Direct GPU Compilation Through Ensemble Execution Download

Maximum likelihood event estimation and list-mode image reconstruction on GPU hardware Download

Maximum mipmaps for fast, accurate, and scalable dynamic height field rendering Download

MaxSSmap: A GPU program for short read mapping with the maximum scoring subsequence Download

MC-RANSAC: A Pre-processing Model for RANSAC using Monte Carlo method implemented on a GPU Download

MCBooster: a library for fast Monte Carlo generation of phase-space decays on massively parallel platforms Download Package

MCMini: Monte Carlo on GPGPU Download

MCS 572: Introduction to Supercomputing Download Package

MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs Download Package

MCUDA: An Efficient Implementation of CUDA Kernels on Multi-cores Download Package

md_poly: A Performance-Portable Polyhedral Compiler Based on Multi-Dimensional Homomorphisms Download Package

MDLab: A molecular dynamics simulation prototyping environment Download Package

MDR: performance model driven runtime for heterogeneous parallel platforms

Mean shift for graph bundling Download

Mean Shift Parallel Tracking on GPU

Measurement and Analysis of GPU-accelerated Applications with HPCToolkit Download Package

Measurements of performance of hardware and general purpose classical molecular dynamics simulation software Download

Measuring Bandwidth for Super Computer Workloads Download

Measuring the evolving Internet ecosystem with exchange points Download

Measuring the Impact of Configuration Parameters in CUDA Through Benchmarking Download

Measuring the Performance of Realtime DSP Using Pure Data and GPU Download

Mechanical Characterization and Performance Optimization for GPU Fan-Sink Cooling Module Assembly

Median Based Parallel Steering Kernel Regression for Image Reconstruction Download

Medical Image Registration using OpenCL Download

Medical imaging using CUDA Download

MEDINA: MECCA Development in Accelerators – KPP Fortran to CUDA source-to-source Preprocessor Download Package

Medium-Grained Functions Mapping using Modern GPUs Download

Medusa: A Parallel Graph Processing System on Graphics Processors Download Package

Medusa: Simplified Graph Processing on GPUs Download Package

Mega-KV: A Case for GPUs to Maximize the Throughput of In-Memory Key-Value Stores Download Package

MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph Download Package

Megakernels Considered Harmful: Wavefront Path Tracing on GPUs Download

Megapixel Topology Optimization on a Graphics Processing Unit

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism Download Package

Melia: A MapReduce Framework on OpenCL-based FPGAs Download Package

MELT-a Translated Domain Specific Language Embedded in the GCC Compiler Download

MemcachedGPU: Scaling-up Scale-out Key-value Stores Download Package

Memory Access Optimized Implementation of Cyclic and Quasi-Cyclic LDPC Codes on a GPGPU

Memory Bandwidth and Latency in HPC: System Requirements and Performance Impact Download

Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes Download

Memory Efficient Mixed-Precision Optimizers Download

Memory layout in GPU implementation of lattice Boltzmann method for sparse 3D geometries Download

Memory Optimization for Deep Networks Download Package

Memory Saving Discrete Fourier Transform on GPUs Download

Memory transfer optimization for a lattice Boltzmann solver on Kepler architecture nVidia GPUs Download

Memory-efficient Adaptive Subdivision for Software Rendering on the GPU Download Package

Memory-efficient implementation of a graphics processor-based cluster detection algorithm for large spatial databases

Memory-Efficient Implementation of DenseNets Download Package

Memory-Efficient Object-Oriented Programming on GPUs Download Package

Memory-Efficient Single-Pass GPU Rendering of Multi-fragment Effects Download

Memory-level and Thread-level Parallelism Aware GPU Architecture Performance Analytical Model Download

Memory-Scalable GPU Spatial Hierarchy Construction Download

Merge or Separate? Multi-job Scheduling for OpenCL Kernels on CPU/GPU Platforms Download

Merge: a programming model for heterogeneous multi-core systems Download

Mersenne Twister Random Number Generation on FPGA, CPU and GPU Download

Mesh deformations in X3D via CUDA with freeform deformation lattices Download

Mesh Independent Loop Fusion for Unstructured Mesh Applications Download

Mesh mutation in programmable graphics hardware Download

Meshfree/GFEM in hardware-efficiency prospective Download

Message passing for GPGPU clusters: CudaMPI Download Package

Message Passing Interface support for the runtime adaptive multi-processor system-on-chip RAMPSoC

Message passing on data-parallel architectures Download

Meta Networks for Neural Style Transfer Download Package

 

Brief statistics for this page

Titles: 100

Download open PDFs: 90

Package packages: 27

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: