Papers on hgpu.org (.txt-file)

Implementing Molecular Dynamics on Hybrid High Performance Computers – Particle-Particle Particle-Mesh Download Package

Implementing molecular dynamics on hybrid high performance computers – short range forces

Implementing Molecular Dynamics on Hybrid High Performance Computers – Three-Body Potentials Download Package

Implementing Neural Networks Efficiently Download

Implementing Open-Source CUDA Runtime Download Package

Implementing Parallel SMO to Train SVM on CUDA-Enabled Systems

Implementing Push-Pull Efficiently in GraphBLAS Download

Implementing QR Factorization Updating Algorithms on GPUs Download

Implementing sparse matrix-vector multiplication on throughput-oriented processors Download Package

Implementing Sparse Matrix-Vector multiplication using CUDA based on a hybrid sparse matrix format

Implementing Sparse Matrix-Vector Multiplication with QCSR on GPU Download

Implementing Stereo Vision of GPU-Accelerated Scientific Simulations using Commodity Hardware Download

Implementing Strassen’s Algorithm with CUTLASS on NVIDIA Volta GPUs Download Package

Implementing the Approximate Message Passing (AMP) Algorithm on a GPU

Implementing the Himeno benchmark with CUDA on GPU clusters

Implementing the PGI Accelerator model Download

Implementing the Projected Spatial Rich Features on a GPU Download

Implementing Ultrasound Beamforming on the GPU using CUDA Download

Implications of the Turing completeness of reaction-diffusion models, informed by GPGPU simulations on an XBox 360: cardiac arrhythmias, re-entry and the Halting problem Download

Implicit Adaptive Volume Ray Casting Download

Implicit and dynamic trees for high performance rendering Download

Implicit Boundary Control of Vector Field Based Shape Deformations Download

Implicit Feature-Based Alignment System for Radiotherapy

Implicit Methods for Real-Time simulation of Interactive Waves Download

Implicit Parallel Time Integrators Download

Implicit Skinning: Real-Time Skin Deformation with Contact Modeling Download

Importance of Data Loading Pipeline in Training Deep Neural Networks Download Package

Importance of Explicit Vectorization for CPU and GPU Software Performance Download

Importance Point Projection for GPU-based Final Gathering Download

Importance sampling algorithms for first passage time probabilities in the infinite server queue Download

Importance Sampling of Realistic Light Sources Download

Importance-driven compositing window management Download

Importance-Driven Isosurface Decimation for Visualization of Large Simulation Data Based on OpenCL Download

Importance-Driven Particle Techniques for Flow Visualization Download

Impostors and pseudo-instancing for GPU crowd rendering Download

Impostors, Pseudo-instancing and Image Maps for GPU Crowd Rendering Download

Improved automated lattice perturbation theory in background field gauge Download

Improved Distance Weighted GPU-based 3D Ultrasound Reconstruction Methods Download

Improved FCM algorithm for Clustering on Web Usage Mining Download

Improved Finite Difference Schemes for a 3-D Viscothermal Wave Equation on a GPU Download

Improved GPU Co-processor Sorting Algorithm with Barrier Synchronization Download

Improved Implementation of Simulation for Membrane Computing on the Graphic Processing Unit Download

Improved Integral Histogram Algorithm for Big Sized Images in CUDA Environment Download

Improved Lossless Image Compression Model Using Coefficient Based Discrete Wavelet Transform Download

Improved OpenCL-based Implementation of Social Field Pedestrian Model Download

Improved Performance of CaFE and IRIS Model Fitting Using CUDA Download

Improved Poisson Matting for a Real Time Tele-presence System Using GPU

Improved Programming of GPU Architectures through Automated Data Allocation and Loop Restructuring

Improved Real-Time Stereo on Commodity Graphics Hardware Download

Improved Row-Grouped CSR Format for Storing of Sparse Matrices on GPU Download

Improved Sequential & Parallel Designs and Implementations of the Eight Direction Prewitt Edge Detection Download

Improvement of the fused CUDA kernels performance prediction Download

Improvement Study of EEMD Decomposition Efficiency Based on CUDA Architecture Download

Improvements to Physically Based Cloth Simulation Download

Improving 3D Lattice Boltzmann Method stencil with asynchronous transfers on many-core processors Download

Improving accuracy for matrix multiplications on GPUs

Improving Atmospheric Model Performance on a Multi-Core Cluster System Download

Improving Automatic Parallel Training via Balanced Memory Workload Optimization Download

Improving Cache Locality for GPU-based Volume Rendering Download

Improving Cache Locality for Ray Casting with CUDA Download

Improving Communication Performance and Scalability of Native Applications on Intel Xeon Phi Coprocessor Clusters Download

Improving Communication Performance in GPU-Accelerated HPC Clusters Download Package

Improving CUDA DNA Analysis Software with Genetic Programming Download Package

Improving CUDASW++, a Parallelization of Smith-Waterman for CUDA Enabled Devices Download

Improving energy and power efficiency using NComputing and approaches for predicting reliability of complex computing systems Download

Improving Energy Efficiency of Basic Linear Algebra Routines on Heterogeneous Systems with Multiple GPUs Download

Improving Energy Efficiency of GPU based General-Purpose Scientific Computing through Automated Selection of Near Optimal Configurations Download

Improving GPGPU Concurrency with Elastic Kernels Download

Improving GPU particle filter with shader model 3.0 for visual tracking Download

Improving GPU Performance by Regrouping CPU-Memory Data Download

Improving GPU Performance Prediction with Data Transfer Modeling Download

Improving GPU Performance through Instruction Redistribution and Diversification Download

Improving GPU Performance via Large Warps and Two-Level Warp Scheduling Download

Improving GPU Performance: Reducing Memory Conflicts and Latency Download

Improving GPU programming models through hardware cache coherence Download

Improving GPU Robustness by Making Use of Faulty Parts Download

Improving GPU Simulations of Spiking Neural P Systems Download

Improving GPU Sparse Matrix-Vector Multiplication for Probabilistic Model Checking Download Package

Improving GPU-accelerated Adaptive IDW Interpolation Algorithm Using Fast kNN Search Download

Improving Hybrid OpenCL Performance by High Speed Networks

Improving Locality of Unstructured Mesh Algorithms on GPUs Download

Improving Loop Parallelization by a Combination of Static and Dynamic Analyses in HLS Download Package

Improving many flavor QCD simulations using multiple GPUs Download

Improving Numerical Accuracy for Non-Negative Matrix Multiplication on GPUs using Recursive Algorithms Download

Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs Download

Improving OpenACC compatibility within accULL Download

Improving OpenCL Performance by Specializing Compiler Phase Selection and Ordering Download

Improving OpenCL Programmability with the Heterogeneous Programming Library Download Package

Improving Parallel Program Performance Through DSL-Driven Code Generation with LLM Optimizers Download

Improving Performance and Energy Consumption of Runtime Schedulers for Dense Linear Algebra Download

Improving Performance and Energy Efficiency of GPUs through Locality Analysis Download

Improving Performance and Energy Efficiency of Heterogeneous Systems with rCUDA Download

Improving performance for emergent environments parameter tuning and simulation in games using GPU

Improving Performance of Hardware Accelerators by Optimizing Data Movement: A Bioinformatics Case Study Download Package

Improving Performance of Iterative Applications through Interleaved Execution of Approximated CUDA Kernels Download

Improving Performance of Matrix Multiplication and FFT on GPU Download

Improving Performance of OpenCL on CPUs Download

Improving performance of SYCL applications on CPU architectures using LLVM-directed compilation flow Download

Improving performance portability for GPU-specific OpenCL kernels on multi-core/many-core CPUs by analysis-based transformations Download

Improving Performance Portability in OpenCL Programs Download

 

Brief statistics for this page

Titles: 100

Open PDFs available for download: 89

Downloadable packages: 12

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
