1173

Papers on hgpu.org (.txt-file)

Live Migration for OpenCL FPGA Accelerators Download

Live Migration of FPGA Applications Download

Live, Video-Rate Super-Resolution Microscopy Using Structured Illumination and Rapid GPU-Based Parallel Processing

Living Flows: Enhanced Exploration of Edge-Bundled Graphs Based on GPU-Intensive Edge Rendering Download

LLload: An Easy-to-Use HPC Utilization Tool Download

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators Download Package

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale Download Package

LLVM to PTX Backend Download

LLVM-based automation of memory decoupling for OpenCL applications on FPGAs Download Package

LN-Annote: An Alternative Approach to Information Extraction from Emails using Locally-Customized Named-Entity Recognition Download

LNA: Fast Protein Classification Using A Laplacian Characterization of Tertiary Structure Download

LO-SpMM: Low-cost Search for High-performance SpMM Kernels on GPUs Download

Load Balanced Parallel GPU Out-of-Core for Continuous LOD Model Visualization Download

Load Balancing for Constraint Solving with GPUs Download

Load Balancing in a Changing World: Dealing with Heterogeneity and Performance Variability Download

Load Balancing in Data Warehouse – Evolution and Perspectives Download

Load Balancing Utilizing Data Redundancy in Distributed Volume Rendering Download

Load Balancing versus Occupancy Maximization on Graphics Processing Units: The Generalized Hough Transform as a Case Study

Load-Balanced Multi-GPU Ambient Occlusion for Direct Volume Rendering Download

Local Alignment Tool Based on Hadoop Framework and GPU Architecture Download

Local Histogram Modification Based Contrast Enhancement with GPU Acceleration Download

Local Laplacian Filters: Edge-aware Image Processing with a Laplacian Pyramid Download Package

Local Search Algorithms on Graphics Processing Units. A Case Study: The Permutation Perceptron Problem Download

Local Volatility FX Basket Option on CPU and GPU Download

Local vs. Global Optimization: Operator Placement Strategies in Heterogeneous Environments Download

Locality Analysis for Characterizing Applications Based on Sparse Matrices Download

Locality and parallelism optimization for dynamic programming algorithm in bioinformatics Download

Locality Aware Work-Stealing Based Scheduling in Hybrid CPU-GPU Download

Locality optimization on a NUMA architecture for hybrid LU factorization Download

Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives Download

Locality-Aware Mapping of Nested Parallel Patterns on GPUs Download

Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model Download

Locality-Aware Work Stealing on Multi-CPU and Multi-GPU Architectures Download Package

LocalityGuru: A PTX Analyzer for Extracting Thread Block-level Locality in GPGPUs Download

Locally-Oriented Programming: A Simple Programming Model for Stencil-Based Computations on Multi-Level Distributed Memory Architectures Download

Location-based Matching in Publish/Subscribe Revisited Download

LOD Terrain Rendering by Local Parallel Processing on GPU Download

Log File Regular Expression Pattern Matching And Capture With GPUs Download Package

LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment Download Package

LoGV: Low-overhead GPGPU Virtualization Download

Long Code for Code Search Download

Long time-scale simulations of in vivo diffusion using GPU hardware Download

Long Timestep Molecular Dynamics on the Graphical Processing Unit Download Package

Long-time Simulations with Complex Code Using Multiple Nodes of Intel Xeon Phi Knights Landing Download

Loo.py: From Fortran to performance via transformation and substitution rules Download Package

Loo.py: transformation-based code generation for GPUs and CPUs Download Package

Looking at the surprise: Bottom-up attentional control of an active camera system Download

LookNN: Neural Network with No Multiplication Download

Loop Perforation in OpenACC Download Package

Loop Transformation Recipes for Code Generation and Auto-Tuning Download

LoopBench: An Evaluation of Loop Acceleration in Heterogeneous Systems Download Package

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers Download Package

Loose capacity-constrained representatives for the qualitative visual analysis in molecular dynamics Download

Lossless Acceleration for Seq2seq Generation with Aggressive Decoding Download Package

Lossless Compression of Variable-Precision Floating-Point Buffers on GPUs Download

Lossless data compression on GPGPU architectures Download

Lossless LZW Data Compression Algorithm on CUDA Download

Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level Download

Lost in Translation: Challenges in Automating CUDA-to-OpenCL Translation Download

Low Complexity Corner Detector Using CUDA for Multimedia Applications Download

Low cost approach to real-time vehicle to vehicle communication using parallel CPU and GPU processing Download

Low cost, high performance GPU computing solution for atomic resolution cryoEM single-particle reconstruction

Low Latency Complex Event Processing on Parallel Hardware Download

Low latency photon mapping using block hashing Download

Low viscosity flow simulations for animation Download

Low-complexity Distributed Tomographic Backprojection for large datasets Download Package

Low-cost, high-speed computer vision using NVIDIA’s CUDA architecture Download

Low-Frequency MLFMA on Graphics Processors Download

Low-Impact Profiling of Streaming, Heterogeneous Applications Download

Low-Latency Elliptic Curve Scalar Multiplication Download

Low-latency Image Recognition with GPU-accelerated Convolutional Networks for Web-based Services Download

Low-overhead diskless checkpoint for hybrid computing systems Download

Low-Overhead Trace Collection and Profiling on GPU Compute Kernels Download Package

Low-power System-on-Chip Processors for Energy Efficient High Performance Computing: The Texas Instruments Keystone II Download

Low-power Task Scheduling for GPU Energy Reduction Download

Lowering IrGL to CUDA Download

LS-CAT: A Large-Scale CUDA AutoTuning Dataset Download

LTE Physical Layer Implementation Using GPU Based High Performance Computing Download

LTTng CLUST: A system-wide unified CPU and GPU tracing tool for OpenCL applications Download

LU Factorization for Accelerator-based Systems Download

LU Factorization with Partial Pivoting for a Multi-CPU, Multi-GPU Shared Memory System Download

LU Factorization with Partial Pivoting for a Multicore System with Accelerators Download

LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware Download

LU, QR, and Cholesky factorizations: Programming Model, Performance Analysis and Optimization Techniques for the Intel Knights Landing Xeon Phi Download

LUDA: Boost LSM Key Value Store Compactions with GPUs Download

Lynx: A Dynamic Instrumentation System for Data-Parallel Applications on GPGPU Architectures Download Package

Lyra2: Password Hashing Scheme with improved security against time-memory trade-offs Download

MACC: An OpenACC Transpiler for Automatic Multi-GPU Use Download

Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey Download

Machine Learning at the Limit Download Package

Machine Learning Based Auto-tuning for Enhanced OpenCL Performance Portability Download

Machine Learning Based Intrusion Detection in Controller Area Networks Download Package

Machine learning enhanced code optimization for high-level synthesis (ML-ECOHS) Download Package

Machine Learning for CUDA+MPI Design Rules Download

Machine Learning for Predictive Auto-Tuning with Boosted Regression Trees Download

Machine learning for ultrafast X-ray diffraction patterns on large-scale GPU clusters Download

Machine Learning from Streaming Data in Heterogeneous Computing Environments Download

Machine Learning in Compilers: Past, Present and Future Download

Machine Learning-Driven Adaptive OpenMP For Portable Performance on Heterogeneous Systems Download

Machine-Learning-based Performance Heuristics for Runtime CPU/GPU Selection Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 97

Package packages: 20

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: