1173

Papers on hgpu.org (.txt-file)

Least Squares on GPUs in Multiple Double Precision Download Package

Lectures on Parallel Computing Download

LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations Download Package

LeFlow: Enabling Flexible FPGA High-Level Synthesis of Tensorflow Deep Neural Networks Download Package

LeftoverLocals: Listening to LLM Responses Through Leaked GPU Local Memory Download Package

Legion: Programming Distributed Heterogeneous Architectures with Logical Regions Download Package

Legolizer: A Real-Time System for Modeling and Rendering LEGO Representations of Boundary Models Download

Lensed: a code for the forward reconstruction of lenses and sources from strong lensing observations Download Package

Leo: A Profile-Driven Dynamic Optimization Framework for GPU Applications Download

Lessons learned from contrasting a BLAS kernel implementations Download

Lessons learned in a decade of research software engineering GPU applications Download

Lessons Learned Migrating CUDA to SYCL: A HEP Case Study with ROOT RDataFrame Download Package

Let’s sort this out: GPGPU Verification of Radix Sort Download Package

Lettuce: PyTorch-based Lattice Boltzmann Framework Download Package

Level Sets and Voronoi based Feature Extraction from any Imagery Download

Level-of-Detail Triangle Strips for Deforming Meshes Download

Leveraging Binary Translation for Heterogeneous Profiling Download

Leveraging Computation Sharing and Parallel Processing in Location-Based Services

Leveraging Data-Flow Information for Efficient Scheduling of Task-Parallel Programs on Heterogeneous Systems Download

Leveraging LLVM OpenMP GPU Offload Optimizations for Kokkos Applications Download Package

Leveraging Memory Copy Overlap for Efficient Sparse Matrix-Vector Multiplication on GPUs Download

Leveraging on High-Performance Computing and Cloud Technologies in Digital Libraries: A Case Study Download

Leveraging Parallelism with CUDA and OpenCL Download

Leveraging the potential of task-based programming with OpenMP task graphs Download Package

Levy Flights for Particle Swarm Optimisation Algorithms on Graphical Processing Units Download

LeXInt: GPU-accelerated Exponential Integrators package Download Package

LHCb GPU acceleration project Download

libcloudph++ 0.1: single-moment bulk, double-moment bulk, and particle-based warm-rain microphysics library in C++ Download Package

libCudaOptimize: an Open Source Library of GPU-based Metaheuristics Download Package

libhclooc: Software Library Facilitating Out-of-core Implementations of Accelerator Kernels on Hybrid Computing Platforms Download Package

libmolgrid: GPU Accelerated Molecular Gridding for Deep Learning Applications Download Package

libWater: Heterogeneous Distributed Computing Made Easy Download

LIFT: LLM-Based Pragma Insertion for HLS via GNN Supervised Fine-Tuning Download

Light Loss-Less Data Compression, with GPU Implementation Download

Light propagation for mixed polygonal and volumetric data Download

Light Propagation Maps on Parallel Graphics Architectures Download

Lighting Details Preserving Photon Density Estimation Download

LightNet: A Versatile, Standalone Matlab-based Environment for Deep Learning Download Package

Lightning: Scaling the GPU Programming Model Beyond a Single GPU Download Package

LightPlay: Efficient Replay with GPUs Download

LightRNN: Memory and Computation-Efficient Recurrent Neural Networks Download

LightScan: Faster Scan Primitive on CUDA Compatible Manycore Processors Download Package

Lightweight bleeding and smoke effect for surgical simulators

Lightweight Modular Staging and Embedded Compilers: Abstraction Without Regret for High-Level High-Performance Programming Download

Lightweight modular staging: a pragmatic approach to runtime code generation and compiled DSLs Download

Lina: a fast design optimisation tool for software-based FPGA programming Download Package

linalg: Matrix Computations in Apache Spark Download Package

Line-art Illustration of Dynamic and Specular Surfaces Download

Linear Algebra Algorithms for Hybrid Architectures with XKaapi Download

Linear algebra operators for GPU implementation of numerical algorithms Download

Linear Feature Detection on GPUs

Linear genetic programming GPGPU on Microsoft’s Xbox 360 Download

Linear optimization on modern GPUs Download

Linear Performance-Breakdown Model: A Framework for GPU kernel programs performance analysis Download

Linear Solvers for Stable Fluids: GPU vs CPU Download

Linearised inversion with GPUs Download Package

Linpack evaluation on a supercomputer with heterogeneous accelerators Download

linus: Conveniently explore, share, and present large-scale biological trajectory data from a web browser Download Package

liquidSVM: A Fast and Versatile SVM package Download Package

List Mode PET reconstruction Download

Liszt: A Domain Specific Language for Building Portable Mesh-based PDE Solvers Download Package

Literature Review and Implementation Overview: High Performance Computing with Graphics Processing Units for Classroom and Research Use Download

Literature review: Build and Travel KD-Tree with CUDA Download

Literature Review: Parallel Computing on linear equations of linear elastic FEM stimulation with CUDA Download

LithOS: An Operating System for Efficient Machine Learning on GPUs Download

Live Migration for OpenCL FPGA Accelerators Download

Live Migration of FPGA Applications Download

Live, Video-Rate Super-Resolution Microscopy Using Structured Illumination and Rapid GPU-Based Parallel Processing

Living Flows: Enhanced Exploration of Edge-Bundled Graphs Based on GPU-Intensive Edge Rendering Download

LLload: An Easy-to-Use HPC Utilization Tool Download

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators Download Package

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale Download Package

LLMPerf: GPU Performance Modeling meets Large Language Models Download Package

LLOR: Automated Repair of OpenMP Programs Download Package

LLVM to PTX Backend Download

LLVM-based automation of memory decoupling for OpenCL applications on FPGAs Download Package

LN-Annote: An Alternative Approach to Information Extraction from Emails using Locally-Customized Named-Entity Recognition Download

LNA: Fast Protein Classification Using A Laplacian Characterization of Tertiary Structure Download

LO-SpMM: Low-cost Search for High-performance SpMM Kernels on GPUs Download

Load Balanced Parallel GPU Out-of-Core for Continuous LOD Model Visualization Download

Load Balancing for Constraint Solving with GPUs Download

Load Balancing in a Changing World: Dealing with Heterogeneity and Performance Variability Download

Load Balancing in Data Warehouse – Evolution and Perspectives Download

Load Balancing Utilizing Data Redundancy in Distributed Volume Rendering Download

Load Balancing versus Occupancy Maximization on Graphics Processing Units: The Generalized Hough Transform as a Case Study

Load-Balanced Multi-GPU Ambient Occlusion for Direct Volume Rendering Download

Local Alignment Tool Based on Hadoop Framework and GPU Architecture Download

Local Histogram Modification Based Contrast Enhancement with GPU Acceleration Download

Local Laplacian Filters: Edge-aware Image Processing with a Laplacian Pyramid Download Package

Local Search Algorithms on Graphics Processing Units. A Case Study: The Permutation Perceptron Problem Download

Local Volatility FX Basket Option on CPU and GPU Download

Local vs. Global Optimization: Operator Placement Strategies in Heterogeneous Environments Download

Locality Analysis for Characterizing Applications Based on Sparse Matrices Download

Locality and parallelism optimization for dynamic programming algorithm in bioinformatics Download

Locality Aware Work-Stealing Based Scheduling in Hybrid CPU-GPU Download

Locality optimization on a NUMA architecture for hybrid LU factorization Download

Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives Download

Locality-Aware Mapping of Nested Parallel Patterns on GPUs Download

Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model Download

Locality-Aware Work Stealing on Multi-CPU and Multi-GPU Architectures Download Package

 

Brief statistics for this page

Titles: 100

Download open PDFs: 95

Package packages: 32

Recent source codes

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org