Papers on hgpu.org

The 3D Flow Field Around an Embedded Planet Download

The Accelerated Universe

The accelerating implementation of BLAST with stream processor

The Accelerator Wall: Limits of Chip Specialization Download

The AES Implantation Based on OpenCL for Multi/many Core Architecture

The AGILE library for image reconstruction in biomedical sciences using graphics card hardware acceleration Download Package

The AI CUDA Engineer: Agentic CUDA Kernel Discovery, Optimization and Composition Download Package

The AlexNet Moment for Homomorphic Encryption: HCNN, the First Homomorphic CNN on Encrypted Data with GPUs Download

The Anatomy of High-Performance 2D Similarity Calculations Download Package

The ANTAREX Approach to Autotuning and Adaptivity for Energy Efficient HPC Systems Download

The ANTAREX Domain Specific Language for High Performance Computing Download Package

The Application of AI Technology in GPU Scheduling Algorithm Optimization Download

The Application of CUDA Architecture in Facial Expression Recognition

The application of GPU particle tracing to diffusion tensor field visualization Download

The Application Perspective: Seeking Productivity and Performance Download

The Arcane development framework

The Architecture and Evolution of CPU-GPU Systems for General Purpose Computing Download

The architecture of the DecentVM: towards a decentralized virtual machine for many-core computing Download

The Art of Balance: A RateupDB Experience of Building a CPU/GPU Hybrid Database Product Download

The Astrophysical Multipurpose Software Environment Download Package

The battle of the giants: a case study of GPU vs FPGA optimisation for real-time image processing Download

The BiConjugate gradient method on GPUs Download

The Boat Hull Model: Adapting the Roofline Model to Enable Performance Prediction for Parallel Computing Download

The BondMachine toolkit: Enabling Machine Learning on FPGA Download

The Bones Source-to-Source Compiler Manual Download Package

The Case for Higher Computational Density in the Memory-Bound FDTD Method within Multicore Environments Download

The case for VOS: the vector operating system Download

The Celerity High-level API: C++20 for Accelerator Clusters Download Package

The Chamomile Scheme: An Optimized Algorithm for N-body simulations on Programmable Graphics Processing Units Download

The Comparisons of OpenCL and OpenMP Computing Paradigm Download

The Complete Rank Transform: A Tool for Accurate and Morphologically Invariant Matching of Structures Download

The computer graphics wars heat up Download

The conjugate gradient solver accelerated by GPU for solving wave-propagation problems Download

The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding Download

The CUBLAS and CULA based GPU acceleration of adaptive finite element framework for bioluminescence tomography Download

The CUDA Handbook: A Comprehensive Guide to GPU Programming Download Package

The CUDA implementation of the method of lines for the curvature dependent flows Download

The CUDA LATCH Binary Descriptor: Because Sometimes Faster Means Better Download Package

The DabR – A multitouch system for intuitive 3D scene navigation Download

The Deep Learning Compiler: A Comprehensive Survey Download

The density matrix renormalization group algorithm on kilo-processor architectures: implementation and trade-offs Download

The Design and Implementation of a GPU-enabled Multi-objective Tabu-search Intended for Real World and High-dimensional Applications Download

The Design and Implementation of a Verification Technique for GPU Kernels Download Package

The design and verification of Mumax3 Download Package

The development and expansion of HOOMD-blue through six years of GPU proliferation Download Package

The development of Mellanox/NVIDIA GPUDirect over InfiniBand-a new model for GPU to GPU communications

The discrete dipole approximation code DDscat.C++: features, limitations and plans Download Package

The distributed diagonal force decomposition method for parallelizing molecular dynamics simulations

The Distribution of OpenCL Kernel Execution Across Multiple Devices Download

The Dual-Path Execution Model for Efficient GPU Control Flow Download

The Dynamical Kernel Scheduler – Part 1 Download

The Ecological Footprint of Neural Machine Translation Systems Download Package

The effects of nutrient chemotaxis on bacterial aggregation patterns with non-linear degenerate cross diffusion Download

The Fast and Wideband MoM Based on GPU and Two-Path AFS Acceleration Download

The fast evaluation of hidden Markov models on GPU

The fast multipole method on parallel clusters, multicore processors, and graphics processing units Download

The Fast Multipole Method on the Cell processor Download

The Fat-Link Computation On Large GPU Clusters for Lattice QCD Download

The Feasibility of Using OpenCL Instead of OpenMP for Parallel CPU Programming Download

The FFT on a GPU Download

The Flocking Based and GPU Accelerated Internet Traffic Classification Download

The Framework and Compilation Techniques for Directive-based GPU Cluster Programming Download

The Future in Mobile Multicore Computing Download

The Future of Accelerator Programming: Abstraction, Performance or Can We Have Both? Download

The future of microprocessors Download

The GASPI API specification and its implementation GPI 2.0 Download Package

The Geant4 Visualisation System – a multi-driver graphics system Download

The GeForce 6 series GPU architecture Download

The GeForce 6800 Download

The Genetic Convolutional Neural Network Model Based on Random Sample Download

The GENGA Code: Gravitational Encounters in N-body simulations with GPU Acceleration Download Package

The GPU as a high performance computational resource Download

The GPU as numerical simulation engine Download

The GPU Computing Era

The GPU Computing Revolution: From Multi-Core CPUs To Many-Core Graphics Processors Download

The GPU Enhanced Parallel Computing for Large Scale Data Clustering Download

The GPU enters computing’s mainstream

The GPU on biomedical image processing for color and phenotype analysis Download

The GPU on irregular computing: performance issues and contributions Download

The GPU on the simulation of cellular computing models

The GPU vs Phi Debate: Risk Analytics Using Many-Core Computing Download

The GPU-based High-performance Pattern-matching Algorithm for Intrusion Detection Download

The GPU-based Parallel Ant Colony System Download

The GPU-based String Matching System in Advanced AC Algorithm

The gputools package enables GPU computing in R Download Package

The GPUVerify Method: a Tutorial Overview Download Package

The Graphics Card as a Streaming Computer Download

The Graphics Processor as a Mathematical Coprocessor in MATLAB

The Heisenberg spin glass model on GPU: myths and actual facts Download

The Hierarchical Memory Machine Model for GPUs Download

The Hitchhiker’s Guide to Cross-Platform OpenCL Application Development Download

The impact of accelerator processors for high-throughput molecular modeling and simulation Download

The impact of diverse memory architectures on multicore consumer software: an industrial perspective from the video games domain Download

The Impact of GPU DVFS on the Energy and Performance of Deep Learning: an Empirical Study Download

The impact of GPU/Multicore in Signal Processing: a quantitative approach Download

The Impact of Modern Consumer GPUs on Commonly Used Secure Password Standards Download

The Implement of Common Beam Forming Using GPU Download

The implementation and optimization of Bitonic sort algorithm based on CUDA Download

The Implementation of a Real-Time Polyphase Filter Download

The implementation of Multi-Scale Retinex image enhancement algorithm based on GPU via CUDA Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 87

Packages: 18

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hgpu.org