Views of posts on hgpu.org

GPU-based point radiation for interactive volume sculpting and segmentation  1,684 views

GPU-accelerated power pattern synthesis of aperiodic linear arrays  1,684 views

Using Intel oneAPI for Multi-hybrid Acceleration Programming with GPU and FPGA Coupling  1,684 views

GPU-accelleration of image rendering and sorting algorithms with the OpenCL framework  1,684 views

Multiresolution MIP Rendering of Large Volumetric Data Accelerated on Graphics Hardware  1,684 views

Accelerating data mining workloads: current approaches and future challenges in system architecture design  1,684 views

Acceleration of Medical Image Registration using Graphics Process Units in Computing Normalized Mutual Information  1,684 views

Towards a Software Transactional Memory for Graphics Processors  1,684 views

Image Space Gathering  1,684 views

Analysis of the Performance of the Fish School Search Algorithm Running in Graphic Processing Units  1,684 views

A visibility-based approach for occupancy grid computation in disparity space  1,684 views

A parallel evolutionary algorithm to optimize dynamic memory managers in embedded systems  1,684 views

Impact of the channel count on the nonlinear tolerance in coherently-detected POLMUX-QPSK modulation  1,684 views

Executing Dynamic Data Rate Actor Networks on OpenCL Platforms  1,684 views

Speeding Up Cycle Based Logic Simulation Using Graphics Processing Units  1,683 views

Petascale visualization: Approaches and initial results  1,683 views

GiST Scan Acceleration using Coprocessors  1,683 views

Automatic shader level of detail  1,683 views

Customizable Domain-Specific Computing  1,683 views

Programming hybrid systems with implicit memory based synchronization  1,683 views

HALO 1.0: A Hardware-agnostic Accelerator Orchestration Framework for Enabling Hardware-agnostic Programming with True Performance Portability for Heterogeneous HPC  1,683 views

In vivo interactive visualization of four-dimensional blood flow patterns  1,683 views

GPGPU-based Gaussian Filtering for Surface Metrological Data Processing  1,683 views

Massively Parallel GPU Computing of Continuum Robotic Dynamics  1,682 views

Paragon: Collaborative Speculative Loop Execution on GPU and CPU  1,682 views

An algorithm for efficient computation of spatial impulse response on the GPU with application in ultrasound simulation  1,682 views

Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors  1,682 views

Increasing predictability of GPU’s  1,682 views

Compute Unified Device Architecture Application Suitability  1,682 views

An Efficient Stream Buffer Mechanism for Dataflow Execution on Heterogeneous Platforms with GPUs  1,682 views

Chunkflow: Distributed Hybrid Cloud Processing of Large 3D Images by Convolutional Nets  1,682 views

CoCoNet: Co-Optimizing Computation and Communication for Distributed Machine Learning  1,682 views

Accelerating urban fast response Lagrangian dispersion simulations using inexpensive graphics processor parallelism  1,682 views

Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation  1,682 views

Genetic programming on GPUs for image processing  1,682 views

Hierarchical DAG Scheduling for Hybrid Distributed Systems  1,682 views

Large neighborhood local search optimization on graphics processing units  1,681 views

Fast continuous collision detection among deformable models using graphics processors  1,681 views

Parallel Cycle Based Logic Simulation Using Graphics Processing Units  1,681 views

Efficient nearest-neighbor computation for GPU-based motion planning  1,681 views

High performance dense linear system solver with soft error resilience  1,681 views

Data Layout Transformation for Structured-Grid Codes on GPU  1,681 views

Deforming a High-Resolution Mesh in Real-Time by Mapping onto a Low-Resolution Physical Model  1,680 views

Optimized GPU Framework for Ultrasound Color Flow Imaging  1,680 views

Using Reconfigurable Logic to Optimise GPU Memory Accesses  1,680 views

Dataflow-based Design and Implementation of Image Processing Applications  1,680 views

Many-threaded implementation of differential evolution for the CUDA platform  1,680 views

The GPU-based String Matching System in Advanced AC Algorithm  1,680 views

Experimentation Procedure for Offloaded Mini-Apps Executed on Cluster Architectures with Xeon Phi Accelerators  1,680 views

Object oriented framework for real-time image processing on GPU  1,679 views

MPI Derived Datatypes Processing on Noncontiguous GPU-resident Data  1,679 views

Using Hybrid CPU-GPU Platforms to Accelerate the Computation of the Matrix Sign Function  1,679 views

Verification of GPU Program Optimizations in Lean  1,679 views

Full Speed Ahead: 3D Spatial Database Acceleration with GPUs  1,679 views

Task and Data Distribution in Hybrid Parallel Systems  1,679 views

A GPU-based survey for millisecond radio transients using ARTEMIS  1,679 views

FC_ACCEL: Enabling Efficient, Low-Latency and Flexible Inference in DNN Fully Connected Layers, using Optimized Checkerboard Block matrix decomposition, fast scheduling, and a resource efficient 1D PE array with a custom HBM2 memory subsystem  1,678 views

Towards chip-on-chip neuroscience: fast mining of neuronal spike streams using graphics hardware  1,678 views

Realistic real-time rendering for large-scale forest scenes  1,678 views

FEAST – Realisation of hardware-oriented Numerics for HPC simulations with Finite Elements  1,678 views

Performance Aware Convolutional Neural Network Channel Pruning for Embedded GPUs  1,678 views

A Fast, GPU based, Dictionary Attack to OpenPGP Secret Keyrings  1,678 views

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators  1,678 views

Real-time Medical Image Volume Rendering Based on GPU Accelerated Method  1,678 views

GPU based extraction of moving objects without shadows under intensity changes  1,678 views

Performance Analysis of a New Real-Time Elastographic Time Constant Estimator  1,677 views

Energy efficient biomolecular simulations with FPGA-based reconfigurable computing  1,677 views

Parallel Computation for Discrete Orthogonal Moments of Images Using Graphic Processing Unit  1,677 views

Using Graphics Processors to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation  1,677 views

Parallelized agent-based simulation on CPU and graphics hardware for spatial and stochastic models in biology  1,677 views

Novel Methodologies for Predictable CPU-To-GPU Command Offloading  1,677 views

Fast and Efficient FPGA-Based Feature Detection Employing the SURF Algorithm  1,677 views

Whole-function vectorization  1,677 views

High dimensional pricing of exotic European contracts on a GPU Cluster, and comparison to a CPU cluster  1,676 views

Record Setting Software Implementation of DES Using CUDA  1,676 views

A Scalable GPU-based Approach to Accelerate the Multiple-Choice Knapsack Problem  1,676 views

Optimizing GPU to GPU Communication on Cray XK7  1,676 views

Towards real-time radiation therapy: GPU accelerated superposition/convolution  1,676 views

Quantifying NUMA and contention effects in multi-GPU systems  1,676 views

High-dimensional Planning on the GPU  1,676 views

Vortex methods for incompressible flow simulations on the GPU  1,676 views

Active Structured Learning for High-Speed Object Detection  1,675 views

Evolving a CUDA kernel from an nVidia template  1,675 views

Acceleration of Scientific Deep Learning Models on Heterogeneous Computing Platform with Intel FPGAs  1,675 views

Probing biomolecular machines with graphics processors  1,675 views

GPU friendly fast Poisson solver for structured power grid network analysis  1,675 views

Program Optimization Strategies for Data-Parallel Many-Core Processors  1,675 views

Real time ultrasound image denoising  1,674 views

Real-time hair simulation on GPU with a dynamic wisp model  1,674 views

ATI Stream Profiler: a tool to optimize an OpenCL kernel on ATI Radeon GPUs  1,674 views

Object-oriented stream programming using aspects  1,674 views

Feature Generation for Quantification of Visual Similarity  1,674 views

An Efficient Acceleration of Symmetric Key Cryptography Using General Purpose Graphics Processing Unit  1,674 views

Scalable Software Defined FM-radio receiver running on desktop computers  1,674 views

A Framework for Transparent Execution of Massively-Parallel Applications on CUDA and OpenCL  1,673 views

Overhauling SC atomics in C11 and OpenCL  1,673 views

Designing Fast Architecture Sensitive Tree Search on Modern Multi-Core/Many-Core Processors  1,673 views

Cloudlet-screen computing: A multi-core-based, cloud-computing-oriented, traditional-computing-compatible parallel computing Paradigm for the masses  1,673 views

On Expressing Different Concurrency Paradigms on Virtual Execution Systems (thesis)  1,673 views

Using GPU to Accelerate Cache Simulation  1,673 views

 

Brief statistics for this page

Titles: 100

Total views: 167,920

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hgpu.org