Views of posts on hgpu.org
High-Performance Monte Carlo Radiosity on GPU based on Scene Partitioning 1,218 views
Simulation of bevel gear cutting with GPGPUs – performance and productivity 1,217 views
On the Usage of GPUs for Efficient Motion Estimation in Medical Image Sequences 1,217 views
Code Generation for Embedded Heterogeneous Architectures on Android 1,217 views
Efficient MPI-based Communication for GPU-Accelerated Dask Applications 1,217 views
Evaluating polynomials in several variables and their derivatives on a GPU computing processor 1,217 views
A framework for GPU-based application-independent 3D interactions 1,216 views
Accelerated Root Finding for Computational Finance 1,216 views
Experimental Evaluation of Multiprecision Strategies for GMRES on GPUs 1,216 views
Plant Leaf Modeling and Rendering Based-On GPU 1,216 views
Architecting graphics processors for non-graphics compute acceleration 1,215 views
PyTorchPipe: a framework for rapid prototyping of pipelines combining language and vision 1,215 views
SU(2) Lattice Gauge Theory Simulations on Fermi GPUs 1,214 views
HCW 2009 keynote talk: GPU computing: Heterogeneous computing for future systems 1,214 views
OpenCL Performance on the Intel Heterogeneous Architecture Research Platform 1,214 views
Real-time 3D reconstruction for mobile robot using catadioptric cameras 1,214 views
Implementing a GPU Programming Model on a non-GPU Accelerator Architecture 1,213 views
A Scalable and Reconfigurable Shared-Memory Graphics Cluster Architecture 1,213 views
Task-Based Parallel Strategies for CFD Application in Heterogeneous CPU/GPU Resources 1,212 views
Fast-Coding Robust Motion Estimation Model in a GPU 1,212 views
Advanced Programming Platform for efficient use of Data Parallel Hardware 1,212 views
Array-Oriented Languages and Polyhedral Compilation 1,212 views
Improving accuracy for matrix multiplications on GPUs 1,211 views
Development of nonlinear filter bank system for real-time beautification of facial video using GPGPU 1,211 views
CoCoNet: Co-Optimizing Computation and Communication for Distributed Machine Learning 1,211 views
A Survey on GPU System Considering its Performance on Different Applications 1,211 views
A parallelization cost model for GPU 1,211 views
dMath: Distributed Linear Algebra for DL 1,210 views
Evaluating multi-core platforms for HPC data-intensive kernels 1,210 views
Static Analysis and Dynamic Adaptation of Parallelism 1,210 views
Connected-component identification and cluster update on graphics processing units 1,210 views
Top-k Queries Processing With Uncertain Data on Graphics Processing Units 1,208 views
Implementation of algorithms with a fine-grained parallelism on GPUs 1,208 views
GP-GPU: Bridging the Gap between Modelling & Experimentation 1,208 views
HAM – Heterogenous Active Messages for Efficient Offloading on the Intel Xeon Phi 1,208 views
Real-time Adaptive Tone Mapping for Monitoring High Contrast Hemispherical Image Capture with the GPU 1,208 views
Implementation of a programming environment with a multithread model for reconfigurable systems 1,208 views
Unsupervised Markovian Segmentation on Graphics Hardware 1,207 views
De-specializing an HLS library for Deep Neural Networks: improvements upon hls4ml 1,207 views
Automatic Code Generation and Adaptive Grid Scheduling for GPU Cluster Computing 1,207 views
GPU Acceleration of Near-Minimal Logic Minimization 1,207 views
On the Use of an Algebraic Language Interface for Waveform Definition 1,205 views
Scaleable Sparse Matrix-Vector Multiplication with Functional Memory and GPUs 1,205 views
Taming irregular EDA applications on GPUs 1,205 views
Interactive Point-based Isosurface Exploration and High-quality Rendering 1,204 views
Real Time Capture of Audio Images and their Use with Video 1,203 views
Visualizing and Analyzing the Mona Lisa 1,203 views
Visualizing complex dynamics in many-core accelerator architectures 1,203 views
Real-Time Tracking with Non-Rigid Geometric Templates Using the GPU 1,203 views
Curling and clumping fur represented by texture layers 1,203 views
Implementation of Autoencoders with Systolic Arrays through OpenCL 1,203 views
Sparse regularization in MRI iterative reconstruction using GPUs 1,202 views
Understanding the design trade-offs among current multicore systems for numerical computations 1,202 views
In-memory grid files on graphics processors 1,202 views
Learning Massive Graph Embeddings on a Single Machine 1,201 views
Directive-Based Data Partitioning and Pipelining and Auto-Tuning for High-Performance GPU Computing 1,201 views
Quantifying the Impact of GPUs on Performance and Energy Efficiency in HPC Clusters 1,201 views
Ray-Casted BlockMaps for Large Urban Models Visualization 1,199 views
Accelerate Scientific Deep Learning Models on Heterogeneous Computing Platform with FPGA 1,199 views
Iterative SLE Solvers over a CPU-GPU Platform 1,199 views
Fast convolutional neural networks on FPGAs with hls4ml 1,198 views
Automatic safety proofs for asynchronous memory operations 1,198 views
PoCL-R: A Scalable Low Latency Distributed OpenCL Runtime 1,198 views
GPULib: GPU Computing in High-Level Languages 1,197 views
GPU-accelerated synthesis of echo generators 1,197 views
Fast view synthesis using GPU for 3D display 1,197 views
Performance of CPU and GPU HPC Architectures for off-design aircraft simulation 1,196 views
Experiences Developing the OpenUH Compiler and Runtime Infrastructure 1,195 views
Improving the Performance of a Ray Tracing Algorithm Using a GPU 1,195 views
Environment Lighting for Point Sampled Geometry 1,195 views
Runtime Support for Adaptive Power Capping on Heterogeneous SoCs 1,194 views
GPU-based matrix-free finite element solver exploiting symmetry of elemental matrices 1,194 views
A Parallel Algorithm Development Model for the GPU Architecture 1,193 views
Non-Parametric Adaptive Network Pruning 1,193 views
Data-parallel algorithms and data structures 1,193 views
Efficiently Using a CUDA-enabled GPU as Shared Resource 1,193 views
Method for simulation of coastal terrain on GPU 1,193 views
A toolkit to describe and interactively display three-manifolds embedded in four-space 1,193 views
Power-Efficient Work Distribution Method for CPU-GPU Heterogeneous System 1,191 views
Memory-Scalable GPU Spatial Hierarchy Construction 1,190 views
Adjustable GPU Acceleration for Hermitian Eigensystems 1,190 views
Real-time Minute Change Detection on GPU for Cellular and Remote Sensor Imaging 1,190 views
Design and implementation of the Smith-Waterman algorithm on the CUDA-compatible GPU 1,190 views
GPU Nonlinear Fixed Points, with an application to GPU IFS Rendering 1,189 views
Flexible Pixel Compositor for Plug-and-Play Multi-Projector Displays 1,189 views
Optimization of mapped functions sequences using fusions on GPU 1,189 views
Acceleration of spiking neural networks in emerging multi-core and GPU architectures 1,189 views
Synergistic CPU-FPGA Acceleration of Sparse Linear Algebra 1,188 views
Empowering Visual Categorization With the GPU 1,188 views
Heterogeneous (CPU+GPU) Working-set Hash Tables 1,188 views
Parallel time integration using Batched BLAS (Basic Linear Algebra Subprograms) routines 1,188 views
Heterogeneous Active Messages (HAM) – Implementing Lightweight Remote Procedure Calls in C++ 1,187 views
Titles: 100
Total views: 120,355
- Programming - 186,129 views
- Login - 164,382 views
- User dashboard - 90,624 views
- Paper titles list - 70,014 views
- Add new event - 64,589 views
- Add new post - 59,353 views
- Register - 49,179 views
- Statistics - 36,495 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,167 views
- Books on OpenCL and CUDA - 28,819 views