Views of posts on hgpu.org
GPIC – GPU Power Iteration Cluster 1,338 views
Accelerating Deep Neural Networks implementation: A survey 1,338 views
A small-world network model for distributed storage of semantic metadata 1,338 views
Many-threaded implementation of differential evolution for the CUDA platform 1,338 views
A Performance Study for Iterative Stencil Loops on GPUs with Ghost Zone Optimizations 1,338 views
An Optimized Multiple Right-Hand Side Dslash Kernel for Intel Xeon Phi 1,338 views
Automatic Termination Analysis for GPU Kernels 1,338 views
Anytime Algorithms for GPU Architectures 1,338 views
A survey of BRDF models for computer graphics 1,337 views
Graphics processing unit implementation of lattice Boltzmann models for flowing soft systems 1,337 views
Experiments with Single Core, Multi-core, and GPU Based Computation of Cellular Automata 1,337 views
GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition 1,337 views
Studies on CUDA Offloading for Real-Time Simulation and Visualization 1,337 views
Analysis of periodic structures with GPU accelerating 1,337 views
Performance Analysis of the OP2 Framework on Many-core Architectures 1,337 views
Improving the Performance of CA-GMRES on Multicores with Multiple GPUs 1,337 views
Block based Singular Value Decomposition approach to matrix factorization for recommender systems 1,337 views
Optimizing GPU to GPU Communication on Cray XK7 1,337 views
Accelerating linear system solutions using randomization techniques 1,337 views
Quantifying NUMA and contention effects in multi-GPU systems 1,336 views
Performance in GPU Architectures: Potentials and Distances 1,336 views
GPU-based real-time acoustical occlusion modeling 1,336 views
Increasing predictability of GPU’s 1,336 views
Online Adaptive Code Generation and Tuning 1,336 views
An Efficient Stream Buffer Mechanism for Dataflow Execution on Heterogeneous Platforms with GPUs 1,336 views
Acceleration of a Locally Tuned Sine Non Linear Video Enhancement Algorithm on GPGPU 1,336 views
Hardware Accelerators for Cartesian Genetic Programming 1,335 views
A GPU-tailored approach for training kernelized SVMs 1,335 views
Auto-Generation of Parallel Finite-Differencing Code for MPI, TBB and CUDA 1,335 views
Decoupled Access/Execute Metaprogramming for GPU-Accelerated Systems 1,335 views
SLATE port to AMD and Intel platforms 1,335 views
Co-processor acceleration of an unmodified parallel solid mechanics code with FEASTGPU 1,335 views
Real-time Medical Image Volume Rendering Based on GPU Accelerated Method 1,335 views
Efficient Intranode Communication in GPU-Accelerated Systems 1,335 views
Automatically Tuned Dense Linear Algebra for Multicore+GPU 1,335 views
Toward efficient GPU-accelerated N-body simulations 1,335 views
Deforming a High-Resolution Mesh in Real-Time by Mapping onto a Low-Resolution Physical Model 1,334 views
Line-art Illustration of Dynamic and Specular Surfaces 1,334 views
Feature Generation for Quantification of Visual Similarity 1,334 views
GPGPU supported cooperative acceleration in molecular dynamics 1,334 views
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System 1,334 views
Hera-JVM: a runtime system for heterogeneous multi-core architectures 1,334 views
Towards systematic exploration of tradeoffs for medical image registration on heterogeneous platforms 1,334 views
GPU accelerated FDTD solver and its application in MRI 1,334 views
Task and Data Distribution in Hybrid Parallel Systems 1,334 views
Two-Level Approach to Efficient Visualization of Protein Dynamics 1,333 views
CANSCID-CUDA 1,333 views
Load Balancing Utilizing Data Redundancy in Distributed Volume Rendering 1,333 views
Dependable Embedded Systems 1,333 views
Program Optimization of Array-Intensive SPEC2k Benchmarks on Multithreaded GPU Using CUDA and Brook+ 1,333 views
Development of Krylov and AMG linear solvers for large-scale sparse matrices on GPUs 1,333 views
Petascale visualization: Approaches and initial results 1,333 views
GPUQT: An efficient linear-scaling quantum transport code fully implemented on graphics processing units 1,333 views
Directives Based Programming of GPU Accelerated Systems 1,333 views
GPU Rigid Skinning based on a Refined Skeletonization Method 1,333 views
Realtime phase-based optical flow on the GPU 1,333 views
High throughput multiple-precision GCD on the CUDA architecture 1,333 views
An Explicit Algorithm for Porous Media Flow Simulation using GPUs 1,332 views
High Performance Data Mining Using R on Heterogeneous Platforms 1,332 views
Real time ultrasound image denoising 1,332 views
Accelerating Stochastic Simulations on GPUs Using OpenCL 1,332 views
Achieving High Throughput Sequencing with Graphics Processing Units 1,332 views
Accelerating data mining workloads: current approaches and future challenges in system architecture design 1,332 views
Linear Algebra Algorithms for Hybrid Architectures with XKaapi 1,332 views
A framework for lab-based real-time video analysis on distributed camera networks 1,332 views
Parallel Streaming Intra Prediction for Full HD H.264 Encoding 1,332 views
An events based algorithm for distributing concurrent tasks on multi-core architectures 1,332 views
Building a Real-Time Multi-GPU Platform: Robust Real-Time Interrupt Handling Despite Closed-Source Drivers 1,332 views
GEVO: GPU Code Optimization using Evolutionary Computation 1,332 views
Stadium Hashing: Scalable and Flexible Hashing on GPUs 1,332 views
GPU-based multi-view rendering for spatial-multiplex autostereoscopic displays 1,332 views
Multi-core programming with OpenCL: performance and portability: OpenCL in a memory bound scenario 1,332 views
Accelerating Beam Dynamics Simulations with GPUs 1,331 views
Sub-seasonal forecasting with a large ensemble of deep-learning weather prediction models 1,331 views
Parallel Cycle Based Logic Simulation Using Graphics Processing Units 1,331 views
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications 1,331 views
Fine-grained parallelization of a Vlasov-Poisson application on GPU 1,331 views
A Self-Optimizing Framework for Developing Metrology Software on Massive Parallel Processor Architectures 1,331 views
Acceleration of a Full-scale Industrial CFD Application with OP2 1,331 views
Using of New Possibilities of Fermi Architecture by Development of GPGPU Programs 1,331 views
Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms 1,331 views
Balancing locality and concurrency: solving sparse triangular systems on GPUs 1,331 views
Anisotropic noise 1,331 views
GPU-based implementation of a cerebellar spiking network model for realtime robot control 1,331 views
Automatic Scheduling of Compute Kernels Across Heterogeneous Architectures 1,331 views
Optimized GPU Framework for Ultrasound Color Flow Imaging 1,330 views
Leveraging Binary Translation for Heterogeneous Profiling 1,330 views
Designing Fast Architecture Sensitive Tree Search on Modern Multi-Core/Many-Core Processors 1,330 views
GPU-Enabled AI 1,330 views
Rendering of 3D Dynamic Virtual Environments 1,330 views
FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow 1,330 views
Importance of Data Loading Pipeline in Training Deep Neural Networks 1,329 views
Performance analysis and optimization of a CFD application 1,329 views
Efficient nearest-neighbor computation for GPU-based motion planning 1,329 views
Titles: 100
Total views: 133366
- Programming - 186,129 views
- Login - 164,359 views
- User dashboard - 90,593 views
- Paper titles list - 70,004 views
- Add new event - 64,577 views
- Add new post - 59,319 views
- Register - 49,175 views
- Statistics - 36,467 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,165 views
- Books on OpenCL and CUDA - 28,809 views