Views of posts on hgpu.org
OpenCL-Darknet: implementation and optimization of OpenCL-based deep learning object detection framework 2,109 views
OpenCL-accelerated Point Feature Histogram and Its Application in Railway Track Point Cloud Data Processing 2,109 views
Fast Monte Carlo Simulation for Patient-specific CT/CBCT Imaging Dose Calculation 2,108 views
Development of a GPU-based Monte Carlo dose calculation code for coupled electron-photon transport 2,108 views
A Domain Specific Approach to Heterogeneous Computing: From Availability to Accessibility 2,108 views
Parallel Unsmoothed Aggregation Algebraic Multigrid Algorithms on GPUs 2,108 views
Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks 2,107 views
Massively parallel approximate Gaussian process regression 2,107 views
A fast Texture-by-numbers synthesis method based on texture optimization 2,107 views
Learning hash codes for efficient content reuse detection 2,107 views
DeepBach: a Steerable Model for Bach chorales generation 2,107 views
A GPU accelerated storage system 2,107 views
Sparse matrix solvers on the GPU: conjugate gradients and multigrid 2,107 views
Parallel Benefit on Different Programming Paradigms 2,107 views
Early Experiences With The OpenMP Accelerator Model 2,106 views
Computing finite models using free Boolean generators 2,106 views
A Survey Of Architectural Approaches for Data Compression in Cache and Main Memory Systems 2,106 views
ProGraML: Graph-based Deep Learning for Program Optimization and Analysis 2,106 views
Serial and Parallel Bayesian Spam Filtering using Aho-Corasick and PFAC 2,106 views
Lightweight modular staging: a pragmatic approach to runtime code generation and compiled DSLs 2,105 views
Real-time colouring and filtering with graphics shaders 2,105 views
CUDACLAW: a Data Parallel Solution Framework for Hyperbolic PDEs 2,105 views
Adding GPU Computing to Computer Organization Courses 2,105 views
On the Characterization of OpenCL Dwarfs on Fixed and Reconfigurable Platforms 2,105 views
GPU-based NSEC3 Hash Breaking 2,104 views
Declarative Parallel Programming for GPUs 2,104 views
Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking 2,104 views
Fast quantum Monte Carlo on a GPU 2,104 views
Performance Tradeoff Spectrum of Integer and Floating Point Applications 2,104 views
On the Use of Remote GPUs and Low-Power Processors for the Acceleration of Scientific Applications 2,104 views
Graphics Processing Units and High-Dimensional Optimization 2,104 views
Design and Storage Optimization of GPU-based Parallel Program of Image Registration for Remote Sensing 2,104 views
A GPU-Based Accelerator for Chinese Word Segmentation 2,103 views
Neural Networks through Shared Maps in Mobile Devices 2,103 views
Algorithms for Rapid Characterization and Optimization of Aperture and Reflector Antennas 2,103 views
Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation 2,103 views
Enhancing and Porting the HPC-Lab Snow Simulator to OpenCL on Mobile Platforms 2,103 views
A CUDA Back-End for the Equelle Compiler 2,103 views
A Novel Implementation of QuickHull Algorithm on the GPU 2,102 views
Ultra-fast treatment plan optimization for volumetric modulated arc therapy (VMAT) 2,102 views
Finding, Measuring, and Reducing Inefficiencies in Contemporary Computer Systems 2,102 views
Programming Abstractions and Optimization Techniques for GPU-based Heterogeneous Systems 2,102 views
Practical CFD Simulations on Programmable Graphics Hardware using SMAC 2,102 views
Real-time adaptive fluid simulation with complex boundaries 2,102 views
Accelerating Kirchhoff Migration by CPU and GPU Cooperation 2,102 views
Efficient GPU-Based Texture Interpolation using Uniform B-Splines 2,102 views
Machine Learning at the Limit 2,101 views
Classiffication-based Financial Markets Prediction using Deep Neural Networks 2,101 views
Architecting an LTE Base Station with Graphics Processing Units 2,100 views
Astrophysical Supercomputing with GPUs: Critical Decisions for Early Adopters 2,100 views
A Code Optimization Framework for Performance Portability of GPU Kernels onto Custom Accelerators 2,100 views
FMM-based vortex method for simulation of isotropic turbulence on GPUs, compared with a spectral method 2,100 views
A compiler framework for optimization of affine loop nests for gpgpus 2,100 views
GPU-Based Local-Dimming for Power Efficient Imaging 2,100 views
Characterization of FPGA-based High Performance Computers 2,100 views
GPU-accelerated MRF segmentation algorithm for SAR images 2,099 views
Optimizing Deep CNN-Based Queries over Video Streams at Scale 2,099 views
Parallel Graph Mining with GPUs 2,099 views
Code Optimization on Kepler GPUs and Xeon Phi 2,099 views
D5.5.3 – Design and implementation of the SIMD-MIMD GPU architecture 2,099 views
Parallel Approach for Longest Common Subsequence problem on GPU 2,099 views
An adaptive Expectation-Maximization algorithm with GPU implementation for electron cryomicroscopy 2,098 views
Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET 2,098 views
Hybrid MPI/GPU Interpolation for Grid DEM Construction 2,098 views
Finite Difference Time Domain (FDTD) Simulations Using Graphics Processors 2,098 views
MGPUSim: Enabling Multi-GPU Performance Modeling and Optimization 2,098 views
Signed distance transform using graphics hardware 2,097 views
GPU-Mapping: Robotic Map Building with Graphical Multiprocessors 2,097 views
Learning Representation for Scene Understanding: Epitomes, CRFs, and CNNs 2,097 views
Performance and Portability of Accelerated Lattice Boltzmann Applications with OpenACC 2,097 views
A Reduction of the Elastic Net to Support Vector Machines with an Application to GPU Computing 2,097 views
Improvements to Physically Based Cloth Simulation 2,096 views
Fast Neural Network Training on General Purpose Computers 2,096 views
Adding special-purpose processor support to the Erlang VM 2,096 views
Hermes: an integrated CPU/GPU microarchitecture for IP routing 2,096 views
Fast CT Image Processing using Parallelized Non-local Means 2,096 views
Gauge Field Generation on Large-Scale GPU-Enabled Systems 2,096 views
Volume-preserving FFD for programmable graphics hardware 2,095 views
Development of JavaScript-based deep learning platform and application to distributed training 2,095 views
Improving GPU programming models through hardware cache coherence 2,095 views
A GPU-Accelerated Framework for Image Processing and Computer Vision 2,095 views
Enabling OpenCL on a Configurable, VLIW Chip-Multiprocessor 2,095 views
Cluster-Level Tuning of a Shallow Water Equation Solver on the Intel MIC Architecture 2,095 views
A Comparative Study of Neighborhood Filters for Artifact Reduction in Iterative Low-Dose CT 2,095 views
Split tiling for GPUs: automatic parallelization using trapezoidal tiles 2,094 views
GPU-Accelerated Nearest Neighbor Search for 3D Registration 2,094 views
Grids, Clouds and Virtualization 2,094 views
Algorithms and Data Structures for Interactive Ray Tracing on Commodity Hardware 2,094 views
Benchmarking Parallel Performance on Many-Core Processors 2,094 views
Parallel Game Tree Search Using GPU 2,094 views
Real-Time Automatic Object Classification and Tracking using Genetic Programming and NVIDIA CUDA 2,094 views
Performance Evaluations of Document-Oriented Databases using GPU and Cache Structure 2,094 views
GPU-based Cloud Computing for Comparing the Structure of Protein Binding Sites 2,093 views
Titles: 100
Total views: 210095
- Programming - 186,133 views
- Login - 164,567 views
- User dashboard - 91,311 views
- Paper titles list - 71,335 views
- Add new event - 64,814 views
- Add new post - 59,614 views
- Register - 49,321 views
- Statistics - 37,173 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,190 views
- Books on OpenCL and CUDA - 28,900 views