Views of posts on hgpu.org
Performance Improvement of Data Mining in Weka through GPU Acceleration 3,526 views
GPU TV-L1 Optical Flow 3,525 views
Multi-view Rendering Approach for Cloud-based Gaming Services 3,510 views
Evaluating GPU Passthrough in Xen for High Performance Cloud Computing 3,510 views
Benchmarking the Memory Hierarchy of Modern GPUs 3,509 views
An optimized GPU implementation of a 2D free surface simulation model on unstructured meshes 3,508 views
Real-Time Spherical Panorama Image Stitching Using OpenCL 3,508 views
Improving the speed of neural networks on CPUs 3,508 views
Performance Analysis of CUDA and OpenCL By Implementation of Cryptographic Algorithms 3,501 views
A GPU accelerated algorithm for 3D Delaunay triangulation 3,496 views
Data Compression using CUDA programming in GPU 3,492 views
Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters 3,477 views
Bohrium: Unmodified NumPy Code on CPU, GPU, and Cluster 3,475 views
Accelerating Computer Vision Algorithms Using OpenCL on Mobile GPU – A Case Study 3,472 views
OpenSSL acceleration using Graphics Processing Units 3,471 views
Fast and Efficient Lossless Image Compression Based on CUDA Parallel Wavelet Tree Encoding 3,459 views
Real-Time Deformation of Subdivision Surfaces from Object Collisions 3,455 views
Implementation of Diamond Search Algorithm Using Parallel Processing Architecture 3,453 views
3D Skeleton Extraction Method using Potential Field on OpenCL 3,450 views
Videogame Graphics, BigData & Analytics 3,449 views
GooFit: A library for massively parallelising maximum-likelihood fits 3,446 views
CUDArray: CUDA-based NumPy 3,446 views
FDTD on Distributed Heterogeneous Multi-GPU Systems 3,443 views
Parallel Computing the Longest Common Subsequence (LCS) on GPUs: Efficiency and Language Suitability 3,438 views
Duplicate Detection on GPUs 3,436 views
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing 3,435 views
CUDA Application Design and Development 3,434 views
Markov Chain Monte Carlo on the GPU 3,428 views
Big Integer Multiplication with CUDA FFT (cuFFT) Library 3,427 views
GPU Computing: Image Convolution 3,426 views
A Case Study of OpenCL on an Android Mobile GPU 3,426 views
Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation 3,418 views
gpucc: an open-source GPGPU compiler 3,415 views
An OpenCL-based Implementation of H.264 Encoder 3,414 views
Best Practice Guide – Intel Xeon Phi 3,411 views
Astrophysical data mining with GPU. A case study: genetic classification of globular clusters 3,408 views
In-Datacenter Performance Analysis of a Tensor Processing Unit 3,401 views
Accelerating recurrent neural network training using sequence bucketing and multi-GPU data parallelization 3,400 views
Delaunay Triangulation in R3 on the GPU 3,396 views
Real Time Face Detection on GPU Using OpenCL 3,393 views
Convolutional Neural Network for Sentence Classification 3,391 views
Buffer k-d Trees: Processing Massive Nearest Neighbor Queries on GPUs 3,387 views
Accelerating Fast Fourier Transforms Using Hadoop and CUDA 3,380 views
Modular Arithmetic for Solving Linear Equations on the GPU 3,378 views
A Research of MapReduce with GPU Acceleration 3,377 views
Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters 3,373 views
Performance comparison of gauss-Jordan elimination method using OpenMP and CUDA 3,371 views
Numerical Simulation for the MHD System in 2D Using OpenCL 3,369 views
GPU-Based Implementation of JPEG2000 Encoder 3,368 views
Accelerating Fully Homomorphic Encryption Using GPU 3,368 views
Fast Morphological Image Processing on GPU using CUDA 3,366 views
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour 3,363 views
2D and 3D level-set algorithms on GPU 3,357 views
GPU-Accelerated Scalable Solver for Banded Linear Systems 3,353 views
Real-Time GPU Path Tracing 3,350 views
Ray Tracing in Real-Time Games 3,346 views
Image Super-Resolution Using Deep Convolutional Networks 3,343 views
Theano: A CPU and GPU Math Compiler in Python 3,337 views
10×10: A General-purpose Architectural Approach to Heterogeneity and Energy Efficiency 3,336 views
GPU Fluid Simulation using Smoothed Particle Hydrodynamics 3,324 views
Parallel Implementation of the Wu-Manber Algorithm Using the OpenCL Framework 3,320 views
Analysis and Review of Sorting Algorithms 3,315 views
GPU Acceleration for the C++ Standard Template Library 3,314 views
OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures 3,306 views
A comparison between parallelization approaches in molecular dynamics simulations on GPUs 3,306 views
Programming massively parallel processors : A Hands – on approach 3,303 views
Towards Predictable Real-Time Performance on Multi-Core Platforms 3,301 views
General Purpose Computation on Graphics Processing Units Using OpenCL 3,300 views
3D finite difference computation on GPUs using CUDA 3,293 views
Grex: An efficient MapReduce framework for graphics processing units 3,292 views
GPU vs FPGA: A Comparative Analysis for Non-standard Precision 3,289 views
Towards On-Chip Optical FFTs for Convolutional Neural Networks 3,284 views
Histogram Computations on GPUs Kernel using Global and Shared Memory Atomics 3,280 views
Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA 3,278 views
A Survey of Recent Prefetching Techniques for Processor Caches 3,271 views
Computational Fluid Dynamics using OpenCL – a Practical Introduction 3,271 views
Comparison of SPMV performance on matrices with different matrix format using CUSP, cuSPARSE and ViennaCL 3,269 views
HadoopCL: MapReduce on Distributed Heterogeneous Platforms Through Seamless Integration of Hadoop and OpenCL 3,269 views
Heterogeneous Computing with OpenCL 3,267 views
Analysis of GPU accelerated OpenCL applications on the Intel HD 4600 GPU 3,264 views
Triangular mesh simplification on the GPU 3,263 views
CUDA Accelerated Face Recognition Using Local Binary Patterns 3,258 views
GPU Implementation of the Keccak Hash Function Family 3,257 views
Ray Tracing in the Cloud using MapReduce 3,255 views
Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising 3,248 views
Memory transfer optimization for a lattice Boltzmann solver on Kepler architecture nVidia GPUs 3,246 views
Fast 3D Graphics Rendering Technique with CUDA Parallel Processing 3,239 views
OpenCL Based High-Quality HEVC Motion Estimation on GPU 3,238 views
SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters 3,236 views
Comparison of Fragmentation/Dispersion Models for Asteroid Nuclear Disruption Mission Design 3,233 views
Bitcoin and The Age of Bespoke Silicon 3,225 views
GPU Accelerated Nonlinear Optimization in Radio Interferometric Calibration 3,224 views
Implementing Open-Source CUDA Runtime 3,223 views
Solving 3D Anisotropic Elastic Wave Equations on Parallel GPU Devices 3,218 views
Whippletree: Task-based Scheduling of Dynamic Workloads on the GPU 3,217 views
Deep learning review and its applications 3,215 views
Medical imaging using CUDA 3,214 views
Parallel GPU-accelerated Recursion-based Generators of Pseudorandom Numbers 3,213 views
Multi-Tasking Scheduling for Heterogeneous Systems 3,210 views
GenBase: A Complex Analytics Genomics Benchmark 3,208 views
Titles: 100
Total views: 335864
- Programming - 186,126 views
- Login - 164,123 views
- User dashboard - 90,236 views
- Paper titles list - 69,509 views
- Add new event - 64,509 views
- Add new post - 59,082 views
- Register - 49,120 views
- Statistics - 36,124 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,158 views
- Books on OpenCL and CUDA - 28,744 views