Views of posts on hgpu.org
Performance Evaluation of Sparse Matrix Multiplication Kernels on Intel Xeon Phi 3,961 views
Hidden Surface Removal Using BSP Tree with CUDA 3,951 views
An optimized GPU implementation of a 2D free surface simulation model on unstructured meshes 3,949 views
FDTD on Distributed Heterogeneous Multi-GPU Systems 3,946 views
In-Datacenter Performance Analysis of a Tensor Processing Unit 3,937 views
GPGPU Acceleration for Skeletal Animation – comparing OpenCL with CUDA and GLSL 3,934 views
Dogwild! – Distributed Hogwild for CPU & GPU 3,932 views
A Survey of Recent Prefetching Techniques for Processor Caches 3,930 views
Videogame Graphics, BigData & Analytics 3,910 views
Data Compression using CUDA programming in GPU 3,908 views
Big Integer Multiplication with CUDA FFT (cuFFT) Library 3,907 views
OpenSSL acceleration using Graphics Processing Units 3,906 views
Astrophysical data mining with GPU. A case study: genetic classification of globular clusters 3,905 views
Real-Time Spherical Panorama Image Stitching Using OpenCL 3,902 views
Improving the speed of neural networks on CPUs 3,901 views
High-Performance GPGPU Programming with OCaml 3,898 views
Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters 3,895 views
An OpenCL Runtime and Scheduler for Embedded Multicore DSP Parallel Systems 3,893 views
An OpenCL-based Implementation of H.264 Encoder 3,891 views
Performance Improvement of Data Mining in Weka through GPU Acceleration 3,888 views
gpucc: an open-source GPGPU compiler 3,887 views
Bohrium: Unmodified NumPy Code on CPU, GPU, and Cluster 3,882 views
A Case Study of OpenCL on an Android Mobile GPU 3,882 views
Efficient Hybrid Execution of C++ Applications using Intel(R) Xeon Phi(TM) Coprocessor 3,875 views
GPU TV-L1 Optical Flow 3,874 views
Buffer k-d Trees: Processing Massive Nearest Neighbor Queries on GPUs 3,867 views
Real-Time Deformation of Subdivision Surfaces from Object Collisions 3,865 views
Fast and Efficient Lossless Image Compression Based on CUDA Parallel Wavelet Tree Encoding 3,864 views
A GPU accelerated algorithm for 3D Delaunay triangulation 3,863 views
3D Skeleton Extraction Method using Potential Field on OpenCL 3,862 views
Evaluating GPU Passthrough in Xen for High Performance Cloud Computing 3,857 views
Parallel Computing the Longest Common Subsequence (LCS) on GPUs: Efficiency and Language Suitability 3,856 views
Programming massively parallel processors: A Hands-on approach 3,854 views
GooFit: A library for massively parallelising maximum-likelihood fits 3,852 views
Performance Analysis of CUDA and OpenCL By Implementation of Cryptographic Algorithms 3,845 views
Delaunay Triangulation in R3 on the GPU 3,844 views
Best Practice Guide – Intel Xeon Phi 3,839 views
Implementation of Diamond Search Algorithm Using Parallel Processing Architecture 3,830 views
Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters 3,829 views
CUDArray: CUDA-based NumPy 3,826 views
Markov Chain Monte Carlo on the GPU 3,821 views
GPU Computing: Image Convolution 3,818 views
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour 3,814 views
Accelerating recurrent neural network training using sequence bucketing and multi-GPU data parallelization 3,810 views
Modular Arithmetic for Solving Linear Equations on the GPU 3,806 views
3D finite difference computation on GPUs using CUDA 3,805 views
CUD@ASP: Experimenting with GPUs in ASP solving 3,797 views
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing 3,796 views
Heterogeneous Computing with OpenCL 3,792 views
Fast Morphological Image Processing on GPU using CUDA 3,790 views
Accelerating Computer Vision Algorithms Using OpenCL on Mobile GPU – A Case Study 3,786 views
Theano: A CPU and GPU Math Compiler in Python 3,780 views
Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA 3,779 views
Image Super-Resolution Using Deep Convolutional Networks 3,777 views
Real Time Face Detection on GPU Using OpenCL 3,776 views
Performance comparison of FPGA, GPU and CPU in image processing 3,773 views
Accelerating Fully Homomorphic Encryption Using GPU 3,771 views
2D and 3D level-set algorithms on GPU 3,771 views
Convolutional Neural Network for Sentence Classification 3,770 views
OpenCL Based High-Quality HEVC Motion Estimation on GPU 3,761 views
10×10: A General-purpose Architectural Approach to Heterogeneity and Energy Efficiency 3,757 views
GPU-Accelerated Scalable Solver for Banded Linear Systems 3,755 views
CUDA by Example: An Introduction to General-Purpose GPU Programming 3,753 views
GPU-Based Implementation of JPEG2000 Encoder 3,746 views
A Research of MapReduce with GPU Acceleration 3,745 views
Performance comparison of Gauss-Jordan elimination method using OpenMP and CUDA 3,743 views
Ray Tracing in Real-Time Games 3,741 views
Numerical Simulation for the MHD System in 2D Using OpenCL 3,739 views
Accelerating Fast Fourier Transforms Using Hadoop and CUDA 3,738 views
Analysis and Review of Sorting Algorithms 3,729 views
Understanding Latency Hiding on GPUs 3,723 views
Two Approaches to Particle Simulation: OpenMPI and CUDA 3,722 views
SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters 3,720 views
OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures 3,718 views
General Purpose Computation on Graphics Processing Units Using OpenCL 3,715 views
Parallel Implementation of the Wu-Manber Algorithm Using the OpenCL Framework 3,706 views
A comparison between parallelization approaches in molecular dynamics simulations on GPUs 3,699 views
Real-Time GPU Path Tracing 3,696 views
Stackless KD-Tree Traversal for High Performance GPU Ray Tracing 3,695 views
Brute-Force k-Nearest Neighbors Search on the GPU 3,692 views
Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising 3,689 views
CUDA Accelerated Face Recognition Using Local Binary Patterns 3,689 views
GPU Acceleration for the C++ Standard Template Library 3,687 views
GPU Fluid Simulation using Smoothed Particle Hydrodynamics 3,680 views
Solving 3D Anisotropic Elastic Wave Equations on Parallel GPU Devices 3,678 views
Computational Fluid Dynamics using OpenCL – a Practical Introduction 3,677 views
GPU Computing 3,676 views
Towards Predictable Real-Time Performance on Multi-Core Platforms 3,674 views
Programming Massively Parallel Processors with CUDA (audio course) 3,674 views
Implementing Open-Source CUDA Runtime 3,672 views
Grex: An efficient MapReduce framework for graphics processing units 3,672 views
Analysis of GPU accelerated OpenCL applications on the Intel HD 4600 GPU 3,671 views
Implementation of Keccak hash function in Tree hashing mode on Nvidia GPU 3,671 views
HadoopCL: MapReduce on Distributed Heterogeneous Platforms Through Seamless Integration of Hadoop and OpenCL 3,665 views
GPU vs FPGA: A Comparative Analysis for Non-standard Precision 3,651 views
Comparison of SPMV performance on matrices with different matrix format using CUSP, cuSPARSE and ViennaCL 3,648 views
Brook for GPUs: Stream Computing on Graphics Hardware 3,647 views
Towards On-Chip Optical FFTs for Convolutional Neural Networks 3,643 views
GPU Implementation of the Keccak Hash Function Family 3,639 views
Whippletree: Task-based Scheduling of Dynamic Workloads on the GPU 3,638 views
Titles: 100
Total views: 379,163
- Programming - 186,226 views
- Login - 171,888 views
- User dashboard - 98,374 views
- Paper titles list - 91,765 views
- Add new event - 69,091 views
- Add new post - 62,671 views
- Register - 52,971 views
- Statistics - 44,030 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,513 views
- Books on OpenCL and CUDA - 31,042 views