2402

Views of posts on hgpu.org

Performance Evaluation of Sparse Matrix Multiplication Kernels on Intel Xeon Phi  3,961 views

Hidden Surface Removal Using BSP Tree with CUDA  3,951 views

An optimized GPU implementation of a 2D free surface simulation model on unstructured meshes  3,949 views

FDTD on Distributed Heterogeneous Multi-GPU Systems  3,946 views

In-Datacenter Performance Analysis of a Tensor Processing Unit  3,937 views

GPGPU Acceleration for Skeletal Animation-comparing OpenCL with CUDA and GLSL  3,934 views

Dogwild! – Distributed Hogwild for CPU & GPU  3,932 views

A Survey of Recent Prefetching Techniques for Processor Caches  3,930 views

Videogame Graphics, BigData & Analytics  3,910 views

Data Compression using CUDA programming in GPU  3,908 views

Big Integer Multiplication with CUDA FFT (cuFFT) Library  3,907 views

OpenSSL acceleration using Graphics Processing Units  3,906 views

Astrophysical data mining with GPU. A case study: genetic classification of globular clusters  3,905 views

Real-Time Spherical Panorama Image Stitching Using OpenCL  3,902 views

Improving the speed of neural networks on CPUs  3,901 views

High-Performance GPGPU Programming with OCaml  3,898 views

Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters  3,895 views

An OpenCL Runtime and Scheduler for Embedded Multicore DSP Parallel Systems  3,893 views

An OpenCL-based Implementation of H.264 Encoder  3,891 views

Performance Improvement of Data Mining in Weka through GPU Acceleration  3,888 views

gpucc: an open-source GPGPU compiler  3,887 views

Bohrium: Unmodified NumPy Code on CPU, GPU, and Cluster  3,882 views

A Case Study of OpenCL on an Android Mobile GPU  3,882 views

Efficient Hybrid Execution of C++ Applications using Intel(R) Xeon Phi(TM) Coprocessor  3,875 views

GPU TV-L1 Optical Flow  3,874 views

Buffer k-d Trees: Processing Massive Nearest Neighbor Queries on GPUs  3,867 views

Real-Time Deformation of Subdivision Surfaces from Object Collisions  3,865 views

Fast and Efficient Lossless Image Compression Based on CUDA Parallel Wavelet Tree Encoding  3,864 views

A GPU accelerated algorithm for 3D Delaunay triangulation  3,863 views

3D Skeleton Extraction Method using Potential Field on OpenCL  3,862 views

Evaluating GPU Passthrough in Xen for High Performance Cloud Computing  3,857 views

Parallel Computing the Longest Common Subsequence (LCS) on GPUs: Efficiency and Language Suitability  3,856 views

Programming massively parallel processors : A Hands – on approach  3,854 views

GooFit: A library for massively parallelising maximum-likelihood fits  3,852 views

Performance Analysis of CUDA and OpenCL By Implementation of Cryptographic Algorithms  3,845 views

Delaunay Triangulation in R3 on the GPU  3,844 views

Best Practice Guide – Intel Xeon Phi  3,839 views

Implementation of Diamond Search Algorithm Using Parallel Processing Architecture  3,830 views

Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters  3,829 views

CUDArray: CUDA-based NumPy  3,826 views

Markov Chain Monte Carlo on the GPU  3,821 views

GPU Computing: Image Convolution  3,818 views

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour  3,814 views

Accelerating recurrent neural network training using sequence bucketing and multi-GPU data parallelization  3,810 views

Modular Arithmetic for Solving Linear Equations on the GPU  3,806 views

3D finite difference computation on GPUs using CUDA  3,805 views

CUD@ASP: Experimenting with GPUs in ASP solving  3,797 views

BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing  3,796 views

Heterogeneous Computing with OpenCL  3,792 views

Fast Morphological Image Processing on GPU using CUDA  3,790 views

Accelerating Computer Vision Algorithms Using OpenCL on Mobile GPU – A Case Study  3,786 views

Theano: A CPU and GPU Math Compiler in Python  3,780 views

Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA  3,779 views

Image Super-Resolution Using Deep Convolutional Networks  3,777 views

Real Time Face Detection on GPU Using OpenCL  3,776 views

Performance comparison of FPGA, GPU and CPU in image processing  3,773 views

Accelerating Fully Homomorphic Encryption Using GPU  3,771 views

2D and 3D level-set algorithms on GPU  3,771 views

Convolutional Neural Network for Sentence Classification  3,770 views

OpenCL Based High-Quality HEVC Motion Estimation on GPU  3,761 views

10×10: A General-purpose Architectural Approach to Heterogeneity and Energy Efficiency  3,757 views

GPU-Accelerated Scalable Solver for Banded Linear Systems  3,755 views

CUDA by Example: An Introduction to General-Purpose GPU Programming  3,753 views

GPU-Based Implementation of JPEG2000 Encoder  3,746 views

A Research of MapReduce with GPU Acceleration  3,745 views

Performance comparison of gauss-Jordan elimination method using OpenMP and CUDA  3,743 views

Ray Tracing in Real-Time Games  3,741 views

Numerical Simulation for the MHD System in 2D Using OpenCL  3,739 views

Accelerating Fast Fourier Transforms Using Hadoop and CUDA  3,738 views

Analysis and Review of Sorting Algorithms  3,729 views

Understanding Latency Hiding on GPUs  3,723 views

Two Approaches to Particle Simulation: OpenMPI and CUDA  3,722 views

SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters  3,720 views

OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures  3,718 views

General Purpose Computation on Graphics Processing Units Using OpenCL  3,715 views

Parallel Implementation of the Wu-Manber Algorithm Using the OpenCL Framework  3,706 views

A comparison between parallelization approaches in molecular dynamics simulations on GPUs  3,699 views

Real-Time GPU Path Tracing  3,696 views

Stackless KD-Tree Traversal for High Performance GPU Ray Tracing  3,695 views

Brute-Force k-Nearest Neighbors Search on the GPU  3,692 views

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising  3,689 views

CUDA Accelerated Face Recognition Using Local Binary Patterns  3,689 views

GPU Acceleration for the C++ Standard Template Library  3,687 views

GPU Fluid Simulation using Smoothed Particle Hydrodynamics  3,680 views

Solving 3D Anisotropic Elastic Wave Equations on Parallel GPU Devices  3,678 views

Computational Fluid Dynamics using OpenCL – a Practical Introduction  3,677 views

GPU Computing  3,676 views

Towards Predictable Real-Time Performance on Multi-Core Platforms  3,674 views

Programming Massively Parallel Processors with CUDA (audio course)  3,674 views

Implementing Open-Source CUDA Runtime  3,672 views

Grex: An efficient MapReduce framework for graphics processing units  3,672 views

Analysis of GPU accelerated OpenCL applications on the Intel HD 4600 GPU  3,671 views

Implementation of Keccak hash function in Tree hashing mode on Nvidia GPU  3,671 views

HadoopCL: MapReduce on Distributed Heterogeneous Platforms Through Seamless Integration of Hadoop and OpenCL  3,665 views

GPU vs FPGA: A Comparative Analysis for Non-standard Precision  3,651 views

Comparison of SPMV performance on matrices with different matrix format using CUSP, cuSPARSE and ViennaCL  3,648 views

Brook for GPUs: Stream Computing on Graphics Hardware  3,647 views

Towards On-Chip Optical FFTs for Convolutional Neural Networks  3,643 views

GPU Implementation of the Keccak Hash Function Family  3,639 views

Whippletree: Task-based Scheduling of Dynamic Workloads on the GPU  3,638 views

 

Brief statistics for this page

Titles: 100

Total views: 379163

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org