Views of posts on hgpu.org
Parallel Sorting on the Heterogeneous AMD Fusion Accelerated Processing Unit 3,101 views
OpenCL Cryptographic Library 3,096 views
EIE: Efficient Inference Engine on Compressed Deep Neural Network 3,095 views
Efficient softmax approximation for GPUs 3,095 views
Datalog for GPUs 3,095 views
Bioinformatics Sequence Comparisons on Manycore Processors 3,093 views
Image Denoising Using Wavelet Transform and CUDA 3,091 views
CUDA Implementation of Parallel Algorithms for Animal Noseprint Identification 3,090 views
Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations 3,090 views
Accelerating IISPH: A Parallel GPGPU Solution Using CUDA 3,090 views
Real-time Ray tracing and Editing of Large Voxel Scenes 3,089 views
GPU implementation of JPEG2000 for hyperspectral image compression 3,089 views
Real-Time SAH BVH Construction for Ray Tracing Dynamic Scenes 3,089 views
CPU-GPU Algorithms for Triangular Surface Mesh Simplification 3,088 views
Cloth Simulation on the GPU 3,086 views
Bridging OpenCL and CUDA: A Comparative Analysis and Translation 3,086 views
A Fast Parallel Implementation of Queue-based Morphological Reconstruction using GPUs 3,085 views
Multi-Kepler GPU vs. Multi-Intel MIC for spin systems simulations 3,085 views
Multi-Tenant Virtual GPUs for Optimising Performance of a Financial Risk Application 3,084 views
Convex Clustering: An Attractive Alternative to Hierarchical Clustering 3,084 views
190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs 3,083 views
3D FFT on a Single FPGA 3,081 views
GPU Pro 7: Advanced Rendering 3,081 views
A Distributed Data Mining Framework Accelerated with Graphics Processing Units 3,079 views
Deep Feature-based Face Detection on Mobile Devices 3,079 views
Fast and Flexible: Parallel Packet Processing with GPUs and Click 3,078 views
Performance Analysis of Parallel Sorting Algorithms using GPU Computing 3,078 views
GPF: a framework for general packet classification on GPU co-processors 3,077 views
Synergia CUDA: GPU-accelerated accelerator modeling package 3,077 views
Evolution of thread-level parallelism in desktop applications 3,077 views
Auto-tunable GPU BLAS 3,076 views
Efficient Data Management for GPU Databases 3,075 views
BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism 3,075 views
Local Laplacian Filters: Edge-aware Image Processing with a Laplacian Pyramid 3,075 views
GPU Asynchronous Stochastic Gradient Descent to Speed Up Neural Network Training 3,074 views
3DES ECB Optimized for Massively Parallel CUDA GPU Architecture 3,073 views
Parallel hyperbolic PDE simulation on clusters: Cell versus GPU 3,073 views
Image Object Tracking System Using Parallel Mean Shift Algorithm 3,073 views
Qualcomm Snapdragon Mobile Platform OpenCL General Programming and Optimization 3,072 views
The design and verification of Mumax3 3,070 views
Warp-Level Divergence in GPUs: Characterization, Impact, and Mitigation 3,069 views
Multi-GPU Acceleration of Black-Scholes Equation based Option Pricing 3,069 views
AES and DES Encryption with GPU 3,069 views
Performance comparison of Lattice Boltzmann fluid flow simulation using OpenCL and CUDA frameworks 3,067 views
A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices 3,067 views
Using Shared Memory as a Cache in Cellular Automata Water Flow Simulations on GPUs 3,065 views
3D finite element numerical integration on GPUs 3,065 views
Numerical Computations with GPUs 3,065 views
GPU Implementation of the Particle Filter 3,064 views
Implementation of the genetic algorithm by means of CUDA technology involved in travelling salesman problem 3,064 views
A stand-alone Finite Difference Time Domain (FDTD) simulation for Integrated Optoelectronics Laboratory 3,064 views
Distributed-Shared CUDA: Virtualization of Large-Scale GPU Systems for Programmability and Reliability 3,063 views
Beauty And The Beast: Exploiting GPUs In Haskell 3,063 views
Efficient Canny Edge Detection Using a GPU 3,063 views
A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures 3,063 views
3D Haar-Like Elliptical Features for Object Classification in Microscopy 3,063 views
A GPU Approach to Fortran Legacy Systems 3,063 views
CUDA Based CAMshift Algorithm for Object Tracking Systems 3,062 views
Data Layout Oriented Compilation Techniques in Vectorization for Multi-/Many-cores 3,061 views
High Throughput Low Latency LDPC Decoding on GPU for SDR Systems 3,061 views
Finite Pointset Method for 2D Dam-Break Problem with GPU-Acceleration 3,059 views
A multi-lane traffic simulation model via continuous cellular automata 3,058 views
Poseidon: A System Architecture for Efficient GPU-based Deep Learning on Multiple Machines 3,058 views
3D Hydrodynamic Simulation of Classical Nova Explosions 3,057 views
A Unified Approach for Registration and Depth in Depth from Defocus 3,057 views
CUBPT: Lock-free bulk insertions to B+ tree on GPU architecture 3,056 views
cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on a GPU 3,055 views
Precomputed Atmospheric Scattering 3,054 views
Fluid Simulation: Smoothed Particle Hydrodynamics on the GPU 3,054 views
Deep Learning for Mortgage Risk 3,053 views
FastSpMM: An Efficient Library for Sparse Matrix Matrix Product on GPUs 3,052 views
Design and Implementation of the Futhark Programming Language 3,051 views
Real time Multi-GPU-based Event Detection in High Definition Videos 3,050 views
3D Edge Bundling for Geographical Data Visualization 3,050 views
GPU performance comparison for accelerated radar data processing 3,049 views
A Chunking Method for Euclidean Distance Matrix Calculation on Large Dataset Using Multi-GPU 3,049 views
High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster 3,049 views
GPUGI: Global Illumination Effects on the GPU 3,048 views
A case study on porting scientific applications to GPU/CUDA 3,048 views
A case for neuromorphic ISAs 3,047 views
Deep Learning in the Automotive Industry: Applications and Tools 3,047 views
VHF SAR image formation implemented on a GPU 3,047 views
Implementing the Approximate Message Passing (AMP) Algorithm on a GPU 3,045 views
Improving Performance Portability in OpenCL Programs 3,044 views
High performance pattern matching and data remanence on graphics processing units 3,043 views
Data-rich astronomy: mining synoptic sky surveys 3,042 views
Efficient Parallel Methods for Deep Reinforcement Learning 3,041 views
LeFlow: Enabling Flexible FPGA High-Level Synthesis of Tensorflow Deep Neural Networks 3,041 views
A GPU-Based Wide-Band Radio Spectrometer 3,041 views
GPUburn: A System to Test and Mitigate GPU Hardware Failures 3,041 views
Implementing Strassen’s Algorithm with CUTLASS on NVIDIA Volta GPUs 3,038 views
A simple GPU-based approach for 3D Voronoi diagram construction and visualization 3,038 views
Hardware-Accelerated Raycasting: Towards an Effective Brain MRI Visualization 3,038 views
GPU Sparse Matrix Multiplication with CUDA 3,037 views
Titles: 100
Total views: 306721
- Programming - 186,232 views
- Login - 172,159 views
- User dashboard - 98,600 views
- Paper titles list - 92,956 views
- Add new event - 69,211 views
- Add new post - 62,808 views
- Register - 53,111 views
- Statistics - 44,259 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,522 views
- Books on OpenCL and CUDA - 31,171 views