Views of posts on hgpu.org
Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7 2,766 views
ZUCL: A ZYNQ UltraScale+ Framework for OpenCL HLS Applications 2,764 views
GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs 2,761 views
Real-Time Pedestrian Detection With Deep Networks Cascades 2,760 views
Embedding GPU Computations in Hadoop 2,758 views
Parallel AES Encryption Engines for Many-Core Processor Arrays 2,757 views
Real root isolation for univariate polynomials on GPUs and multicores 2,757 views
The GENGA Code: Gravitational Encounters in N-body simulations with GPU Acceleration 2,757 views
OpenCL Programming Guide for Mac 2,756 views
Software Defined Radio over CUDA 2,756 views
Efficient Knowledge Extraction from Structured Data 2,755 views
GPU implementation of neural networks 2,754 views
Fast Exact String Matching on the GPU 2,753 views
BitCracker: BitLocker meets GPUs 2,753 views
A Co-Prime Blur Scheme for Data Security in Video Surveillance 2,753 views
BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 2,753 views
Multi2Sim: a simulation framework for CPU-GPU computing 2,751 views
Parallel processing for SAR image generation in CUDA – GPGPU platform 2,750 views
GALAMOST: GPU-accelerated large-scale molecular simulation toolkit 2,750 views
Real-Time SAH BVH Construction for Ray Tracing Dynamic Scenes 2,749 views
Parallel Implementation of Finite Element Codes using CUDA 2,749 views
3D Objects Tracking by GPGPU-Enhanced Particle Filter Algorithms 2,748 views
AQUAgpusph, a free 3D SPH solver accelerated with OpenCL 2,745 views
Dissecting the NVIDIA Volta GPU Architecture via Microbenchmarking 2,745 views
Datalog for GPUs 2,745 views
Saddle Vertex Graph (SVG): A Novel Solution to the Discrete Geodesic Problem 2,744 views
GPU-Optimized Coarse-Grained MD Simulations of Protein and RNA Folding and Assembly 2,743 views
Neon: A Domain-Specific Programming Language for Image Processing 2,742 views
AES on GPU: a CUDA Implementation 2,742 views
CU2CL: A CUDA-to-OpenCL Translator for Multi-and Many-core Architectures 2,741 views
General purpose computing on graphics processing units using OpenCL 2,740 views
A Fast Parallel Implementation of Queue-based Morphological Reconstruction using GPUs 2,740 views
GPU-BLAST: Using graphics processors to accelerate protein sequence alignment 2,740 views
Speeding-up Pearson Correlation Coefficient calculation on graphical processing units 2,738 views
Fast K-selection Algorithms for Graphics Processing Units 2,737 views
Performance Upper Bound Analysis and Optimization of SGEMM on Fermi and Kepler GPUs 2,734 views
Computing Performance Benchmarks among CPU, GPU, and FPGA 2,733 views
A GPU-based Affine and Scale Invariant Feature Transform Algorithm 2,732 views
Visualization and GPU-accelerated simulation of medical ultrasound from CT images 2,731 views
Efficient Integral Image Computation on the GPU 2,731 views
A Distributed Data Mining Framework Accelerated with Graphics Processing Units 2,729 views
One weird trick for parallelizing convolutional neural networks 2,728 views
High-accuracy Optimization by Parallel Iterative Discrete Approximation and Multi-GPU Computing 2,728 views
DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices 2,728 views
Dissecting the NVidia Turing T4 GPU via Microbenchmarking 2,725 views
Implementing the Approximate Message Passing (AMP) Algorithm on a GPU 2,723 views
Moim: A Multi-GPU MapReduce Framework 2,723 views
OpenCL Acceleration for TensorFlow 2,723 views
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications 2,722 views
Acceleration of AES encryption on CUDA GPU 2,722 views
An Efficient Implementation of Double Precision 1-D FFT for GPUs Using CUDA 2,722 views
A Unified Approach for Registration and Depth in Depth from Defocus 2,722 views
An implicit Tensor-Mass solver on the GPU for soft bodies simulation 2,721 views
Accelerating Java on Embedded GPU 2,721 views
Collision detection on the GPU 2,719 views
Efficient Canny Edge Detection Using a GPU 2,719 views
A Braille Conversion Service Using GPU and Human Interaction by Computer Vision 2,719 views
GPU-Based Sparse Voxel Octree Raytracing for Rendering of Procedurally Generated Terrain 2,719 views
TVM: End-to-End Optimization Stack for Deep Learning 2,719 views
3D Object Recognition with Convolutional Neural Networks 2,718 views
Beauty And The Beast: Exploiting GPUs In Haskell 2,718 views
GPU Path Tracing 2,718 views
Cost Efficient PageRank Computation using GPU 2,717 views
MuMax: a new high-performance micromagnetic simulation tool 2,717 views
Multi-Kepler GPU vs. Multi-Intel MIC for spin systems simulations 2,715 views
The Scalable Heterogeneous Computing (SHOC) benchmark suite 2,715 views
2D Triangulation of Polygons on CUDA 2,715 views
Distributed-Shared CUDA: Virtualization of Large-Scale GPU Systems for Programmability and Reliability 2,714 views
A Multi-View Stereo Implementation on Massively Parallel Hardware 2,714 views
A design case study: CPU vs. GPGPU vs. FPGA 2,712 views
A multi-lane traffic simulation model via continuous cellular automata 2,711 views
Doctor AI: Interpretable Deep Learning for Modeling Electronic Health Records 2,711 views
Implementations of the FFT algorithm on GPU 2,711 views
Bioinformatics Sequence Comparisons on Manycore Processors 2,710 views
CUBPT: Lock-free bulk insertions to B+ tree on GPU architecture 2,709 views
Speed up Large Integer Multiplication Using Fourier Transforms and CUDA Technology 2,709 views
CUDA Implementation of Parallel Algorithms for Animal Noseprint Identification 2,708 views
Real-time Ray tracing and Editing of Large Voxel Scenes 2,707 views
A GPU-Based Transient Stability Simulation Using Runge-Kutta Integration Algorithm 2,707 views
Image Object Tracking System Using Parallel Mean Shift Algorithm 2,706 views
CUDA Parallel Algorithms for Forward and Inverse Structural Gravity Problems 2,703 views
CPU-GPU Algorithms for Triangular Surface Mesh Simplification 2,702 views
GPU implementation of JPEG2000 for hyperspectral image compression 2,701 views
Bridging OpenCL and CUDA: A Comparative Analysis and Translation 2,700 views
Accelerated Deep Learning using Intel Xeon Phi 2,700 views
High-performance Dynamic Programming on FPGAs with OpenCL 2,699 views
NMF-mGPU: non-negative matrix factorization on multi-GPU systems 2,699 views
EIE: Efficient Inference Engine on Compressed Deep Neural Network 2,699 views
A New Compilation Path: From Python/NumPy to OpenCL 2,699 views
MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs 2,697 views
Efficient, High-Quality Bayer Demosaic Filtering on GPUs 2,695 views
OpenCL/CUDA algorithms for parallel decoding of any irregular LDPC code using GPU 2,695 views
A Region Growing Segmentation Algorithm for GPUs 2,695 views
Parallel SAT-Solving with OpenCL 2,694 views
Parallel Sorting on the Heterogeneous AMD Fusion Accelerated Processing Unit 2,694 views
Titles: 100
Total views: 272718
- Programming - 186,131 views
- Login - 164,415 views
- User dashboard - 90,770 views
- Paper titles list - 70,170 views
- Add new event - 64,600 views
- Add new post - 59,382 views
- Register - 49,237 views
- Statistics - 36,641 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,167 views
- Books on OpenCL and CUDA - 28,827 views