Views of posts on hgpu.org
Unsupervised Asset Cluster Analysis Implemented with Parallel Genetic Algorithms on the NVIDIA CUDA Platform 3,188 views
An efficient solution for hazardous geophysical flows simulation using GPUs 3,187 views
A Braille Conversion Service Using GPU and Human Interaction by Computer Vision 3,186 views
Efficient Model-based 3D Tracking of Hand Articulations using Kinect 3,184 views
Parallel processing for SAR image generation in CUDA – GPGPU platform 3,184 views
GPU Implementations of Object Detection using HOG Features and Deformable Models 3,183 views
Image registration on GPU 3,175 views
Parallelizing Word2Vec in Shared and Distributed Memory 3,175 views
Real root isolation for univariate polynomials on GPUs and multicores 3,175 views
OpenCL Programming Guide for Mac 3,174 views
Integer sorting on multicores: some (experiments and) observations 3,174 views
Multi2Sim: a simulation framework for CPU-GPU computing 3,173 views
Fast High-Quality Volume Ray Casting with Virtual Samplings 3,172 views
MCS 572: Introduction to Supercomputing 3,171 views
Fast K-selection Algorithms for Graphics Processing Units 3,170 views
OpenCL Performance Prediction using Architecture-Independent Features 3,170 views
Implementing Deep Neural Networks for Financial Market Prediction on the Intel Xeon Phi 3,169 views
An implicit Tensor-Mass solver on the GPU for soft bodies simulation 3,169 views
A Co-Prime Blur Scheme for Data Security in Video Surveillance 3,168 views
MuMax: a new high-performance micromagnetic simulation tool 3,167 views
Fast Exact String Matching on the GPU 3,167 views
Realtime Computation of a VST Audio Effect Plugin on the Graphics Processor 3,167 views
XBOOLE-CUDA: Fast Boolean Operations on the GPU 3,165 views
Programming CUDA and OpenCL: A Case Study Using Modern C++ Libraries 3,164 views
Using OpenCL to Implement Median Filtering and RSA Algorithms: Two GPGPU Application Case Studies 3,164 views
Auto-tuning a High-Level Language Targeted to GPU Codes 3,163 views
CU2CL: A CUDA-to-OpenCL Translator for Multi- and Many-core Architectures 3,163 views
GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs 3,162 views
Parallel SAT-Solving with OpenCL 3,161 views
High-accuracy Optimization by Parallel Iterative Discrete Approximation and Multi-GPU Computing 3,159 views
A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures 3,158 views
OpenCL Acceleration for TensorFlow 3,158 views
Visualization and GPU-accelerated simulation of medical ultrasound from CT images 3,158 views
DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices 3,157 views
BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1 3,156 views
Dissecting the NVidia Turing T4 GPU via Microbenchmarking 3,156 views
2D Triangulation of Polygons on CUDA 3,156 views
Neon: A Domain-Specific Programming Language for Image Processing 3,155 views
GPU based particle system 3,154 views
The MOSIX Virtual OpenCL (VCL) Cluster Platform 3,153 views
Computing Performance Benchmarks among CPU, GPU, and FPGA 3,152 views
G-SNPM – A GPU-based SNP mapping tool 3,152 views
OCCA: A unified approach to multi-threading languages 3,151 views
Accelerating Java on Embedded GPU 3,150 views
Shared Memory Multiplexing: A Novel Way to Improve GPGPU Throughput 3,148 views
Deep convolutional networks for pancreas segmentation in CT imaging 3,148 views
Efficient, High-Quality Bayer Demosaic Filtering on GPUs 3,148 views
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA 3,147 views
NAS Parallel Benchmarks for GPGPUs using a Directive-based Programming Model 3,146 views
2HOT: An Improved Parallel Hashed Oct-Tree N-Body Algorithm for Cosmological Simulation 3,145 views
GPU-accelerated HMM for Speech Recognition 3,145 views
TVM: End-to-End Optimization Stack for Deep Learning 3,141 views
Fast Mersenne prime testing on the GPU 3,141 views
NMF-mGPU: non-negative matrix factorization on multi-GPU systems 3,141 views
Software Defined Radio over CUDA 3,140 views
Saddle Vertex Graph (SVG): A Novel Solution to the Discrete Geodesic Problem 3,139 views
Cost Efficient PageRank Computation using GPU 3,138 views
Billion-scale similarity search with GPUs 3,136 views
The GENGA Code: Gravitational Encounters in N-body simulations with GPU Acceleration 3,136 views
MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs 3,134 views
One weird trick for parallelizing convolutional neural networks 3,134 views
Performance Upper Bound Analysis and Optimization of SGEMM on Fermi and Kepler GPUs 3,133 views
A Performance Comparison of CUDA and OpenCL 3,133 views
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications 3,133 views
An Introduction to OpenCL C++ 3,132 views
Parallel Implementation of Finite Element Codes using CUDA 3,131 views
String Matching on a Multicore GPU Using CUDA 3,131 views
Speeding-up Pearson Correlation Coefficient calculation on graphical processing units 3,129 views
Speed up Large Integer Multiplication Using Fourier Transforms and CUDA Technology 3,128 views
Moim: A Multi-GPU MapReduce Framework 3,126 views
Acceleration of CFD and data analysis using graphics processors 3,126 views
AQUAgpusph, a free 3D SPH solver accelerated with OpenCL 3,126 views
An Efficient Implementation of Double Precision 1-D FFT for GPUs Using CUDA 3,124 views
Ray Tracing on GPUs 3,124 views
3D Recursive Gaussian IIR on GPU and FPGAs: A Case Study for Accelerating Bandwidth-Bounded Applications 3,124 views
A New Compilation Path: From Python/NumPy to OpenCL 3,124 views
fastHOG – a real-time GPU implementation of HOG 3,120 views
3D Object Recognition with Convolutional Neural Networks 3,119 views
GPU-Based Sparse Voxel Octree Raytracing for Rendering of Procedurally Generated Terrain 3,117 views
A Region Growing Segmentation Algorithm for GPUs 3,117 views
Embedding GPU Computations in Hadoop 3,116 views
Collision detection on the GPU 3,115 views
CUDA Parallel Algorithms for Forward and Inverse Structural Gravity Problems 3,113 views
Real-Time Pedestrian Detection With Deep Networks Cascades 3,112 views
Efficient Integral Image Computation on the GPU 3,110 views
GPU Path Tracing 3,110 views
Parallel Implementations of the Cholesky Decomposition on CPUs and GPUs 3,109 views
A Multi-View Stereo Implementation on Massively Parallel Hardware 3,109 views
A GPU-Based Transient Stability Simulation Using Runge-Kutta Integration Algorithm 3,108 views
Implementations of the FFT algorithm on GPU 3,107 views
GPGPU Performance Estimation with Core and Memory Frequency Scaling 3,107 views
A GPU-based Affine and Scale Invariant Feature Transform Algorithm 3,106 views
Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7 3,105 views
OpenVIDIA: parallel GPU computer vision 3,104 views
High-performance Dynamic Programming on FPGAs with OpenCL 3,104 views
Accelerated Deep Learning using Intel Xeon Phi 3,103 views
A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads 3,103 views
Titles: 100
Total views: 314,486
- Programming - 186,232 views
- Login - 172,156 views
- User dashboard - 98,600 views
- Paper titles list - 92,956 views
- Add new event - 69,211 views
- Add new post - 62,808 views
- Register - 53,111 views
- Statistics - 44,259 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,522 views
- Books on OpenCL and CUDA - 31,171 views