2402

Views of posts on hgpu.org

Unsupervised Asset Cluster Analysis Implemented with Parallel Genetic Algorithms on the NVIDIA CUDA Platform  3,188 views

Distributed multi-node, multi-GPU, heterogeneous system for 3D image reconstruction in Electrical Capacitance Tomography – network performance and application analysis  3,188 views

An efficient solution for hazardous geophysical flows simulation using GPUs  3,187 views

DeepLearningKit – an GPU Optimized Deep Learning Framework for Apple’s iOS, OS X and tvOS developed in Metal and Swift  3,186 views

A Braille Conversion Service Using GPU and Human Interaction by Computer Vision  3,186 views

Efficient Model-based 3D Tracking of Hand Articulations using Kinect  3,184 views

Parallel processing for SAR image generation in CUDA – GPGPU platform  3,184 views

GPU Implementations of Object Detection using HOG Features and Deformable Models  3,183 views

Image registration on GPU  3,175 views

Parallelizing Word2Vec in Shared and Distributed Memory  3,175 views

Real root isolation for univariate polynomials on GPUs and multicores  3,175 views

OpenCL Programming Guide for Mac  3,174 views

Integer sorting on multicores: some (experiments and) observations  3,174 views

Multi2Sim: a simulation framework for CPU-GPU computing  3,173 views

Fast High-Quality Volume Ray Casting with Virtual Samplings  3,172 views

MCS 572: Introduction to Supercomputing  3,171 views

Fast K-selection Algorithms for Graphics Processing Units  3,170 views

OpenCL Performance Prediction using Architecture-Independent Features  3,170 views

Implementing Deep Neural Networks for Financial Market Prediction on the Intel Xeon Phi  3,169 views

An implicit Tensor-Mass solver on the GPU for soft bodies simulation  3,169 views

A Co-Prime Blur Scheme for Data Security in Video Surveillance  3,168 views

MuMax: a new high-performance micromagnetic simulation tool  3,167 views

Fast Exact String Matching on the GPU  3,167 views

Realtime Computation of a VST Audio Effect Plugin on the Graphics Processor  3,167 views

XBOOLE-CUDA: Fast Boolean Operations on the GPU  3,165 views

Programming CUDA and OpenCL: A Case Study Using Modern C++ Libraries  3,164 views

Using OpenCL to Implement Median Filtering and RSA Algorithms: Two GPGPU Application Case Studies  3,164 views

Auto-tuning a High-Level Language Targeted to GPU Codes  3,163 views

CU2CL: A CUDA-to-OpenCL Translator for Multi-and Many-core Architectures  3,163 views

GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs  3,162 views

Parallel SAT-Solving with OpenCL  3,161 views

High-accuracy Optimization by Parallel Iterative Discrete Approximation and Multi-GPU Computing  3,159 views

A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures  3,158 views

OpenCL Acceleration for TensorFlow  3,158 views

Visualization and GPU-accelerated simulation of medical ultrasound from CT images  3,158 views

DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices  3,157 views

BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1  3,156 views

Dissecting the NVidia Turing T4 GPU via Microbenchmarking  3,156 views

2D Triangulation of Polygons on CUDA  3,156 views

Neon: A Domain-Specific Programming Language for Image Processing  3,155 views

GPU based particle system  3,154 views

The MOSIX Virtual OpenCL (VCL) Cluster Platform  3,153 views

Computing Performance Benchmarks among CPU, GPU, and FPGA  3,152 views

High Performance Streaming Smith-Waterman Implementation with Implicit Synchronization on Intel FPGA using OpenCL  3,152 views

G-SNPM – A GPU-based SNP mapping tool  3,152 views

OCCA: A unified approach to multi-threading languages  3,151 views

Accelerating Java on Embedded GPU  3,150 views

Shared Memory Multiplexing: A Novel Way to Improve GPGPU Throughput  3,148 views

Deep convolutional networks for pancreas segmentation in CT imaging  3,148 views

Efficient, High-Quality Bayer Demosaic Filtering on GPUs  3,148 views

Optimization principles and application performance evaluation of a multithreaded GPU using CUDA  3,147 views

NAS Parallel Benchmarks for GPGPUs using a Directive-based Programming Model  3,146 views

2HOT: An Improved Parallel Hashed Oct-Tree N-Body Algorithm for Cosmological Simulation  3,145 views

GPU-accelerated HMM for Speech Recognition  3,145 views

TVM: End-to-End Optimization Stack for Deep Learning  3,141 views

Fast Mersenne prime testing on the GPU  3,141 views

NMF-mGPU: non-negative matrix factorization on multi-GPU systems  3,141 views

Software Defined Radio over CUDA  3,140 views

Saddle Vertex Graph (SVG): A Novel Solution to the Discrete Geodesic Problem  3,139 views

Cost Efficient PageRank Computation using GPU  3,138 views

Billion-scale similarity search with GPUs  3,136 views

The GENGA Code: Gravitational Encounters in N-body simulations with GPU Acceleration  3,136 views

MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs  3,134 views

One weird trick for parallelizing convolutional neural networks  3,134 views

Performance Upper Bound Analysis and Optimization of SGEMM on Fermi and Kepler GPUs  3,133 views

A Performance Comparison of CUDA and OpenCL  3,133 views

Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications  3,133 views

An Introduction to OpenCL C++  3,132 views

Parallel Implementation of Finite Element Codes using CUDA  3,131 views

String Matching on a Multicore GPU Using CUDA  3,131 views

Speeding-up Pearson Correlation Coefficient calculation on graphical processing units  3,129 views

Speed up Large Integer Multiplication Using Fourier Transforms and CUDA Technology  3,128 views

Moim: A Multi-GPU MapReduce Framework  3,126 views

Acceleration of CFD and data analysis using graphics processors  3,126 views

AQUAgpusph, a free 3D SPH solver accelerated with OpenCL  3,126 views

An Efficient Implementation of Double Precision 1-D FFT for GPUs Using CUDA  3,124 views

Ray Tracing on GPUs  3,124 views

3D Recursive Gaussian IIR on GPU and FPGAs: A Case Study for Accelerating Bandwidth-Bounded Applications  3,124 views

A New Compilation Path: From Python/NumPy to OpenCL  3,124 views

fastHOG – a real-time GPU implementation of HOG  3,120 views

3D Object Recognition with Convolutional Neural Networks  3,119 views

GPU-Based Sparse Voxel Octree Raytracing for Rendering of Procedurally Generated Terrain  3,117 views

A Region Growing Segmentation Algorithm for GPUs  3,117 views

Embedding GPU Computations in Hadoop  3,116 views

Collision detection on the GPU  3,115 views

CUDA Parallel Algorithms for Forward and Inverse Structural Gravity Problems  3,113 views

Real-Time Pedestrian Detection With Deep Networks Cascades  3,112 views

Efficient Integral Image Computation on the GPU  3,110 views

GPU Path Tracing  3,110 views

Parallel Implementations of the Cholesky Decomposition on CPUs and GPUs  3,109 views

A Multi-View Stereo Implementation on Massively Parallel Hardware  3,109 views

A GPU-Based Transient Stability Simulation Using Runge-Kutta Integration Algorithm  3,108 views

Implementations of the FFT algorithm on GPU  3,107 views

GPGPU Performance Estimation with Core and Memory Frequency Scaling  3,107 views

A GPU-based Affine and Scale Invariant Feature Transform Algorithm  3,106 views

Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7  3,105 views

OpenVIDIA: parallel GPU computer vision  3,104 views

High-performance Dynamic Programming on FPGAs with OpenCL  3,104 views

Accelerated Deep Learning using Intel Xeon Phi  3,103 views

A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads  3,103 views

 

Brief statistics for this page

Titles: 100

Total views: 314486

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org