Views of posts on hgpu.org
An Introduction to the OpenCL Programming Model 2,878 views
GPU Accelerated Conjunction Assessment with Applications to Formation Flight and Space Debris Tracking 2,878 views
Fast Hydraulic and Thermal Erosion on GPU 2,878 views
FLASH: Randomized Algorithms Accelerated over CPU-GPU for Ultra-High Dimensional Similarity Search 2,877 views
Parallel Hashing, Compression and Encryption with OpenCL under OS X 2,877 views
Load-Balanced Multi-GPU Ambient Occlusion for Direct Volume Rendering 2,877 views
OCLoptimizer: An Iterative Optimization Tool for OpenCL 2,876 views
A Survey of Techniques For Improving Energy Efficiency in Embedded Computing Systems 2,874 views
GPU Accelerated Keccak (SHA3) Algorithm 2,873 views
PROST: Parallel robust online simple tracking 2,870 views
Importance of Explicit Vectorization for CPU and GPU Software Performance 2,868 views
Fast BVH Construction on GPUs 2,867 views
Run-time Image and Video Resizing Using CUDA-enabled GPUs 2,867 views
Deep Dynamic Neural Networks for Gesture Segmentation and Recognition 2,867 views
FuzzyGPU: a fuzzy arithmetic library for GPU 2,862 views
Scaling Deep Learning on Multiple In-Memory Processors 2,861 views
Fast algorithm of ray tracing based on KD-tree structure 2,860 views
Fast Algorithms for Convolutional Neural Networks 2,858 views
Wilson and Domainwall Kernels on Oakforest-PACS 2,853 views
Generating Custom Code for Efficient Query Execution on Heterogeneous Processors 2,851 views
An Approach to Efficient FEM Simulations on Graphics Processing Units Using CUDA 2,846 views
Real-time Flame Rendering with GPU and CUDA 2,844 views
Efficient Model-based 3D Tracking of Hand Articulations using Kinect 2,844 views
Data access optimized applications on the GPU using NVIDIA CUDA 2,842 views
Theano: A Python framework for fast computation of mathematical expressions 2,842 views
A Single (Unified) Shader GPU Microarchitecture for Embedded Systems 2,840 views
GASPP: A GPU-Accelerated Stateful Packet Processing Framework 2,838 views
A Novel Open Source Morphology Using GPU Processing With LTU-CUDA 2,838 views
SOAP3-dp: Fast, Accurate and Sensitive GPU-based Short Read Aligner 2,836 views
Fingerprint Local Invariant Feature Extraction on GPU with CUDA 2,834 views
P-HGRMS: A Parallel Hypergraph Based Root Mean Square Algorithm for Image Denoising 2,834 views
CFD Simulation of Jet Cooling and Implementation of Flow Solvers in GPU 2,834 views
Circular Hough Transform in OpenCL 2,834 views
Performance Analysis of GPU-based SAR and Interferometric SAR image processing 2,834 views
GPU Accelerated Molecular Dynamics Simulation, Visualization, and Analysis 2,831 views
The Yin and Yang of Processing Data Warehousing Queries on GPU Devices 2,831 views
An Incompressible Navier-Stokes Equations Solver on the GPU Using CUDA 2,830 views
CUDA Fortran for Scientists and Engineers 2,829 views
Real-Time Concurrent Linked List Construction on the GPU 2,825 views
A GPU Accelerated Navier-Stokes Solver with Multi-level Granularity for Solving Sparse Implicit Systems 2,822 views
GPU Implementations of Object Detection using HOG Features and Deformable Models 2,820 views
Genetic Algorithm Modeling with GPU Parallel Computing Technology 2,820 views
Development of a GPU-accelerated MIKE 21 Solver for Water Wave Dynamics 2,819 views
GPU-accelerated HMM for Speech Recognition 2,818 views
The Future of Accelerator Programming: Abstraction, Performance or Can We Have Both? 2,818 views
Collision Detection of Triangle Meshes using GPU 2,816 views
Fast High-Quality Volume Ray Casting with Virtual Samplings 2,816 views
Optimization Techniques on GPU: A Survey 2,812 views
3D Non-Local Means denoising via multi-GPU 2,812 views
Realtime Computation of a VST Audio Effect Plugin on the Graphics Processor 2,812 views
GPU-based cellular automata simulations of laser dynamics 2,811 views
Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs 2,810 views
pocl: A Performance-Portable OpenCL Implementation 2,807 views
Image registration on GPU 2,806 views
A Complete Descritpion of the UnPython and Jit4GPU Framework 2,804 views
High Performance Programming for Soft Computing 2,802 views
Towards GPGPU Assisted Computing in Virtualized Environments 2,802 views
Performance of OpenCL 2,802 views
OpenCL Library for Parallel Graph Search Algorithms 2,801 views
Auto-tuning a High-Level Language Targeted to GPU Codes 2,801 views
Shared Memory Multiplexing: A Novel Way to Improve GPGPU Throughput 2,801 views
PIConGPU: A Fully Relativistic Particle-in-Cell Code for a GPU Cluster 2,801 views
24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs 2,800 views
Decompilation of LLVM IR 2,800 views
A Static Load Balancing Scheme for Parallel Volume Rendering on Multi-GPU Clusters 2,799 views
PacketShader: a GPU-accelerated software router 2,798 views
Graph Processing on GPU 2,795 views
An efficient solution for hazardous geophysical flows simulation using GPUs 2,794 views
An OpenCL(TM) Deep Learning Accelerator on Arria 10 2,794 views
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems 2,793 views
GPU accelerating the FEniCS Project 2,792 views
GPU Accelerated Pattern Matching Algorithm for DNA Sequences to Detect Cancer using CUDA 2,790 views
Speeding Up Reinforcement Learning with Graphics Processing Units 2,790 views
GAMUT: GPU accelerated microRNA analysis to uncover target genes through CUDA-miRanda 2,789 views
OCCA: A unified approach to multi-threading languages 2,789 views
GPU-based Parallel Computation Support for Stan 2,787 views
NAS Parallel Benchmarks for GPGPUs using a Directive-based Programming Model 2,787 views
Parallelizing Word2Vec in Shared and Distributed Memory 2,786 views
Minimal models for finite particles in fluctuating hydrodynamics 2,786 views
Programming CUDA and OpenCL: A Case Study Using Modern C++ Libraries 2,784 views
Integer sorting on multicores: some (experiments and) observations 2,782 views
Multicore bundle adjustment 2,781 views
Interactive Wave Simulations 2,779 views
GPU based particle system 2,779 views
A dynamically configurable coprocessor for convolutional neural networks 2,777 views
Implementing Deep Neural Networks for Financial Market Prediction on the Intel Xeon Phi 2,777 views
GPU Based Acceleration of Telegraph Equation 2,777 views
Cross-Compiling Shading Languages 2,776 views
XBOOLE-CUDA: Fast Boolean Operations on the GPU 2,775 views
Using OpenCL to Implement Median Filtering and RSA Algorithms: Two GPGPU Application Case Studies 2,775 views
Parallel execution of a parameter sweep for molecular dynamics simulations in a hybrid GPU/CPU environment 2,775 views
The MOSIX Virtual OpenCL (VCL) Cluster Platform 2,774 views
Sparse LU Factorization for Parallel Circuit Simulation on GPU 2,770 views
Optimization of Spatial Convolution in ConvNets on Intel KNL 2,769 views
OpenCL-Z Android Released on Google Play 2,767 views
Workload Analysis and Efficient OpenCL-based Implementation of SIFT Algorithm on a Smartphone 2,767 views
Hardware Implementation and Quantization of Tiny-Yolo-v2 using OpenCL 2,765 views
A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures 2,765 views
Titles: 100
Total views: 281792
- Programming - 186,129 views
- Login - 164,346 views
- User dashboard - 90,582 views
- Paper titles list - 69,999 views
- Add new event - 64,575 views
- Add new post - 59,314 views
- Register - 49,174 views
- Statistics - 36,458 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,165 views
- Books on OpenCL and CUDA - 28,806 views