Views of posts on hgpu.org
OpenNMT: Open-Source Toolkit for Neural Machine Translation 2,972 views
Comparison of Technologies for General-Purpose Computing on Graphics Processing Units 2,972 views
Parallel Execution of AES-CTR Algorithm Using Extended Block Size 2,971 views
SiftCU: An Accelerated Cuda Based Implementation of SIFT 2,971 views
Input-Aware Auto-Tuning of Compute-Bound HPC Kernels 2,971 views
Computer Simulation of Dark Matter Effects on Galaxy Rotation 2,968 views
Darknet on OpenCL: a multi-platform tool for object detection and classification 2,968 views
Programming on Parallel Machines: GPU, Multicore, Clusters and More 2,967 views
GPU Performance Modeling and Optimization 2,965 views
CPU and/or GPU: Revisiting the GPU Vs. CPU Myth 2,964 views
Glow: Graph Lowering Compiler Techniques for Neural Networks 2,963 views
FlexGrip: A Soft GPGPU for FPGAs 2,963 views
OpenCL C++ 2,961 views
Parallel Catmull-Rom Spline Interpolation Algorithm for Image Zooming Based on CUDA 2,960 views
GPU Accelerated NIDS Search 2,960 views
Pseudorandom number generation on the GPU 2,960 views
Data Structures for Task-based Priority Scheduling 2,960 views
CUDA-enabled Optimisation of Technical Analysis Parameters 2,960 views
Scientific and Engineering Computing Using ATI Stream Technology 2,959 views
Parallel Computation of Non-Bonded Interactions in Drug Discovery: Nvidia GPUs vs. Intel Xeon Phi 2,958 views
GPU-Based Translation-Invariant 2D Discrete Wavelet Transform for Image Processing 2,956 views
Fast Gpu-Based Interpolation for SAR Backprojection 2,954 views
Implementing implicit OpenMP data sharing on GPUs 2,953 views
Energy-Efficient FPGA Implementation for Binomial Option Pricing Using OpenCL 2,953 views
Accelerating Financial Applications on the GPU 2,953 views
Implementation of Kirchhoff prestack depth migration on GPU 2,952 views
An evaluation of GPU acceleration for sparse reconstruction 2,952 views
Hierarchical Stochastic Motion Blur Rasterization 2,951 views
Autotuning Programs with Algorithmic Choice 2,950 views
gR: A GPU-based Router 2,949 views
An hybrid AES-256-GCM implementation for NEON CPU & CUDA GPU 2,948 views
Processing Posting Lists Using OpenCL 2,948 views
Performance Study of Satellite Image Processing on Graphics Processors Unit Using CUDA 2,947 views
Using many-core hardware to correlate radio astronomy signals 2,946 views
pyPaSWAS: Python-based multi-core CPU and GPU sequence alignment 2,946 views
Auto-Tuning of Level 1 and Level 2 BLAS for GPUs 2,945 views
A Scalable Lane Detection Algorithm on COTSs with OpenCL 2,945 views
CUDA 2D Stencil Computations for the Jacobi Method 2,945 views
Experience of parallelizing cryo-EM 3D reconstruction on a CPU-GPU heterogeneous system 2,944 views
Effective Multi-Modal Retrieval based on Stacked Auto-Encoders 2,943 views
Improving Cache Locality for GPU-based Volume Rendering 2,942 views
libCudaOptimize: an Open Source Library of GPU-based Metaheuristics 2,940 views
GPU Accelerated Lambert Solution Methods for the Orbital Targeting Problem 2,940 views
Fast and robust CAMShift tracking 2,940 views
REMODE: Probabilistic, Monocular Dense Reconstruction in Real Time 2,940 views
HIPAcc: A Domain-Specific Language and Compiler for Image Processing 2,939 views
High performance in silico virtual drug screening on many-core processors 2,938 views
ECM on Graphics Cards 2,938 views
GPU Programming in Functional Languages: A Comparison of Haskell GPU Embedded Domain Specific Languages 2,936 views
A Framework for General Sparse Matrix-Matrix Multiplication on GPUs and Heterogeneous Processors 2,936 views
Lossless LZW Data Compression Algorithm on CUDA 2,935 views
Revisiting the Case of ARM SoCs in High-Performance Computing Clusters 2,934 views
Efficient and Scalable k-Means on GPUs 2,933 views
rCUDA: Reducing the number of GPU-based accelerators in high performance clusters 2,933 views
Fast GPU-based fluid simulations using SPH 2,932 views
Fast Speaker Diarization Using a High-Level Scripting Language 2,930 views
A Fast and Efficient SIFT Detector Using the Mobile GPU 2,929 views
Hardware accelerators for biocomputing: A survey 2,928 views
Performance of FORTRAN and C GPU Extensions for a Benchmark Suite of Fourier Pseudospectral Algorithms 2,928 views
Beyond programmable shading (parts I and II) 2,928 views
A Micro-benchmark Suite for AMD GPUs 2,928 views
A two-fluid finite-volume solver based on OpenCL 2,928 views
GPU Gems 3 2,928 views
Investigation of GPU-based Pattern Matching 2,927 views
Exploiting Space and Time Coherence in Grid-based Sorting 2,927 views
Deep learning with COTS HPC systems 2,927 views
GPUWattch: Enabling Energy Optimizations in GPGPUs 2,926 views
A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising 2,925 views
Dynamic Parallelism in GPU Optimized Barnes Hut Trees for Molecular Dynamics Simulations 2,924 views
GPU-based high-performance computing for radiation therapy 2,924 views
OpenCL Performance Evaluation on Modern Multi Core CPUs 2,922 views
Hyper neural network on OpenCL 2,920 views
GPU-accelerated triangle-triangle intersection tester algorithm 2,919 views
GPU Parallel Implementation of the Approximate K-SVD Algorithm Using OpenCL 2,919 views
OpenCL-Accelerated Simplified General Perturbations 4 Algorithm 2,918 views
KUDA: GPU Accelerated Split Race Checker 2,917 views
Pipelined MapReduce: A Decoupled MapReduce RunTime for Shared Memory Multi-Processors 2,917 views
Efficient Multi-GPU Computation of All-Pairs Shortest Paths 2,917 views
The Plasma Simulation Code: A modern particle-in-cell code with load-balancing and GPU support 2,917 views
The VOLNA-OP2 Tsunami Code (Version 1.0) 2,917 views
GPU-Based Asynchronous Global Optimization with Particle Swarm 2,916 views
Accelerating In-Memory Graph Database traversal using GPGPUS 2,915 views
Acceleration Techniques for GPU-based Volume Rendering 2,914 views
Improving CUDA DNA Analysis Software with Genetic Programming 2,913 views
Maximal Information Coefficient Analysis 2,913 views
Compiler and runtime techniques for bulk-synchronous programming models on CPU architectures 2,913 views
Neural scene representation and rendering 2,913 views
Compiler Fuzzing through Deep Learning 2,912 views
Contract-Based General-Purpose GPU Programming 2,912 views
Automatic generation of CUDA code performing tensor manipulations using C++ expression templates 2,911 views
Direct evaluation of NURBS curves and surfaces on the GPU 2,910 views
2PARMA: Parallel Paradigms and Run-time Management Techniques for Many-Core Architectures 2,910 views
Fast and accurate digital signal processing realized with GPGPU technology 2,909 views
Efficient JPEG2000 EBCOT Context Modeling for Massively Parallel Architectures 2,909 views
Accelerating convolutions on the sphere with hybrid GPU/CPU kernel splitting 2,909 views
GPU-based password cracking 2,909 views
Titles: 100
Total views: 293809
- Programming - 186,232 views
- Login - 172,205 views
- User dashboard - 98,617 views
- Paper titles list - 93,041 views
- Add new event - 69,217 views
- Add new post - 62,826 views
- Register - 53,126 views
- Statistics - 44,273 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,523 views
- Books on OpenCL and CUDA - 31,181 views