
Views of posts on hgpu.org

OpenNMT: Open-Source Toolkit for Neural Machine Translation  2,972 views

Comparison of Technologies for General-Purpose Computing on Graphics Processing Units  2,972 views

Parallel Execution of AES-CTR Algorithm Using Extended Block Size  2,971 views

SiftCU: An Accelerated Cuda Based Implementation of SIFT  2,971 views

Input-Aware Auto-Tuning of Compute-Bound HPC Kernels  2,971 views

Deep Learning for Computer Vision: A comparison between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks  2,969 views

Computer Simulation of Dark Matter Effects on Galaxy Rotation  2,968 views

Darknet on OpenCL: a multi-platform tool for object detection and classification  2,968 views

Programming on Parallel Machines: GPU, Multicore, Clusters and More  2,967 views

GPU Performance Modeling and Optimization  2,965 views

CPU and/or GPU: Revisiting the GPU Vs. CPU Myth  2,964 views

Glow: Graph Lowering Compiler Techniques for Neural Networks  2,963 views

FlexGrip: A Soft GPGPU for FPGAs  2,963 views

OpenCL C++  2,961 views

Parallel Catmull-Rom Spline Interpolation Algorithm for Image Zooming Based on CUDA  2,960 views

GPU Accelerated NIDS Search  2,960 views

Pseudorandom number generation on the GPU  2,960 views

Data Structures for Task-based Priority Scheduling  2,960 views

CUDA-enabled Optimisation of Technical Analysis Parameters  2,960 views

Scientific and Engineering Computing Using ATI Stream Technology  2,959 views

Parallel Computation of Non-Bonded Interactions in Drug Discovery: Nvidia GPUs vs. Intel Xeon Phi  2,958 views

GPU-Based Translation-Invariant 2D Discrete Wavelet Transform for Image Processing  2,956 views

An open source MATLAB program for fast numerical Feynman integral calculations for open quantum system dynamics on GPUs  2,956 views

Fast Gpu-Based Interpolation for SAR Backprojection  2,954 views

Implementing implicit OpenMP data sharing on GPUs  2,953 views

Energy-Efficient FPGA Implementation for Binomial Option Pricing Using OpenCL  2,953 views

Accelerating Financial Applications on the GPU  2,953 views

Implementation of Kirchhoff prestack depth migration on GPU  2,952 views

An evaluation of GPU acceleration for sparse reconstruction  2,952 views

Hierarchical Stochastic Motion Blur Rasterization  2,951 views

Efficient Quicksort and 2D Convex Hull for CUDA, and MSIMD as a Realistic Model of Massively Parallel Computations  2,951 views

Autotuning Programs with Algorithmic Choice  2,950 views

gR: A GPU-based Router  2,949 views

An hybrid AES-256-GCM implementation for NEON CPU & CUDA GPU  2,948 views

Processing Posting Lists Using OpenCL  2,948 views

Performance Study of Satellite Image Processing on Graphics Processors Unit Using CUDA  2,947 views

Using many-core hardware to correlate radio astronomy signals  2,946 views

pyPaSWAS: Python-based multi-core CPU and GPU sequence alignment  2,946 views

Auto-Tuning of Level 1 and Level 2 BLAS for GPUs  2,945 views

A Scalable Lane Detection Algorithm on COTSs with OpenCL  2,945 views

CUDA 2D Stencil Computations for the Jacobi Method  2,945 views

Experience of parallelizing cryo-EM 3D reconstruction on a CPU-GPU heterogeneous system  2,944 views

Effective Multi-Modal Retrieval based on Stacked Auto-Encoders  2,943 views

Improving Cache Locality for GPU-based Volume Rendering  2,942 views

libCudaOptimize: an Open Source Library of GPU-based Metaheuristics  2,940 views

GPU Accelerated Lambert Solution Methods for the Orbital Targeting Problem  2,940 views

Fast and robust CAMShift tracking  2,940 views

REMODE: Probabilistic, Monocular Dense Reconstruction in Real Time  2,940 views

HIPAcc: A Domain-Specific Language and Compiler for Image Processing  2,939 views

High performance in silico virtual drug screening on many-core processors  2,938 views

ECM on Graphics Cards  2,938 views

GPU Programming in Functional Languages: A Comparison of Haskell GPU Embedded Domain Specific Languages  2,936 views

A Framework for General Sparse Matrix-Matrix Multiplication on GPUs and Heterogeneous Processors  2,936 views

Lossless LZW Data Compression Algorithm on CUDA  2,935 views

Revisiting the Case of ARM SoCs in High-Performance Computing Clusters  2,934 views

Efficient and Scalable k-Means on GPUs  2,933 views

rCUDA: Reducing the number of GPU-based accelerators in high performance clusters  2,933 views

Fast GPU-based fluid simulations using SPH  2,932 views

Fast Speaker Diarization Using a High-Level Scripting Language  2,930 views

A Fast and Efficient SIFT Detector Using the Mobile GPU  2,929 views

Hardware accelerators for biocomputing: A survey  2,928 views

Performance of FORTRAN and C GPU Extensions for a Benchmark Suite of Fourier Pseudospectral Algorithms  2,928 views

Beyond programmable shading (parts I and II)  2,928 views

A Micro-benchmark Suite for AMD GPUs  2,928 views

A two-fluid finite-volume solver based on OpenCL  2,928 views

GPU Gems 3  2,928 views

Investigation of GPU-based Pattern Matching  2,927 views

Exploiting Space and Time Coherence in Grid-based Sorting  2,927 views

Deep learning with COTS HPC systems  2,927 views

Python Non-Uniform Fast Fourier Transform (PyNUFFT): An Accelerated Non-Cartesian MRI Package on a Heterogeneous Platform (CPU/GPU)  2,926 views

GPUWattch: Enabling Energy Optimizations in GPGPUs  2,926 views

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising  2,925 views

Dynamic Parallelism in GPU Optimized Barnes Hut Trees for Molecular Dynamics Simulations  2,924 views

GPU-based high-performance computing for radiation therapy  2,924 views

OpenCL Performance Evaluation on Modern Multi Core CPUs  2,922 views

Hyper neural network on OpenCL  2,920 views

GPU-accelerated triangle-triangle intersection tester algorithm  2,919 views

GPU Parallel Implementation of the Approximate K-SVD Algorithm Using OpenCL  2,919 views

OpenCL-Accelerated Simplified General Perturbations 4 Algorithm  2,918 views

KUDA: GPU Accelerated Split Race Checker  2,917 views

Pipelined MapReduce: A Decoupled MapReduce RunTime for Shared Memory Multi-Processors  2,917 views

Efficient Multi-GPU Computation of All-Pairs Shortest Paths  2,917 views

The Plasma Simulation Code: A modern particle-in-cell code with load-balancing and GPU support  2,917 views

The VOLNA-OP2 Tsunami Code (Version 1.0)  2,917 views

GPU-Based Asynchronous Global Optimization with Particle Swarm  2,916 views

Accelerating In-Memory Graph Database traversal using GPGPUS  2,915 views

Acceleration Techniques for GPU-based Volume Rendering  2,914 views

Improving CUDA DNA Analysis Software with Genetic Programming  2,913 views

Maximal Information Coefficient Analysis  2,913 views

Compiler and runtime techniques for bulk-synchronous programming models on CPU architectures  2,913 views

Neural scene representation and rendering  2,913 views

Compiler Fuzzing through Deep Learning  2,912 views

Contract-Based General-Purpose GPU Programming  2,912 views

Automatic generation of CUDA code performing tensor manipulations using C++ expression templates  2,911 views

Direct evaluation of NURBS curves and surfaces on the GPU  2,910 views

2PARMA: Parallel Paradigms and Run-time Management Techniques for Many-Core Architectures  2,910 views

Fast and accurate digital signal processing realized with GPGPU technology  2,909 views

Efficient JPEG2000 EBCOT Context Modeling for Massively Parallel Architectures  2,909 views

Accelerating convolutions on the sphere with hybrid GPU/CPU kernel splitting  2,909 views

GPU-based password cracking  2,909 views

 

Brief statistics for this page

Titles: 100

Total views: 293809

 


HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hgpu.org