Views of posts on hgpu.org
GPU & CPU implementation of Young – Van Vliet’s Recursive Gaussian Smoothing Filter 2,175 views
Loo.py: transformation-based code generation for GPUs and CPUs 2,175 views
Optical Flow via Locally Adaptive Fusion of Complementary Data Costs 2,174 views
Efficient fMRI Analysis and Clustering on GPUs 2,174 views
Real-Time Photon Mapping on GPU 2,174 views
Heat Load Modelling for District Heating Plants Using an OpenCL-based Algorithm 2,173 views
GPU Accelerated Fractal Image Compression for Medical Imaging in Parallel Computing Platform 2,173 views
Exploring Programming Multi-GPUs using OpenMP & OpenACC-based Hybrid Model 2,173 views
Fractals Image Rendering and Compression using GPUs 2,173 views
Solving the Boltzmann equation on GPUs 2,172 views
An Overview on the Latest Nature-Inspired and Metaheuristics-Based Image Registration Algorithms 2,172 views
A Study of Successive Over-relaxation Method Parallelization Over Modern HPC Languages 2,172 views
Dynamic Buffer Overflow Detection for GPGPUs 2,172 views
Assembly-Free Large-Scale Modal Analysis on the GPU 2,172 views
Parallel GMRES implementation for solving sparse linear systems on GPU clusters 2,171 views
GPU architecture overview 2,171 views
A Novel CPU/GPU Simulation Environment for Large-Scale Biologically-Realistic Neural Modeling 2,171 views
Optimising Purely Functional GPU Programs 2,171 views
On learning optimized reaction diffusion processes for effective image restoration 2,170 views
Image segmentation using CUDA implementations of the Runge-Kutta-Merson and GMRES methods 2,170 views
GPU-based Iterative Cone Beam CT Reconstruction Using Tight Frame Regularization 2,169 views
Static Memory Access Pattern Analysis on a Massively Parallel GPU 2,169 views
CPU and GPU Co-processing for Sound 2,169 views
MapSQ: A MapReduce-based Framework for SPARQL Queries on GPU 2,168 views
Efficient Processing of MRFs for Unconstrained-Pose Face Recognition 2,168 views
GPUMP: A Multiple-Precision Integer Library for GPUs 2,168 views
Formalizing Address Spaces with application to Cuda, OpenCL, and beyond 2,167 views
A Case Study of SWIM: Optimization of Memory Intensive Application on GPGPU 2,167 views
A new CUDA-based GPU implementation of the two-dimensional Athena code 2,167 views
A Compiler for Throughput Optimization of Graph Algorithms on GPUs 2,167 views
Lyra2: Password Hashing Scheme with improved security against time-memory trade-offs 2,166 views
A New Architecture for Games and Simulations Using GPUs 2,166 views
Flip-Flop: Convex Hull Construction via Star-Shaped Polyhedron in 3D 2,166 views
Developing a massive real-time crowd simulation framework on the GPU 2,166 views
Accelerating NTRU based Homomorphic Encryption using GPUs 2,166 views
Accelerating the Nussinov RNA folding algorithm with CUDA/GPU 2,166 views
A GPU based saliency map for high-fidelity selective rendering 2,166 views
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs 2,165 views
A High-efficiency FPGA-based Accelerator for Convolutional Neural Networks using Winograd Algorithm 2,165 views
Proteus: Exploiting Numerical Precision Variability in Deep Neural Networks 2,164 views
Neural GPUs Learn Algorithms 2,164 views
Design Exploration of AES Accelerators on FPGAs and GPUs 2,163 views
Automatic test case reduction of randomly generated OpenCL kernels 2,163 views
Towards Enhancing Performance, Programmability, and Portability in Heterogeneous Computing 2,163 views
GPU-Based Shooting and Bouncing Ray Method for Fast RCS Prediction 2,163 views
A Real-time GPU Implementation of the SIFT Algorithm for Large-Scale Video Analysis Tasks 2,163 views
A 3D radiative transfer framework. VIII. OpenCL implementation 2,162 views
Implementing a Sparse Matrix Vector Product for the SELL-C/SELL-C-sigma formats on NVIDIA GPUs 2,162 views
FSCL: Homogeneous programming, scheduling and execution on heterogeneous platforms 2,162 views
Introduction to GPU programming for EDA 2,162 views
The Rodinia Benchmark Suite in SYCL 2,161 views
Binary Interval Search (BITS): A Scalable Algorithm for Counting Interval Intersections 2,161 views
Error Resilience Evaluation on GPGPU Applications 2,160 views
Optimization of a Machine Learning Algorithm on the Heterogeneous system using OpenCL 2,160 views
A GPGPU Transparent Virtualization Component for High Performance Computing Clouds 2,160 views
Parallel Implementations of Hopfield Neural Networks On GPU 2,160 views
Automatic SIMD Code Generation 2,160 views
CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms 2,160 views
Parallel implementation of the Finite-Difference Time-Domain method in Open Computing Language 2,160 views
Analysis and implementation of a BLAST-Like algorithm for MIC architectures 2,159 views
Larrabee: a many-core x86 architecture for visual computing 2,158 views
Simulating Quantum Computers Using OpenCL 2,158 views
Warp Size Impact in GPUs: Large or Small? 2,157 views
MVAPICH2-GPU: optimized GPU to GPU communication for InfiniBand clusters 2,157 views
Real-time dynamic tone-mapping operator on GPU 2,157 views
Object support for OpenMP-style programming of GPU clusters in Java 2,157 views
High Quality Cone-beam CT Reconstruction on the GPU 2,157 views
The Distribution of OpenCL Kernel Execution Across Multiple Devices 2,157 views
Applications of Deep Neural Networks 2,157 views
Parallelization of a novel frequent itemset hiding algorithm on a CPU-GPU platform 2,156 views
SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences 2,156 views
Acceleration of PET Monte Carlo simulation using the graphics hardware ray-tracing engine 2,156 views
Real-time GPU-based Simulation of Dynamic Terrain in Virtual Battlefield 2,156 views
Multicore and GPU Parallelization of Neural Networks for Face Recognition 2,155 views
LAMMPS’ PPPM Long-Range Solver for the Second Generation Xeon Phi 2,155 views
Characterizing and Evaluating a Key-value Store Application on Heterogeneous CPU-GPU Systems 2,155 views
An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness 2,155 views
Scalable and Interactive Segmentation and Visualization of Neural Processes in EM Datasets 2,155 views
Efficient Implementation of the Simplex Method on a CPU-GPU System 2,155 views
Using GPUs to Crack Android Pattern-based Passwords 2,154 views
A fast high quality pseudo random number generator for nVidia CUDA 2,153 views
GPUmotif: An Ultra-Fast and Energy-Efficient Motif Analysis Program Using Graphics Processing Units 2,153 views
Implementing Molecular Dynamics on Hybrid High Performance Computers – Three-Body Potentials 2,153 views
A Unified Optimization Approach for Sparse Tensor Operations on GPUs 2,153 views
GPU acceleration of Runge Kutta-Fehlberg and its comparison with Dormand-Prince method 2,153 views
A 3D radiative transfer framework: XIII. OpenCL implementation 2,153 views
Local Alignment Tool Based on Hadoop Framework and GPU Architecture 2,152 views
Best bang for your buck: GPU nodes for GROMACS biomolecular simulations 2,152 views
Seismic Wave Propagation Simulation Using Support Operator Method on multi-GPU system 2,152 views
GPU-based infrared thermography for NDE of minefields 2,152 views
Modern Gyrokinetic Particle-In-Cell Simulation of Fusion Plasmas on Top Supercomputers 2,152 views
Parallel SIFT-detector implementation for images matching 2,151 views
Stochastic Gradient Descent on GPUs 2,151 views
Titles: 100
Total views: 216,267