Views of posts on hgpu.org
Development of High-Performance Software Components for Emerging Architectures 2,480 views
PUGACE, a cellular Evolutionary Algorithm framework on GPUs 2,479 views
New High Performance GPGPU Code Transformation Framework Applied to Large Production Weather Prediction Code 2,479 views
Password Recovery Using MPI and CUDA 2,479 views
Supervised Hashing with Deep Neural Networks 2,479 views
A Comparison of Modern GPU and CPU Architectures: And the Common Convergence of Both 2,478 views
Anatomy of High-Performance Many-Threaded Matrix Multiplication 2,478 views
Architectural Support for Virtual Memory in GPUs 2,478 views
Divide and Conquer G-Buffer Ray Tracing 2,477 views
GPU-Accelerated Recurrent Neural Networks: OpenCLLink and SymbolicC 2,477 views
CURFIL: Random Forests for Image Labeling on GPU 2,476 views
Atomic-free Irregular Computations on GPUs 2,476 views
Accelerating Binarized Neural Networks: Comparison of FPGA, CPU, GPU, and ASIC 2,475 views
Optimizing OpenCL Local Work Group Size With Machine Learning 2,473 views
A Fast GVF Snake Algorithm on the GPU 2,473 views
Sparse Matrix Multiplication using CUDA and Mex Interface 2,472 views
A GPU-Based Enhanced Genetic Algorithm for Power-Aware Task Scheduling Problem in HPC Cloud 2,472 views
Darknet on OpenCL: a multi-platform tool for object detection and classification 2,472 views
Binomial American Option Pricing on CPU-GPU Hetergenous System 2,471 views
Distributed genetic programming on GPUs using CUDA 2,471 views
phiGEMM: a CPU-GPU library for porting Quantum ESPRESSO on hybrid systems 2,471 views
A Parallel Image Segmentation Algorithm on GPUs 2,470 views
Performance Evaluations of Graph Database using CUDA and OpenMP-Compatible Libraries 2,470 views
Implementing Computer Vision Functions with OpenCL on the Qualcomm Adreno 420 2,470 views
Medical Image Registration using OpenCL 2,469 views
Systematic Performance Optimization of Cone-Beam Back-Projection on the Kepler Architecture 2,469 views
Lattice QCD on Intel Xeon Phi 2,469 views
Parallel Explicit FEM Algorithms Using GPU’s 2,469 views
Spectral volume rendering using GPU-based raycasting 2,469 views
OpenCL Implementation of LiDAR Data Processing 2,468 views
A Duality Based Approach for Realtime TV-L1 Optical Flow 2,468 views
GPU-based ray-casting of non-rigid deformations: a comparison between direct and indirect approaches 2,468 views
Hardware-Oblivious Parallelism for In-Memory Column-Stores 2,468 views
SYCL-Bench: A Versatile Single-Source Benchmark Suite for Heterogeneous Computing 2,468 views
Secret Key Cryptography Using Graphics Cards 2,466 views
Contributions of hybrid architectures to depth imaging: a CPU, APU and GPU comparative study 2,466 views
Work Efficient Parallel Algorithms for Large Graph Exploration 2,466 views
GPU-based Video Feature Tracking and Matching 2,466 views
FPGA vs. GPU for sparse matrix vector multiply 2,466 views
A simple and flexible volume rendering framework for graphics-hardware-based raycasting 2,465 views
FPGA-based Tsunami Simulation: Performance Comparison with GPUs, and Roofline Model for Scalability Analysis 2,465 views
Kernel Tuner: A search-optimizing GPU code auto-tuner 2,464 views
Adaptation of the MapReduce programming framework to compute-intensive data-analytics kernels 2,463 views
Fast Effective Deterministic Primality Test Using CUDA/GPGPU 2,463 views
Real-Time Stereo Matching using Adaptive Window based Disparity Refinement 2,462 views
Scope for performance enhancement of CMU Sphinx by parallelising with OpenCL 2,462 views
GPU acceleration of a production molecular docking code 2,462 views
Optimization of HEP codes on GPUs 2,462 views
Fast GPGPU Data Rearrangement Kernels using CUDA 2,461 views
cuTT: A High-Performance Tensor Transpose Library for CUDA Compatible GPUs 2,461 views
High Precision Integer Multiplication with a GPU Using Strassen’s Algorithm with Multiple FFT Sizes 2,461 views
Password Cracking in the Cloud 2,461 views
A Study of Complex Deep Learning Networks on High Performance, Neuromorphic, and Quantum Computers 2,460 views
A Micro-benchmark Suite for AMD GPUs 2,460 views
Billion-scale similarity search with GPUs 2,460 views
CFMDS: CUDA-based fast multidimensional scaling for genome-scale data 2,460 views
A Datalog Engine for GPUs 2,459 views
Implementation of QR Updating Algorithms on the GPU 2,458 views
Automatic C-to-CUDA Code Generation for Affine Programs 2,458 views
Solving Molecular Distance Geometry Problems in OpenCL 2,458 views
An Introduction to High Performance Computing on AWS 2,457 views
NCRF++: An Open-source Neural Sequence Labeling Toolkit 2,456 views
Writing self-adaptive codes for heterogeneous systems 2,455 views
Utilising OpenCL Framework for Ray-Tracing Acceleration 2,455 views
A Survey of FPGA Based Neural Network Accelerator 2,455 views
Study of basic vector operations on Intel Xeon Phi and NVIDIA Tesla using OpenCL 2,454 views
NeMo: A Platform for Neural Modelling of Spiking Neurons Using GPUs 2,454 views
Parallelizing LINQ Program for GPGPU 2,454 views
GPU-based simulation of brain neuron models 2,454 views
A GPU operations framework for WattDB 2,453 views
GPU-Accelerated Non-negative Matrix Factorization for Text Mining 2,452 views
Platform-independent parallelization of the Lattice Boltzmann method with OpenCL 2,452 views
Odeint – Solving ordinary differential equations in C++ 2,450 views
GPU Accelerated Computation and Real-time Rendering of Cellular Automata Model for Spatial Simulation 2,450 views
A balanced programming model for emerging heterogeneous multicore systems 2,449 views
CrowdCL: Web-Based Volunteer Computing with WebCL 2,448 views
Implementation of k-Means Clustering Algorithm in CUDA 2,447 views
OpenCL Implementation of Montgomery Multiplication on FPGA 2,446 views
GPU Accelerated Radio Wave Propagation Modeling Using Ray Tracing 2,446 views
In-Place Recursive Approach for All-Pairs Shortest Paths Problem Using OpenCL 2,446 views
Multi-GPU accelerated multi-spin Monte Carlo simulations of the 2D Ising model 2,444 views
Full Covariance Gaussian Mixture Models Evaluation on GPU 2,443 views
Performance models for CPU-GPU data transfers 2,443 views
OpenCL for Database Query Processing 2,442 views
Particle method on GPU 2,442 views
Dynamic Parallelism in GPU Optimized Barnes Hut Trees for Molecular Dynamics Simulations 2,441 views
Evolutionary Algorithm for Optimizing Parameters of GPGPU-based Image Segmentation 2,440 views
CUDA Based Polyphase Filter 2,440 views
GPU in Physics Computation: Case Geant4 Navigation 2,439 views
The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC Applications 2,438 views
A New Data Layout For Set Intersection on GPUs 2,437 views
Parallel Multi Channel Convolution using General Matrix Multiplication 2,437 views
MrBayes on a Graphics Processing Unit 2,437 views
An Algorithm for Fast Edit Distance Computation on GPUs 2,437 views
Titles: 100
Total views: 246038
- Programming - 186,133 views
- Login - 164,571 views
- User dashboard - 91,323 views
- Paper titles list - 71,400 views
- Add new event - 64,819 views
- Add new post - 59,626 views
- Register - 49,323 views
- Statistics - 37,182 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,194 views
- Books on OpenCL and CUDA - 28,901 views