Views of posts on hgpu.org
GPU Monte Carlo scatter calculations for Cone Beam Computed Tomography 2,908 views
GPU Accelerated 3-D Modeling and Simulation of a Blended Kinetic Impact and Nuclear Subsurface Explosion 2,908 views
Multi-GPU accelerated multi-spin Monte Carlo simulations of the 2D Ising model 2,907 views
Parallel Explicit FEM Algorithms Using GPU’s 2,907 views
A multi-Teraflop Constituency Parser using GPUs 2,907 views
SPH on GPU with CUDA 2,906 views
Comparison of Random Number Generators in Particle Swarm Optimization Algorithm 2,905 views
3D nonrigid registration via optimal mass transport on the GPU 2,905 views
Efficient Partitioning Based Hierarchical Agglomerative Clustering Using Graphics Accelerators with CUDA 2,905 views
B-Calm: an Open-Source Multi-Gpu-Based 3D-FDTD with Multi-Pole Dispersion for Plasmonics 2,904 views
Fast and Robust Linear Motion Deblurring 2,904 views
Application of GPU Smooth Particle Hydrodynamics: Wave Runup and Overtopping on Composite Slopes 2,903 views
File I/O on Intel Xeon Phi Coprocessors: RAM disks, VirtIO, NFS and Lustre 2,903 views
A Parallel Image Segmentation Algorithm on GPUs 2,903 views
GPU Parallel Statistical and Cube Test Analysis of the SHA-3 Finalist Candidate Hash Functions 2,902 views
High performance transcription factor-DNA docking with GPU computing 2,902 views
Mint: realizing CUDA performance in 3D stencil methods with annotated C 2,901 views
Parallelization of calculations using GPU in optimization approach for macromodels construction 2,901 views
Implementation of medical image segmentation in CUDA 2,901 views
Fast k Nearest Neighbor Search using GPU 2,901 views
Gunrock: A High-Performance Graph Processing Library on the GPU 2,900 views
Parallel kNN on GPU Architecture Using OpenCL 2,900 views
OpenCL programming using Python syntax 2,900 views
Integrated GPUs: how useful are they in HPC? 2,900 views
Jump flooding in GPU with applications to Voronoi diagram and distance transform 2,899 views
MLitB: Machine Learning in the Browser 2,899 views
Parallel scalable simulations of biological neural networks using TensorFlow: A beginner’s guide 2,899 views
Neurokernel: An Open Source Platform for Emulating the Fruit Fly Brain 2,899 views
A Multi-GPU Programming Library for Real-Time Applications 2,899 views
Efficient GPU-based Graph Cuts for Stereo Matching 2,899 views
Loo.py: From Fortran to performance via transformation and substitution rules 2,899 views
Supervised Hashing with Deep Neural Networks 2,898 views
Beyond 16GB: Out-of-Core Stencil Computations 2,898 views
Dynamic Programming with CUDA – Part II 2,897 views
GPU Programming with CUDA: A brief overview 2,897 views
Implementation of K-shortest Path Algorithm in GPU Using CUDA 2,896 views
A GPGPU-based Pipeline for Accelerated Rendering of Point Clouds 2,895 views
GPU acceleration of a production molecular docking code 2,895 views
Deep Voice: Real-time Neural Text-to-Speech 2,894 views
Vision based Navigation (VBN) of Unmanned Aerial Vehicles (UAV) 2,893 views
Accelerating Binarized Neural Networks: Comparison of FPGA, CPU, GPU, and ASIC 2,893 views
High Performance Implementation of Ultrasound Color Doppler Imaging on GPU platform 2,891 views
Reducing GPU Offload Latency via Fine-Grained CPU-GPU Synchronization 2,890 views
An instruction-systolic programmable shader architecture for multi-threaded 3D graphics processing 2,890 views
Parallel Unsteady Flow Line Integral Convolution for High-Performance Dense Visualization 2,888 views
List Mode PET reconstruction 2,887 views
Exploiting Heterogeneous Systems: Keccak on OpenCL 2,886 views
BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images 2,886 views
Optimising Hydrodynamics applications for the Cray XC30 with the application tool suite 2,885 views
A Novel Learning Algorithm for Bayesian Network and Its Efficient Implementation on GPU 2,885 views
3.5-D Blocking Optimization for Stencil Computations on Modern CPUs and GPUs 2,885 views
Sylkan: Towards a Vulkan Compute Target Platform for SYCL 2,884 views
Volume Raycasting Performance Using DirectCompute 2,884 views
Solving the Caputo Fractional Reaction-Diffusion Equation on GPU 2,884 views
Large Graphs on multi-GPUs 2,882 views
Automated Tool to Generate Parallel CUDA code from a Serial C Code 2,882 views
Contributions of hybrid architectures to depth imaging: a CPU, APU and GPU comparative study 2,882 views
phiGEMM: a CPU-GPU library for porting Quantum ESPRESSO on hybrid systems 2,882 views
Burrows-Wheeler Aligner: A Parallel Approach 2,882 views
Distributed Massive Model Rendering 2,881 views
A fast marching method based back projection algorithm for photoacoustic tomography in heterogeneous media 2,880 views
Monte Carlo Path Tracing with OpenCL 2,880 views
GPU Matrix Multiplication 2,880 views
Variational Bayesian Image Super-Resolution with GPU Acceleration 2,880 views
Embedding OpenCL in C++ for Expressive GPU Programming 2,878 views
A CUDA Monte Carlo simulator for radiation therapy dosimetry based on Geant4 2,878 views
CT to Cone-beam CT Deformable Registration With Simultaneous Intensity Correction 2,877 views
Evaluating Performance Tradeoffs on the Radeon Open Compute Platform 2,877 views
Multi-Level Graph Layout on the GPU 2,876 views
Running Financial Risk Management Applications on FPGA in the Amazon Cloud 2,875 views
Accelerating GPU Programs by Reducing Irregular Control Flow and Memory Access 2,874 views
Real-Time Stereo Matching using Adaptive Window based Disparity Refinement 2,874 views
GPU Cluster for High Performance Computing 2,874 views
OpenCL Implementation of Montgomery Multiplication on FPGA 2,874 views
An improved implementation of Preconditioned Conjugate Gradient Method on GPU 2,873 views
A framework for cost based optimization of hybrid CPU/GPU query plans in database systems 2,873 views
Full Covariance Gaussian Mixture Models Evaluation on GPU 2,873 views
A collision detection algorithm using adaptive particle sensor 2,873 views
Optimization of HEP codes on GPUs 2,873 views
cuTT: A High-Performance Tensor Transpose Library for CUDA Compatible GPUs 2,872 views
Parallelization of the Ant Colony Optimization for the Shortest Path Problem using OpenMP and CUDA 2,871 views
Particle-based volume rendering 2,871 views
Development of High-Performance Software Components for Emerging Architectures 2,871 views
Solving Molecular Distance Geometry Problems in OpenCL 2,871 views
Parallel and Scalable Sparse Basic Linear Algebra Subprograms 2,870 views
Kernel Tuner: A search-optimizing GPU code auto-tuner 2,869 views
Large Scale Monte Carlo Tree Search on GPU 2,869 views
Sparse-Matrix-CG-Solver in CUDA 2,868 views
Implementing Interactive 3D Segmentation on CUDA Using Graph-Cuts and Watershed Transformation 2,868 views
Computer Vision on the GPU — Tools, Algorithms and Frameworks 2,868 views
Fast GPU-based Locality Sensitive Hashing for K-Nearest Neighbor Computation 2,867 views
A balanced programming model for emerging heterogeneous multicore systems 2,867 views
KBLAS: An Optimized Library for Dense Matrix-Vector Multiplication on GPU Accelerators 2,867 views
Blink: Fast and Generic Collectives for Distributed ML 2,867 views
Parallel Implementations for Solving Shortest Path Problem using Bellman-Ford 2,866 views
High Performance Multi-dimensional (2D/3D) FFT-Shift Implementation on Graphics Processing Units (GPUs) 2,865 views
Titles: 100
Total views: 288745
- Programming - 186,232 views
- Login - 172,232 views
- User dashboard - 98,624 views
- Paper titles list - 93,060 views
- Add new event - 69,220 views
- Add new post - 62,834 views
- Register - 53,133 views
- Statistics - 44,280 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,523 views
- Books on OpenCL and CUDA - 31,189 views