Views of posts on hgpu.org
libmolgrid: GPU Accelerated Molecular Gridding for Deep Learning Applications 2,611 views
Deep Learning in the Automotive Industry: Applications and Tools 2,610 views
Real time Multi-GPU-based Event Detection in High Definition Videos 2,610 views
High performance pattern matching and data remanence on graphics processing units 2,609 views
Data Structures for Task-based Priority Scheduling 2,609 views
Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL? 2,607 views
Implementing Strassen’s Algorithm with CUTLASS on NVIDIA Volta GPUs 2,606 views
OpenVIDIA: parallel GPU computer vision 2,605 views
A Chunking Method for Euclidean Distance Matrix Calculation on Large Dataset Using Multi-GPU 2,604 views
GPU Pro 7: Advanced Rendering 2,603 views
SWM: Simplified Wu-Manber for GPU-based Deep Packet Inspection 2,603 views
Legolizer: A Real-Time System for Modeling and Rendering LEGO Representations of Boundary Models 2,603 views
Optimizing Linpack Benchmark on GPU-Accelerated Petascale Supercomputer 2,602 views
Hierarchical Stochastic Motion Blur Rasterization 2,599 views
A GPU Approach to Fortran Legacy Systems 2,598 views
Autotuning Programs with Algorithmic Choice 2,598 views
Adapting the GA Approach to Solve Traveling Salesman Problems on CUDA Architecture 2,598 views
Design and Implementation of the Futhark Programming Language 2,597 views
Parallel hyperbolic PDE simulation on clusters: Cell versus GPU 2,595 views
2HOT: An Improved Parallel Hashed Oct-Tree N-Body Algorithm for Cosmological Simulation 2,595 views
Hardware-Accelerated Raycasting: Towards an Effective Brain MRI Visualization 2,595 views
FlexGrip: A Soft GPGPU for FPGAs 2,594 views
A case study on porting scientific applications to GPU/CUDA 2,594 views
clMAGMA: High Performance Dense Linear Algebra with OpenCL 2,594 views
3D Haar-Like Elliptical Features for Object Classification in Microscopy 2,593 views
Implementation of digital down converter in GPU 2,593 views
Parallel Voronoi Diagram computation on scaled distance planes using CUDA 2,590 views
GPF: a framework for general packet classification on GPU co-processors 2,590 views
Fluid Simulation: Smoothed Particle Hydrodynamics on the GPU 2,590 views
State Lattice-based Motion Planning for Autonomous On-Road Driving 2,589 views
190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs 2,589 views
FastMag: Fast micromagnetic simulator for complex magnetic structures 2,588 views
Converting Data to Task-Parallelism by Rewrites 2,588 views
Beyond 16GB: Out-of-Core Stencil Computations 2,587 views
Improving Cache Locality for GPU-based Volume Rendering 2,586 views
A Hybrid Approach to Parallel Connected Component Labeling Using CUDA 2,586 views
Programming on Parallel Machines: GPU, Multicore, Clusters and More 2,586 views
Swarm-NG: a CUDA Library for Parallel n-body Integrations with focus on Simulations of Planetary Systems 2,585 views
CUDA-enabled Optimisation of Technical Analysis Parameters 2,585 views
A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures 2,584 views
3D Edge Bundling for Geographical Data Visualization 2,580 views
Lossless LZW Data Compression Algorithm on CUDA 2,580 views
GPU performance comparison for accelerated radar data processing 2,579 views
The design and verification of Mumax3 2,577 views
Fast Speaker Diarization Using a High-Level Scripting Language 2,577 views
Energy-Efficient FPGA Implementation for Binomial Option Pricing Using OpenCL 2,576 views
SiftCU: An Accelerated Cuda Based Implementation of SIFT 2,576 views
Accelerating convolutions on the sphere with hybrid GPU/CPU kernel splitting 2,576 views
High Performance Histograms on SIMT and SIMD Architectures 2,575 views
Parallel Catmull-Rom Spline Interpolation Algorithm for Image Zooming Based on CUDA 2,574 views
A two-fluid finite-volume solver based on OpenCL 2,574 views
Revisiting the Case of ARM SoCs in High-Performance Computing Clusters 2,573 views
Fast and robust CAMShift tracking 2,572 views
Hybrid CPU-GPU Pipeline Framework 2,571 views
rCUDA: Reducing the number of GPU-based accelerators in high performance clusters 2,570 views
A case for neuromorphic ISAs 2,570 views
A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising 2,570 views
REMODE: Probabilistic, Monocular Dense Reconstruction in Real Time 2,570 views
Performance comparison of Lattice Boltzmann fluid flow simulation using OpenCL and CUDA frameworks 2,569 views
FastTree: A Hardware KD-Tree Construction Acceleration Engine for Real-Time Ray Tracing 2,569 views
A Case for Work-stealing on FPGAs with OpenCL Atomics 2,568 views
High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster 2,568 views
Introducing CURRENNT: The Munich Open-Source CUDA RecurREnt Neural Network Toolkit 2,568 views
Accelerating In-Memory Graph Database traversal using GPGPUS 2,566 views
Scalable Kernel Fusion for Memory-Bound GPU Applications 2,566 views
OpenGL SuperBible: Comprehensive Tutorial and Reference (5th Edition) 2,565 views
Acceleration of CFD and data analysis using graphics processors 2,565 views
Parallel Computation of Non-Bonded Interactions in Drug Discovery: Nvidia GPUs vs. Intel Xeon Phi 2,564 views
Burrows-Wheeler Aligner: A Parallel Approach 2,563 views
CPU and/or GPU: Revisiting the GPU Vs. CPU Myth 2,562 views
GHOST: GPGPU-Offloaded High Performance Storage I/O Deduplication for Primary Storage System 2,561 views
Parallel kNN on GPU Architecture Using OpenCL 2,561 views
Multi-Threaded Automatic Integration Using OpenMP and CUDA 2,559 views
Hybrid GPU-Based Single- and Double-Bounce SAR Simulation 2,559 views
Fast Gpu-Based Interpolation for SAR Backprojection 2,558 views
Maximal Information Coefficient Analysis 2,557 views
GPU Programming with CUDA: A brief overview 2,557 views
3D finite element numerical integration on GPUs 2,557 views
Matrix Multiplication with CUDA – A basic introduction to the CUDA programming model 2,557 views
Machine Learning from Streaming Data in Heterogeneous Computing Environments 2,557 views
Fast GPU-based fluid simulations using SPH 2,557 views
Mint: realizing CUDA performance in 3D stencil methods with annotated C 2,556 views
OpenCL Performance Evaluation on Modern Multi Core CPUs 2,555 views
Comparison of Technologies for General-Purpose Computing on Graphics Processing Units 2,554 views
Rubus: A compiler for seamless and extensible parallelism 2,554 views
A simple GPU-based approach for 3D Voronoi diagram construction and visualization 2,553 views
HIPAcc: A Domain-Specific Language and Compiler for Image Processing 2,552 views
Parallelization of calculations using GPU in optimization approach for macromodels construction 2,552 views
Support for Parallel Scan in OpenMP 2,552 views
Investigation of GPU-based Pattern Matching 2,551 views
Design and Development of an Efficient H.264 Video Encoder for CPU/GPU using OpenCL 2,550 views
Optimizing Performance of Recurrent Neural Networks on GPUs 2,550 views
Data-rich astronomy: mining synoptic sky surveys 2,550 views
An hybrid AES-256-GCM implementation for NEON CPU & CUDA GPU 2,549 views
A multi-Teraflop Constituency Parser using GPUs 2,549 views
Titles: 100
Total views: 257,714
- Programming - 186,126 views
- Login - 164,192 views
- User dashboard - 90,260 views
- Paper titles list - 69,527 views
- Add new event - 64,519 views
- Add new post - 59,091 views
- Register - 49,124 views
- Statistics - 36,149 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,159 views
- Books on OpenCL and CUDA - 28,751 views