Views of posts on hgpu.org
Fast Makespan Estimation for GPU Threads on a Single Streaming Multiprocessor 1,929 views
Multi-GPU Graph Analytics 1,929 views
GPU-accelerated time-domain circuit simulation 1,928 views
HSPA+/LTE-A Turbo Decoder on GPU and Multicore CPU 1,928 views
GPU-Assisted Cryptography of Log-Structured Indices 1,928 views
GPU-based Fast Low-dose Cone Beam CT Reconstruction via Total Variation 1,928 views
GPU Accelerated Smith-Waterman 1,928 views
gpustats: GPU Library for Statistical Computing in Python 1,928 views
MALBEC: a new CUDA-C ray-tracer in General Relativity 1,928 views
Detecting Computer Viruses using GPUs 1,928 views
Mass-spring systems on the GPU 1,928 views
Optimization of Lattice Boltzmann Simulations on Heterogeneous Computers 1,928 views
A uniform approach for programming distributed heterogeneous computing systems 1,928 views
PARIS: A Parallel RSA-Prime Inspection Tool 1,928 views
A Survey of Architectural Techniques For Improving Cache Power Efficiency 1,928 views
Maximum mipmaps for fast, accurate, and scalable dynamic height field rendering 1,928 views
Oct-tree Method on GPU 1,927 views
Multi GPU Implementation of Iterative Tomographic Reconstruction Algorithms 1,927 views
Hybrid Acceleration of a Molecular Dynamics Simulation Using Short-Ranged Potentials 1,927 views
Analysis of Parallel Montgomery Multiplication in CUDA 1,927 views
Implementation of a High Throughput Soft MIMO Detector on GPU 1,927 views
Programming GPUs with C++14 and Just-In-Time Compilation 1,927 views
Accelerating Image Reconstruction in Dual-Head PET System by GPU and Symmetry Properties 1,927 views
Waste Not… Efficient Co-Processing of Relational Data 1,927 views
An unsupervised parallel genetic cluster algorithm for graphics processing units 1,927 views
Code Generation Compiler for the OpenMP 4.0 Accelerator Model onto OMPSS 1,926 views
Fast Global Illumination for Interactive Volume Visualization 1,926 views
Acceleration of real-life stencil codes on GPUs 1,926 views
Multi-Object Geodesic Active Contours (MOGAC): A Parallel Sparse-Field Algorithm for Image Segmentation 1,926 views
MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures 1,926 views
Acceleration of computational quantum chemistry by heterogeneous computer architectures 1,926 views
Median Based Parallel Steering Kernel Regression for Image Reconstruction 1,925 views
CUDA-Accelerated ODETLAP: A Parallel Lossy Compression Implementation 1,925 views
Nonnegative Tensor Factorization Accelerated Using GPGPU 1,925 views
Benchmarking TPU, GPU, and CPU Platforms for Deep Learning 1,925 views
Unsupervised Deep Learning of Incompressible Fluid Dynamics 1,925 views
A Study on Efficient Application Mapping on Parallel Computing Accelerators 1,924 views
Increasing Deep Neural Network Acoustic Model Size for Large Vocabulary Continuous Speech Recognition 1,924 views
How well do STARLAB and NBODY compare? II: Hardware and accuracy 1,924 views
A Fast GEMM Implementation On a Cypress GPU 1,924 views
Parallel simulation of Petri nets on desktop PC hardware 1,924 views
Dynamic Warp Resizing in High-Performance SIMT 1,924 views
G-Heart: A GPU-based System for Electrophysiological Simulation and Multi-modality Cardiac Visualization 1,924 views
Autotuning CUDA Compiler Parameters for Heterogeneous Applications using the OpenTuner Framework 1,923 views
Quantum.Ligand.Dock: protein-ligand docking with quantum entanglement refinement on a GPU system 1,923 views
Multi-GPU implementation of a VMAT treatment plan optimization algorithm 1,923 views
High performance MRI simulations of motion on multi-GPU systems 1,923 views
Ray Tracing Visualization Toolkit 1,923 views
Accelerating the Smith-Waterman Algorithm for Bio-sequence Matching on GPU 1,923 views
MEDINA: MECCA Development in Accelerators – KPP Fortran to CUDA source-to-source Preprocessor 1,923 views
Automatic Code Generation for Stencil Computations on GPU Architectures 1,923 views
Power-efficient medical image processing using PUMA 1,922 views
Implementing the Himeno benchmark with CUDA on GPU clusters 1,922 views
Efficient Convolutional Patch Networks for Scene Understanding 1,922 views
Improving GPU Performance: Reducing Memory Conflicts and Latency 1,922 views
A computationally efficient and scalable approach for privacy preserving kNN classification 1,922 views
CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform 1,922 views
An improved study of real-time fluid simulation on GPU 1,922 views
A CUDA-based parallel implementation of K-nearest neighbor algorithm 1,921 views
CuMF_SGD: Fast and Scalable Matrix Factorization 1,921 views
Challenges for compiler support for exascale computing 1,921 views
Analysis of KECCAK Tree Hashing on GPU Architectures 1,921 views
Toward a Generic Hybrid CPU-GPU Parallelization of Divide-and-Conquer Algorithms 1,920 views
Magnetohydrodynamics on Heterogeneous architectures: a performance comparison 1,920 views
Lattice Boltzmann Method for Simulating Turbulent Flows 1,920 views
An Analysis of Programmer Productivity versus Performance for High Level Data Parallel Programming 1,920 views
Solving lattice QCD systems of equations using mixed precision solvers on GPUs 1,920 views
Real-time Sliding Phase Vocoder using a Commodity GPU 1,920 views
Hybrid Sample-based Surface Rendering 1,919 views
Scalable GPU Acceleration of B-Spline Signal Processing Operations 1,919 views
Matrix Factorization on GPUs with Memory Optimization and Approximate Computing 1,919 views
Inertial-aided KLT feature tracking for a moving camera 1,919 views
ARC: Adaptive Ray-tracing with CUDA, a New Ray Tracing Code for Parallel GPUs 1,919 views
CELES: CUDA-accelerated simulation of electromagnetic scattering by large ensembles of spheres 1,919 views
Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs 1,919 views
An Exploration of OpenCL for a Numerical Relativity Application 1,919 views
Many-threaded Differential Evolution on the GPU 1,919 views
A Parallel Depth-aided Exemplar-based Inpainting for Real-time View Synthesis on GPU 1,919 views
Dynamically tuned push-relabel algorithm for the maximum flow problem on CPU-GPU-Hybrid platforms 1,919 views
Incoherent Ray tracing on GPU 1,919 views
Dynamic load balancing on single- and multi-GPU systems 1,918 views
A Feedback Approach to Task Partitioning in Heterogeneous Architectures 1,918 views
Accelerated Wide Baseline Matching using OpenCL 1,918 views
CUDA Tutorial – Cryptanalysis of Classical Ciphers Using Modern GPUs and CUDA 1,918 views
EigenCFA: accelerating flow analysis with GPUs 1,918 views
A constant-space belief propagation algorithm for stereo matching 1,917 views
CBench: Analyzing Compute Performance for Modern NVIDIA and AMD GPUs 1,917 views
On Dynamic Load Balancing on Graphics Processors 1,917 views
Multilayered Abstractions for Partial Differential Equations 1,917 views
GPU-Based Ray-Casting of Spherical Functions Applied to High Angular Resolution Diffusion Imaging 1,917 views
Motion Estimation with Non-Local Total Variation Regularization 1,916 views
Large, Pruned or Continuous Space Language Models on a GPU for Statistical Machine Translation 1,916 views
Implementing AES on GPU: Final Report 1,916 views
Titles: 100
Total views: 192299
- Programming - 186,133 views
- Login - 164,567 views
- User dashboard - 91,306 views
- Paper titles list - 71,303 views
- Add new event - 64,807 views
- Add new post - 59,600 views
- Register - 49,319 views
- Statistics - 37,168 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,188 views
- Books on OpenCL and CUDA - 28,899 views