
Views of posts on hgpu.org

Fast Makespan Estimation for GPU Threads on a Single Streaming Multiprocessor  1,929 views

Multi-GPU Graph Analytics  1,929 views

GPU-accelerated time-domain circuit simulation  1,928 views

HSPA+/LTE-A Turbo Decoder on GPU and Multicore CPU  1,928 views

GPU-Assisted Cryptography of Log-Structured Indices  1,928 views

GPU-based Fast Low-dose Cone Beam CT Reconstruction via Total Variation  1,928 views

GPU Accelerated Smith-Waterman  1,928 views

gpustats: GPU Library for Statistical Computing in Python  1,928 views

MALBEC: a new CUDA-C ray-tracer in General Relativity  1,928 views

Detecting Computer Viruses using GPUs  1,928 views

Efficient GPU implementation of parameter estimation of a statistical model for online advertisement optimization  1,928 views

Mass-spring systems on the GPU  1,928 views

Optimization of Lattice Boltzmann Simulations on Heterogeneous Computers  1,928 views

A uniform approach for programming distributed heterogeneous computing systems  1,928 views

PARIS: A Parallel RSA-Prime Inspection Tool  1,928 views

A Survey of Architectural Techniques For Improving Cache Power Efficiency  1,928 views

Maximum mipmaps for fast, accurate, and scalable dynamic height field rendering  1,928 views

Oct-tree Method on GPU  1,927 views

Multi GPU Implementation of Iterative Tomographic Reconstruction Algorithms  1,927 views

Hybrid Acceleration of a Molecular Dynamics Simulation Using Short-Ranged Potentials  1,927 views

Analysis of Parallel Montgomery Multiplication in CUDA  1,927 views

Implementation of a High Throughput Soft MIMO Detector on GPU  1,927 views

Programming GPUs with C++14 and Just-In-Time Compilation  1,927 views

Accelerating Image Reconstruction in Dual-Head PET System by GPU and Symmetry Properties  1,927 views

Waste Not… Efficient Co-Processing of Relational Data  1,927 views

An unsupervised parallel genetic cluster algorithm for graphics processing units  1,927 views

Code Generation Compiler for the OpenMP 4.0 Accelerator Model onto OMPSS  1,926 views

Fast Global Illumination for Interactive Volume Visualization  1,926 views

Acceleration of real-life stencil codes on GPUs  1,926 views

Multi-Object Geodesic Active Contours (MOGAC): A Parallel Sparse-Field Algorithm for Image Segmentation  1,926 views

MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures  1,926 views

Acceleration of computational quantum chemistry by heterogeneous computer architectures  1,926 views

CNNLab: a Novel Parallel Framework for Neural Networks using GPU and FPGA-a Practical Study with Trade-off Analysis  1,925 views

Median Based Parallel Steering Kernel Regression for Image Reconstruction  1,925 views

CUDA-Accelerated ODETLAP: A Parallel Lossy Compression Implementation  1,925 views

GPGPU Test Suite Minimisation: Search Based Software Engineering Performance Improvement Using Graphics Cards  1,925 views

Nonnegative Tensor Factorization Accelerated Using GPGPU  1,925 views

A Comparative Study on ASIC, FPGAs, GPUs and General Purpose Processors in the O(N^2) Gravitational N-body Simulation  1,925 views

Benchmarking TPU, GPU, and CPU Platforms for Deep Learning  1,925 views

Unsupervised Deep Learning of Incompressible Fluid Dynamics  1,925 views

A Study on Efficient Application Mapping on Parallel Computing Accelerators  1,924 views

Increasing Deep Neural Network Acoustic Model Size for Large Vocabulary Continuous Speech Recognition  1,924 views

How well do STARLAB and NBODY compare? II: Hardware and accuracy  1,924 views

A Fast GEMM Implementation On a Cypress GPU  1,924 views

Parallel simulation of Petri nets on desktop PC hardware  1,924 views

Dynamic Warp Resizing in High-Performance SIMT  1,924 views

G-Heart: A GPU-based System for Electrophysiological Simulation and Multi-modality Cardiac Visualization  1,924 views

Boda-RTC: Productive Generation of Portable, Efficient Code for Convolutional Neural Networks on Mobile Computing Platforms  1,924 views

Autotuning CUDA Compiler Parameters for Heterogeneous Applications using the OpenTuner Framework  1,923 views

Quantum.Ligand.Dock: protein-ligand docking with quantum entanglement refinement on a GPU system  1,923 views

Multi-GPU implementation of a VMAT treatment plan optimization algorithm  1,923 views

High performance MRI simulations of motion on multi-GPU systems  1,923 views

Ray Tracing Visualization Toolkit  1,923 views

Accelerating the Smith-Waterman Algorithm for Bio-sequence Matching on GPU  1,923 views

MEDINA: MECCA Development in Accelerators – KPP Fortran to CUDA source-to-source Preprocessor  1,923 views

Automatic Code Generation for Stencil Computations on GPU Architectures  1,923 views

Power-efficient medical image processing using PUMA  1,922 views

Implementing the Himeno benchmark with CUDA on GPU clusters  1,922 views

Efficient Convolutional Patch Networks for Scene Understanding  1,922 views

Improving GPU Performance: Reducing Memory Conflicts and Latency  1,922 views

A computationally efficient and scalable approach for privacy preserving kNN classification  1,922 views

CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform  1,922 views

An improved study of real-time fluid simulation on GPU  1,922 views

A CUDA-based parallel implementation of K-nearest neighbor algorithm  1,921 views

CuMF_SGD: Fast and Scalable Matrix Factorization  1,921 views

Challenges for compiler support for exascale computing  1,921 views

Analysis of KECCAK Tree Hashing on GPU Architectures  1,921 views

Toward a Generic Hybrid CPU-GPU Parallelization of Divide-and-Conquer Algorithms  1,920 views

Magnetohydrodynamics on Heterogeneous architectures: a performance comparison  1,920 views

Lattice Boltzmann Method for Simulating Turbulent Flows  1,920 views

An Analysis of Programmer Productivity versus Performance for High Level Data Parallel Programming  1,920 views

Solving lattice QCD systems of equations using mixed precision solvers on GPUs  1,920 views

Real-time Sliding Phase Vocoder using a Commodity GPU  1,920 views

Parallel Tempering Simulation of the three-dimensional Edwards-Anderson Model with Compact Asynchronous Multispin Coding on GPU  1,919 views

Hybrid Sample-based Surface Rendering  1,919 views

Scalable GPU Acceleration of B-Spline Signal Processing Operations  1,919 views

Matrix Factorization on GPUs with Memory Optimization and Approximate Computing  1,919 views

Inertial-aided KLT feature tracking for a moving camera  1,919 views

ARC: Adaptive Ray-tracing with CUDA, a New Ray Tracing Code for Parallel GPUs  1,919 views

CELES: CUDA-accelerated simulation of electromagnetic scattering by large ensembles of spheres  1,919 views

Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs  1,919 views

An Exploration of OpenCL for a Numerical Relativity Application  1,919 views

Many-threaded Differential Evolution on the GPU  1,919 views

A Parallel Depth-aided Exemplar-based Inpainting for Real-time View Synthesis on GPU  1,919 views

Dynamically tuned push-relabel algorithm for the maximum flow problem on CPU-GPU-Hybrid platforms  1,919 views

Incoherent Ray tracing on GPU  1,919 views

Dynamic load balancing on single- and multi-GPU systems  1,918 views

A Feedback Approach to Task Partitioning in Heterogeneous Architectures  1,918 views

Accelerated Wide Baseline Matching using OpenCL  1,918 views

CUDA Tutorial – Cryptanalysis of Classical Ciphers Using Modern GPUs and CUDA  1,918 views

EigenCFA: accelerating flow analysis with GPUs  1,918 views

Heuristics for Conversion Process of GPU’s Kernels for Multiples Kernels with Concurrent Optimization Divergence  1,918 views

A constant-space belief propagation algorithm for stereo matching  1,917 views

CBench: Analyzing Compute Performance for Modern NVIDIA and AMD GPUs  1,917 views

On Dynamic Load Balancing on Graphics Processors  1,917 views

Multilayered Abstractions for Partial Differential Equations  1,917 views

GPU-Based Ray-Casting of Spherical Functions Applied to High Angular Resolution Diffusion Imaging  1,917 views

Motion Estimation with Non-Local Total Variation Regularization  1,916 views

Large, Pruned or Continuous Space Language Models on a GPU for Statistical Machine Translation  1,916 views

Implementing AES on GPU: Final Report  1,916 views

 

Brief statistics for this page

Titles: 100

Total views: 192,299

 

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
