2402

Views of posts on hgpu.org

Neurokernel: An Open Source Platform for Emulating the Fruit Fly Brain  2,555 views

A Scalable Lane Detection Algorithm on COTSs with OpenCL  2,555 views

Data-rich astronomy: mining synoptic sky surveys  2,555 views

A multi-Teraflop Constituency Parser using GPUs  2,554 views

CUDA 2D Stencil Computations for the Jacobi Method  2,553 views

Integrated GPUs: how useful are they in HPC?  2,552 views

An evaluation of GPU acceleration for sparse reconstruction  2,552 views

Face Detection CUDA Accelerating  2,552 views

Efficient Data Management for GPU Databases  2,551 views

Hyper neural network on OpenCL  2,551 views

Using GPUs for Machine Learning Algorithms  2,550 views

Compiler Fuzzing through Deep Learning  2,549 views

Fast and Robust Linear Motion Deblurring  2,549 views

Demystifying GPU microarchitecture through microbenchmarking  2,549 views

Implementation of Kirchhoff prestack depth migration on GPU  2,548 views

GPU Accelerated 3-D Modeling and Simulation of a Blended Kinetic Impact and Nuclear Subsurface Explosion  2,548 views

Input-Aware Auto-Tuning of Compute-Bound HPC Kernels  2,547 views

Improved Finite Difference Schemes for a 3-D Viscothermal Wave Equation on a GPU  2,547 views

Effective Multi-Modal Retrieval based on Stacked Auto-Encoders  2,546 views

Variational Bayesian Image Super-Resolution with GPU Acceleration  2,545 views

B-Calm: an Open-Source Multi-Gpu-Based 3D-FDTD with Multi-Pole Dispersion for Plasmonics  2,545 views

OpenNMT: Open-Source Toolkit for Neural Machine Translation  2,544 views

Experience of parallelizing cryo-EM 3D reconstruction on a CPU-GPU heterogeneous system  2,543 views

Anisotropic Kuwahara Filtering on the GPU  2,543 views

OpenCL Cryptographic Library  2,542 views

cf4ocl: a C framework for OpenCL  2,542 views

GPU-accelerated triangle-triangle intersection tester algorithm  2,542 views

Application of GPU Smooth Particle Hydrodynamics: Wave Runup and Overtopping on Composite Slopes  2,541 views

Point Based Approximate Color Bleeding With Cuda  2,541 views

Beyond programmable shading (parts I and II)  2,540 views

Fast and accurate digital signal processing realized with GPGPU technology  2,540 views

A Framework for General Sparse Matrix-Matrix Multiplication on GPUs and Heterogeneous Processors  2,539 views

GPU Monte Carlo scatter calculations for Cone Beam Computed Tomography  2,538 views

Finding Longest Common Subsequences by GPU-Based Parallel Ant Colony Optimization  2,538 views

Understanding the efficiency of GPU algorithms for matrix-matrix multiplication  2,538 views

Efficient JPEG2000 EBCOT Context Modeling for Massively Parallel Architectures  2,538 views

Loo.py: From Fortran to performance via transformation and substitution rules  2,538 views

gR: A GPU-based Router  2,537 views

Accelerating the ANSYS Direct Sparse Solver with GPUs  2,537 views

Improving Performance Portability in OpenCL Programs  2,537 views

FlexTensor: An Automatic Schedule Exploration and Optimization Framework for Tensor Computation on Heterogeneous System  2,537 views

High Performance Implementation of Ultrasound Color Doppler Imaging on GPU platform  2,536 views

Transparent CPU-GPU Collaboration for Data-Parallel Kernels on Heterogeneous Systems  2,535 views

Multi-Tenant Virtual GPUs for Optimising Performance of a Financial Risk Application  2,535 views

Processing Posting Lists Using OpenCL  2,534 views

High performance in silico virtual drug screening on many-core processors  2,533 views

3D GPU Architecture using Cache Stacking: Performance, Cost, Power and Thermal analysis  2,533 views

GPU Parallel Statistical and Cube Test Analysis of the SHA-3 Finalist Candidate Hash Functions  2,529 views

Dynamic Programming with CUDA – Part II  2,528 views

GPU-based password cracking  2,528 views

Pseudorandom number generation on the GPU  2,527 views

GPUWattch: Enabling Energy Optimizations in GPGPUs  2,526 views

Compiler and runtime techniques for bulk-synchronous programming models on CPU architectures  2,526 views

GPU Accelerated NIDS Search  2,526 views

PyFR: An Open Source Framework for Solving Advection-Diffusion Type Problems on Streaming Architectures using the Flux Reconstruction Approach  2,525 views

RASR/NN: The RWTH Neural Network Toolkit for Speech Recognition  2,525 views

A CUDA Monte Carlo simulator for radiation therapy dosimetry based on Geant4  2,524 views

Implementation of the SYCL Heterogeneous Computing Library  2,524 views

GPU Programming in Functional Languages: A Comparison of Haskell GPU Embedded Domain Specific Languages  2,523 views

GPU-based high-performance computing for radiation therapy  2,522 views

GPU-Based Translation-Invariant 2D Discrete Wavelet Transform for Image Processing  2,522 views

File I/O on Intel Xeon Phi Coprocessors: RAM disks, VirtIO, NFS and Lustre  2,521 views

Acceleration Techniques for GPU-based Volume Rendering  2,520 views

3D FFT on a Single FPGA  2,520 views

Contract-Based General-Purpose GPU Programming  2,519 views

Efficient GPU-based Graph Cuts for Stereo Matching  2,519 views

Efficient and Scalable k-Means on GPUs  2,519 views

Parallel Implementations for Solving Shortest Path Problem using Bellman-Ford  2,518 views

Molecular dynamics recipes for genome research  2,518 views

Deep learning with COTS HPC systems  2,518 views

Comparison of Random Number Generators in Particle Swarm Optimization Algorithm  2,518 views

Automated Tool to Generate Parallel CUDA code from a Serial C Code  2,517 views

A Multi-GPU Programming Library for Real-Time Applications  2,516 views

A Case Study in Using OpenCL on FPGAs: Creating an Open-Source Accelerator of the AutoDock Molecular Docking Software  2,516 views

GPU accelerated fast FEM deformation simulation  2,516 views

A framework for cost based optimization of hybrid CPU/GPU query plans in database systems  2,515 views

Implementing Interactive 3D Segmentation on CUDA Using Graph-Cuts and Watershed Transformation  2,515 views

Performance of FORTRAN and C GPU Extensions for a Benchmark Suite of Fourier Pseudospectral Algorithms  2,515 views

Parallel Unsteady Flow Line Integral Convolution for High-Performance Dense Visualization  2,515 views

Efficient Multi-GPU Computation of All-Pairs Shortest Paths  2,514 views

OpenCL-Accelerated Simplified General Perturbations 4 Algorithm  2,514 views

Auto-Tuning of Level 1 and Level 2 BLAS for GPUs  2,513 views

CHO: A Benchmark Suite for OpenCL-based FPGA Accelerators  2,513 views

HPerf: A Lightweight Profiler for Task Distribution on CPU+GPU Platforms  2,512 views

Exploiting Heterogeneous Systems: Keccak on OpenCL  2,511 views

Monte Carlo Path Tracing with OpenCL  2,511 views

Computer Simulation of Dark Matter Effects on Galaxy Rotation  2,509 views

Particle-based volume rendering  2,508 views

Performance Study of Satellite Image Processing on Graphics Processors Unit Using CUDA  2,508 views

Towards Interactive Visual Exploration of Parallel Programs using a Domain-specific Language  2,508 views

Solving the Caputo Fractional Reaction-Diffusion Equation on GPU  2,507 views

A GPGPU-based Pipeline for Accelerated Rendering of Point Clouds  2,507 views

Instructions’ Latencies Characterization for NVIDIA GPGPUs  2,506 views

GPU Accelerated Lambert Solution Methods for the Orbital Targeting Problem  2,506 views

Fast GPU-based Locality Sensitive Hashing for K-Nearest Neighbor Computation  2,505 views

Gunrock: A High-Performance Graph Processing Library on the GPU  2,505 views

Glow: Graph Lowering Compiler Techniques for Neural Networks  2,504 views

Reducing GPU Offload Latency via Fine-Grained CPU-GPU Synchronization  2,503 views

Implementation of K-shortest Path Algorithm in GPU Using CUDA  2,503 views

PG-PuReMD: A Parallel-GPU Reactive Molecular Dynamics Package  2,503 views

 

Brief statistics for this page

Titles: 100

Total views: 252894

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: