2402

Views of posts on hgpu.org

Modification of self-organizing migration algorithm for OpenCL framework  34,158 views

The OoO VLIW JIT Compiler for GPU Inference  18,403 views

Parallel Ray Tracing Simulations with MATLAB for Dynamic Lens Systems  15,261 views

Data Layout Pruning on GPU  12,272 views

Domain-Specific Code Language Models: Unraveling the Potential for HPC Codes and Tasks  10,914 views

Code Optimization Techniques for Graphics Processing Units  10,619 views

FPGA implementation of a Convolutional Neural Network for "Wake up word" detection  10,167 views

OpenMP Programming on Intel R Xeon Phi TM Coprocessors: An Early Performance Comparison  9,984 views

Matrix inversion speed up with CUDA  9,932 views

Performance Evaluation of Container-based Virtualization for High Performance Computing Environments  9,850 views

OpenCL Programming by Example  9,685 views

Computing Treewidth on the GPU  9,673 views

[Serbian] The Methods and Procedures for Accelerating Operations and Queries in Large Database Systems and Data Warehouse (Big Data Systems)  9,551 views

Mixed Precision Solver Scalable to 16000 MPI Processes for Lattice Quantum Chromodynamics Simulations on the Oakforest-PACS System  9,551 views

Energy efficiency of finite difference algorithms on multicore CPUs, GPUs, and Intel Xeon Phi processors  9,514 views

OpenCL Actors – Adding Data Parallelism to Actor-based Programming with CAF  9,455 views

An Efficient Load Balancing Method for Tree Algorithms  9,368 views

GMM based Fisher vector calculation on GPGPU  9,324 views

Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices  9,299 views

GALARIO: a GPU Accelerated Library for Analysing Radio Interferometer Observations  9,130 views

Breaking DVB-CSA  8,655 views

Torch7: A Matlab-like Environment for Machine Learning  8,524 views

End-to-end Deep Learning of Optimization Heuristics  8,517 views

Experiences Building an MLIR-based SYCL Compiler  8,204 views

Accelerating Radio Astronomy with Auto-Tuning  7,869 views

GPU implementation of a deep learning network for image recognition tasks  7,858 views

4kUHD H264 wireless live video streaming using CUDA  7,833 views

Monte Carlo methods for massively parallel computers  7,718 views

PySPH: A Python framework for SPH  7,687 views

GPU Octrees and Optimized Search  7,596 views

Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds  7,500 views

CUDA Programming: A Developer’s Guide to Parallel Computing with GPUs  7,469 views

On Optimizing Complex Stencils on GPUs  7,461 views

Distributed wideband software-defined radio receiver for heterogeneous systems  7,393 views

Automated Testing of Graphics Shader Compilers  7,380 views

Asynchronous Task-Based Polar Decomposition on Single Node Manycore Architectures  7,373 views

Compoundly weighted Voronoi: a sequential and parallel implementation  7,300 views

Meta Networks for Neural Style Transfer  7,293 views

IBM Deep Learning Service  7,248 views

NaNet:a low-latency NIC enabling GPU-based, real-time low level trigger systems  7,205 views

Empower Sequence Labeling with Task-Aware Neural Language Model  7,186 views

Understanding the Topics and Challenges of GPU Programming by Classifying and Analyzing Stack Overflow Posts  7,140 views

Fast Parallel Sorting Algorithms on GPUs  6,986 views

A code-based analytical approach for using separate device coprocessors in computing systems  6,905 views

An OpenCL Method of Parallel Sorting Algorithms for GPU Architecture  6,731 views

Quasi-real-time analysis of dynamic near field scattering data using a graphics processing unit  6,559 views

A Common GPU n-Dimensional Array for Python and C  6,549 views

GPU-Accelerated Parallel Finite-Difference Time-Domain Method for Electromagnetic Waves Propagation in Unmagnetized Plasma Media  6,461 views

Random Forests of Very Fast Decision Trees on GPU for Mining Evolving Big Data Streams  6,267 views

gSLIC: a real-time implementation of SLIC superpixel segmentation  6,224 views

A Comparative Study of 2D Numerical Methods with GPU Computing  6,202 views

Interactive Soft Tissue for Surgical Simulation  6,169 views

SoAx: A generic C++ Structure of Arrays for handling Particles in HPC Codes  6,120 views

An octree-based proxy for collision detection in large-scale particle systems  6,120 views

Performance Comparison of GPU, DSP and FPGA implementations of image processing and computer vision algorithms in embedded systems  6,078 views

Implementing Level-3 BLAS Routines in OpenCL on Different Processing Units  6,052 views

GPU-PIV  6,026 views

Deep learning for galaxy surface brightness profile fitting  5,959 views

The CUDA Handbook: A Comprehensive Guide to GPU Programming  5,955 views

libWater: Heterogeneous Distributed Computing Made Easy  5,900 views

Advanced 2D Rasterization on Modern CPUs  5,858 views

Domain-Specific Acceleration and Auto-Parallelization of Legacy Scientific Code in FORTRAN 77 using Source-to-Source Compilation  5,834 views

Sorting with GPUs: A Survey  5,809 views

Accelerating HPC codes on Intel(R) Omni-Path Architecture networks: From particle physics to Machine Learning  5,770 views

Report: Performance comparison between C2075 and P100 GPU cards using cosmological correlation functions  5,723 views

Accelerating Genomics Research with OpenCL and FPGAs  5,722 views

Optimization of the Brillouin operator on the KNL architecture  5,694 views

BIDMach: Large-scale Learning with Zero Memory Allocation  5,683 views

Fast in-place sorting with CUDA based on bitonic sort  5,664 views

DTAM: Dense tracking and mapping in real-time  5,606 views

Industrial Robot Collision Handling in Harsh Environments  5,595 views

Build and Travel KD-Tree with CUDA  5,587 views

Language Modeling with Gated Convolutional Networks  5,587 views

Synkhronos: a Multi-GPU Theano Extension for Data Parallelism  5,579 views

Usage of GPU in LS-DYNA  5,565 views

Scalable Streaming Tools for Analyzing N-body Simulations: Finding Halos and Investigating Excursion Sets in One Pass  5,550 views

Hydra: a C++11 framework for data analysis in massively parallel platforms  5,528 views

Best Practice Guide – GPGPU  5,509 views

OpenCL Programming Guide  5,467 views

GPU sample sort  5,453 views

Vectorized algorithm for multidimensional Monte Carlo integration on modern GPU, CPU and MIC architectures  5,442 views

vCUDA Framework Development for GPU Virtualization  5,429 views

Low-power System-on-Chip Processors for Energy Efficient High Performance Computing: The Texas Instruments Keystone II  5,382 views

Parallel Medical Image Reconstruction: From Graphics Processors to Grids  5,335 views

Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs  5,323 views

Acceleration of tensor-product operations for high-order finite element methods  5,309 views

Radeon PRO Solid State Graphics (SSG) API User Manual  5,309 views

Scalable and massively parallel Monte Carlo photon transport simulations for heterogeneous computing platforms  5,307 views

Collision Detection Based on Fuzzy Scene Subdivision  5,280 views

Launch-time Optimization of OpenCL Kernels  5,278 views

Simulation of Biological Tissue using Mass-Spring-Damper Models  5,270 views

Implementing Neural Networks Efficiently  5,261 views

Efficient Algorithms for Sorting on GPUs  5,260 views

On Pre-Trained Image Features and Synthetic Images for Deep Learning  5,183 views

A Dynamic Hash Table for the GPU  5,181 views

A novel sorting algorithm for many-core architectures based on adaptive bitonic sort  5,161 views

Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs  5,137 views

Parallel Neural Network Training with OpenCL  5,136 views

Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort  5,125 views

Cue-independent extending inverse kinematics for robust pose estimation in 3D point clouds  5,119 views

 

Brief statistics for this page

Titles: 100

Total views: 735792

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: