2402

Views of posts on hgpu.org

Modification of self-organizing migration algorithm for OpenCL framework  33,347 views

Parallel Ray Tracing Simulations with MATLAB for Dynamic Lens Systems  13,924 views

Data Layout Pruning on GPU  11,478 views

Performance Evaluation of Container-based Virtualization for High Performance Computing Environments  8,487 views

Computing Treewidth on the GPU  8,447 views

FPGA implementation of a Convolutional Neural Network for "Wake up word" detection  8,374 views

GMM based Fisher vector calculation on GPGPU  8,298 views

Energy efficiency of finite difference algorithms on multicore CPUs, GPUs, and Intel Xeon Phi processors  8,296 views

An Efficient Load Balancing Method for Tree Algorithms  8,225 views

OpenCL Actors – Adding Data Parallelism to Actor-based Programming with CAF  8,177 views

Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices  8,096 views

Mixed Precision Solver Scalable to 16000 MPI Processes for Lattice Quantum Chromodynamics Simulations on the Oakforest-PACS System  8,012 views

GALARIO: a GPU Accelerated Library for Analysing Radio Interferometer Observations  7,978 views

OpenCL Programming by Example  7,215 views

Torch7: A Matlab-like Environment for Machine Learning  7,199 views

OpenMP Programming on Intel R Xeon Phi TM Coprocessors: An Early Performance Comparison  6,725 views

PySPH: A Python framework for SPH  6,378 views

GPU implementation of a deep learning network for image recognition tasks  6,366 views

Accelerating Radio Astronomy with Auto-Tuning  6,362 views

NaNet:a low-latency NIC enabling GPU-based, real-time low level trigger systems  6,303 views

Code Optimization Techniques for Graphics Processing Units  6,270 views

End-to-end Deep Learning of Optimization Heuristics  6,173 views

IBM Deep Learning Service  6,142 views

Monte Carlo methods for massively parallel computers  6,141 views

Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds  6,048 views

Automated Testing of Graphics Shader Compilers  6,021 views

Asynchronous Task-Based Polar Decomposition on Single Node Manycore Architectures  6,019 views

Meta Networks for Neural Style Transfer  5,931 views

Empower Sequence Labeling with Task-Aware Neural Language Model  5,844 views

Matrix inversion speed up with CUDA  5,584 views

Breaking DVB-CSA  5,452 views

CUDA Programming: A Developer’s Guide to Parallel Computing with GPUs  5,217 views

Advanced 2D Rasterization on Modern CPUs  4,781 views

BIDMach: Large-scale Learning with Zero Memory Allocation  4,764 views

Sorting with GPUs: A Survey  4,595 views

Implementing Neural Networks Efficiently  4,579 views

Report: Performance comparison between C2075 and P100 GPU cards using cosmological correlation functions  4,517 views

A Comparative Study of 2D Numerical Methods with GPU Computing  4,506 views

GPU-Accelerated Parallel Finite-Difference Time-Domain Method for Electromagnetic Waves Propagation in Unmagnetized Plasma Media  4,497 views

Implementing Level-3 BLAS Routines in OpenCL on Different Processing Units  4,492 views

Optimization of the Brillouin operator on the KNL architecture  4,475 views

Accelerating HPC codes on Intel(R) Omni-Path Architecture networks: From particle physics to Machine Learning  4,462 views

Hydra: a C++11 framework for data analysis in massively parallel platforms  4,401 views

Domain-Specific Acceleration and Auto-Parallelization of Legacy Scientific Code in FORTRAN 77 using Source-to-Source Compilation  4,369 views

Random Forests of Very Fast Decision Trees on GPU for Mining Evolving Big Data Streams  4,279 views

Scandalously Parallelizable Mesh Generation  4,255 views

Deep learning for galaxy surface brightness profile fitting  4,233 views

libWater: Heterogeneous Distributed Computing Made Easy  4,227 views

Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing  4,166 views

Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs  4,087 views

Best Practice Guide – GPGPU  4,061 views

Launch-time Optimization of OpenCL Kernels  4,060 views

gSLIC: a real-time implementation of SLIC superpixel segmentation  4,032 views

Scalable Streaming Tools for Analyzing N-body Simulations: Finding Halos and Investigating Excursion Sets in One Pass  4,015 views

Vectorized algorithm for multidimensional Monte Carlo integration on modern GPU, CPU and MIC architectures  4,008 views

Acceleration of tensor-product operations for high-order finite element methods  4,007 views

Scalable and massively parallel Monte Carlo photon transport simulations for heterogeneous computing platforms  3,907 views

Radeon PRO Solid State Graphics (SSG) API User Manual  3,857 views

Low-power System-on-Chip Processors for Energy Efficient High Performance Computing: The Texas Instruments Keystone II  3,846 views

Distributed Training Large-Scale Deep Architectures  3,809 views

Performance Comparison of GPU, DSP and FPGA implementations of image processing and computer vision algorithms in embedded systems  3,783 views

HUGO: Hierarchical mUlti-reference Genome cOmpression for aligned reads  3,780 views

Nemo: A parallelized Lagrangian particle-tracking model  3,753 views

The CUDA Handbook: A Comprehensive Guide to GPU Programming  3,711 views

Accelerating Genomics Research with OpenCL and FPGAs  3,680 views

BbmTTP: Beat-based Parallel Simulated Annealing Algorithm on GPGPUs for the Mirrored Traveling Tournament Problem  3,678 views

A Framework for Productive, Efficient and Portable Parallel Computing  3,670 views

Warps and Atomics: Beyond Barrier Synchronization in the Verification of GPU Kernels  3,645 views

Efficient 2D Software Rendering  3,628 views

Tesla vs. Xeon Phi vs. Radeon A Compiler Writer’s Perspective  3,570 views

SoAx: A generic C++ Structure of Arrays for handling Particles in HPC Codes  3,569 views

Nengo: a Python tool for building large-scale functional brain models  3,553 views

Flexible FPGA design for FDTD using OpenCL  3,533 views

Parallel Neural Network Training with OpenCL  3,514 views

Towards Portable Performance for Explicit Hydrodynamics Codes  3,459 views

An OpenCL Method of Parallel Sorting Algorithms for GPU Architecture  3,444 views

Theano: Deep Learning on GPUs with Python  3,422 views

High Performance Algorithms to Improve the Runtime Computation of Spacecraft Trajectories  3,393 views

Synkhronos: a Multi-GPU Theano Extension for Data Parallelism  3,382 views

DTAM: Dense tracking and mapping in real-time  3,361 views

Fast Parallel Sorting Algorithms on GPUs  3,353 views

ChainerMN: Scalable Distributed Deep Learning Framework  3,344 views

A Highly Extensible Framework for Molecule Dynamic Simulation on GPUs  3,341 views

cudaMap: a GPU accelerated program for gene expression connectivity mapping  3,321 views

k+-buffer: Fragment Synchronized k-buffer  3,310 views

Robust GPGPU plugin development for RapidMiner  3,302 views

A Study of Time and Energy Efficient Algorithms for Parallel and Heterogeneous Computing  3,286 views

The Parallel Bayesian Toolbox for High-performance Bayesian Filtering in Metrology  3,273 views

Hybrid Fortran: High Productivity GPU Porting Framework Applied to Japanese Weather Prediction Model  3,263 views

OpenCL Programming Guide  3,261 views

Data Coherence Analysis and Optimization for Heterogeneous Computing  3,246 views

Graphics Processing Units in Acceleration of Bandwidth Selection for Kernel Density Estimation  3,233 views

On Pre-Trained Image Features and Synthetic Images for Deep Learning  3,233 views

BigKernel — High Performance CPU-GPU Communication Pipelining for Big Data-style Applications  3,195 views

Enabling High Performance Computing in Cloud Infrastructure using Virtualized GPUs  3,183 views

PCIeHLS: an OpenCL HLS framework  3,175 views

GPU Passthrough Performance: A Comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL Applications  3,173 views

GooFit 2.0  3,173 views

Deep and Shallow convections in Atmosphere Models on Intel Xeon Phi Coprocessor Systems  3,172 views

A Dynamic Hash Table for the GPU  3,156 views

 

Brief statistics for this page

Titles: 100

Total views: 514337

 

Most viewed items:
Page 1 of 11012345...102030...Last »

* * *

* * *

HGPU group © 2010-2018 hgpu.org

All rights belong to the respective authors

Contact us: