Views of posts on hgpu.org
Parallel Computing for the Inverse of SPD matrix 4,574 views
Robust GPGPU plugin development for RapidMiner 4,570 views
Graphics Processing Units in Acceleration of Bandwidth Selection for Kernel Density Estimation 4,534 views
ScatterAlloc: Massively Parallel Dynamic Memory Allocation for the GPU 4,512 views
OpenCL vs. OpenMP: A Programmability Debate 4,511 views
BENCHIP: Benchmarking Intelligence Processors 4,502 views
Using OpenCL: Programming Massively Parallel Computers 4,497 views
Early Results of Deep Learning on the Stampede2 Supercomputer 4,492 views
Deterministic Sample Sort For GPUs 4,488 views
Lattice QCD on new chips: a community summary 4,487 views
Optimizing Stencil Computations for NVIDIA Kepler GPUs 4,474 views
Efficient Sparse Matrix-Vector Multiplication on x86-Based Many-Core Processors 4,465 views
Parallel Implementation of Moving Averages and Stock Market Prediction 4,465 views
GPU-Powered Coherent Beamforming 4,450 views
Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs 4,439 views
Adaptation of algorithms for underwater sonar data processing to GPU-based systems 4,422 views
Adaptation of an acoustic propagation model to the parallel architecture of a graphics processor 4,410 views
Architecting SOT-RAM Based GPU Register File 4,407 views
clpeak – peak performance of your opencl device 4,403 views
Data Transfer Matters for GPU Computing 4,383 views
Anisotropic mesh coarsening and refinement on GPU architecture 4,382 views
A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units 4,371 views
BigKernel — High Performance CPU-GPU Communication Pipelining for Big Data-style Applications 4,365 views
Enabling High Performance Computing in Cloud Infrastructure using Virtualized GPUs 4,358 views
CL2QCD – Lattice QCD based on OpenCL 4,355 views
A Development Platform for Embedded Domain-Specific Languages 4,354 views
Bitmap Filter: Speeding up Exact Set Similarity Joins with Bitwise Operations 4,346 views
Hadoop+Aparapi: Making heterogenous MapReduce programming easier 4,343 views
Semi-Global Matching – Motivation, Developments and Applications 4,326 views
GPU Random Numbers via the Tiny Encryption Algorithm 4,326 views
A 3D Convex Hull Algorithm for Graphics Hardware 4,308 views
CUD@SAT: SAT Solving on GPUs 4,307 views
A survey on graphic processing unit computing for large-scale data mining 4,304 views
Introducing CURRENNT – the Munich open-source CUDA RecurREnt Neural Network Toolkit 4,301 views
An Exploratory Study of High Performance Graphics Application Programming Interfaces 4,300 views
Efficient Hash Tables on the GPU 4,295 views
Parallel Irradiance Caching on the GPU 4,284 views
OpenCL Parallel Programming Development Cookbook 4,284 views
A portable implementation of the radix sort algorithm in OpenCL 4,277 views
A Semi-Automated Tool Flow for Roofline Analysis of OpenCL Kernels on Accelerators 4,276 views
GPU acceleration and performance of the particle-beam-dynamics code Elegant 4,259 views
You Can Type, but You Can’t Hide: A Stealthy GPU-based Keylogger 4,251 views
GPU Parallelization for Unstructured Sparse Matrix Problems with OpenMP 4.5 and OpenACC 4,246 views
Efficient Parallel RSA Decryption Algorithm for Many-core GPUs with CUDA 4,243 views
Uses of GPU Powered Interval Optimization for Parameter Identification in the Context of SO Fuel Cells 4,241 views
Multi-view Rendering Approach for Cloud-based Gaming Services 4,240 views
Performance Evaluation of R with Intel Xeon Phi Coprocessor 4,238 views
Efficient Inference For Neural Machine Translation 4,208 views
Real-Time Hair Simulation and Rendering with OpenCL and OpenGL 4,190 views
Learning Random Forests on the GPU 4,190 views
Deep API Learning 4,188 views
maxDNN: An Efficient Convolution Kernel for Deep Learning with Maxwell GPUs 4,183 views
Solving Linear Equations with Conjugate Gradient Method on OpenCL Platforms 4,172 views
Efficient Cubic B-spline Image Interpolation on a GPU 4,163 views
GPU Accelerated Vessel Segmentation Using Laplacian Eigenmaps 4,160 views
GPU Programming in Rust: Implementing High Level Abstractions in a Systems Level Language 4,149 views
Non-separable 2D, 3D and 4D filtering with CUDA 4,146 views
GPGPU-Aided 3D Staggered-grid Finite-difference Seismic Wave Modeling 4,141 views
Designing Scientific Applications on GPUs 4,140 views
Progressive Photon Mapping on GPUs 4,138 views
Bigger Buffer k-d Trees on Multi-Many-Core Systems 4,129 views
GPU-ABiSort: Optimal Parallel Sorting on Stream Architectures 4,126 views
Bilateral Filtering with CUDA 4,120 views
Multi-Platform LU-Decomposition Solution in OpenCL 4,117 views
Deep Learning on FPGAs: Past, Present, and Future 4,114 views
CUDA-OpenGL Interoperability to Visualize Electromagnetic Fields Calculated by FDTD 4,110 views
The Hitchhiker’s Guide to Cross-Platform OpenCL Application Development 4,103 views
Experiences Porting a Molecular Dynamics Code to GPUs on a Cray XK7 4,103 views
Offload Compiler Runtime for the Intel Xeon Phi Coprocessor 4,103 views
Professional CUDA C Programming 4,087 views
Parallel and Concurrent Programming in Haskell: Techniques for Multicore and Multithreaded Programming 4,082 views
Duplicate Detection on GPUs 4,069 views
GPU-accelerated computation for robust motion tracking using the CUDA framework 4,059 views
Accelerating Simulation Codes through the GeMTC Framework 4,057 views
A GPU Accelerated Algorithm for Compressive Sensing Based Image Super-Resolution 4,055 views
CudaRF: A CUDA-based Implementation of Random Forests 4,030 views
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks 4,027 views
High Performance Extreme Learning Machines: A Complete Toolbox for Big Data Applications 4,022 views
The Virtual OpenCL (VCL) Cluster Platform 4,021 views
Hierarchical belief propagation to reduce search space using CUDA for stereo and motion estimation 4,017 views
GPU Parallel Collections For Scala 3,997 views
Hybrid strategy for stencil computations on the APU 3,992 views
State of the Art Report on Real-time Rendering with Hardware Tessellation 3,984 views
Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation 3,982 views
Benchmarking the Memory Hierarchy of Modern GPUs 3,979 views
Sparse Matrix-Vector Multiplication on GPU 3,977 views
CUDA Application Design and Development 3,977 views
Implementation of Just In Time Value Specialization for the Optimization of Data Parallel Kernels 3,973 views
GPU Implementation of a Deep Learning Network for Financial Prediction 3,968 views
Titles: 100
Total views: 423,272
- Programming - 186,231 views
- Login - 172,134 views
- User dashboard - 98,584 views
- Paper titles list - 92,722 views
- Add new event - 69,206 views
- Add new post - 62,798 views
- Register - 53,100 views
- Statistics - 44,246 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,520 views
- Books on OpenCL and CUDA - 31,164 views