2402

Views of posts on hgpu.org

Parallel Computing for the Inverse of SPD matrix  4,574 views

Robust GPGPU plugin development for RapidMiner  4,570 views

Graphics Processing Units in Acceleration of Bandwidth Selection for Kernel Density Estimation  4,534 views

ScatterAlloc: Massively Parallel Dynamic Memory Allocation for the GPU  4,512 views

OpenCL vs. OpenMP: A Programmability Debate  4,511 views

BENCHIP: Benchmarking Intelligence Processors  4,502 views

Using OpenCL: Programming Massively Parallel Computers  4,497 views

Early Results of Deep Learning on the Stampede2 Supercomputer  4,492 views

Massively Parallel Suffix Array Queries and On-Demand Phrase Extraction for Statistical Machine Translation Using GPUs  4,490 views

Deterministic Sample Sort For GPUs  4,488 views

Lattice QCD on new chips: a community summary  4,487 views

Optimizing Stencil Computations for NVIDIA Kepler GPUs  4,474 views

Efficient Sparse Matrix-Vector Multiplication on x86-Based Many-Core Processors  4,465 views

Parallel Implementation of Moving Averages and Stock Market Prediction  4,465 views

GPU-Powered Coherent Beamforming  4,450 views

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs  4,439 views

Adaptation of algorithms for underwater sonar data processing to GPU-based systems  4,422 views

Adaptation of an acoustic propagation model to the parallel architecture of a graphics processor  4,410 views

Architecting SOT-RAM Based GPU Register File  4,407 views

clpeak – peak performance of your opencl device  4,403 views

Current performance gains from utilizing the GPU or the ASIC MDGRAPE-3 within an enhanced Poisson Boltzmann approach  4,401 views

Data Transfer Matters for GPU Computing  4,383 views

Anisotropic mesh coarsening and refinement on GPU architecture  4,382 views

A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units  4,371 views

BigKernel — High Performance CPU-GPU Communication Pipelining for Big Data-style Applications  4,365 views

Enabling High Performance Computing in Cloud Infrastructure using Virtualized GPUs  4,358 views

CL2QCD – Lattice QCD based on OpenCL  4,355 views

A Development Platform for Embedded Domain-Specific Languages  4,354 views

Bitmap Filter: Speeding up Exact Set Similarity Joins with Bitwise Operations  4,346 views

Hadoop+Aparapi: Making heterogenous MapReduce programming easier  4,343 views

Semi-Global Matching-Motivation, Developments and Applications  4,326 views

GPU Random Numbers via the Tiny Encryption Algorithm  4,326 views

A 3D Convex Hull Algorithm for Graphics Hardware  4,308 views

CUD@SAT: SAT Solving on GPUs  4,307 views

A survey on graphic processing unit computing for large-scale data mining  4,304 views

Introducing CURRENNT – the Munich open-source CUDA RecurREnt Neural Network Toolkit  4,301 views

An Exploratory Study of High Performance Graphics Application Programming Interfaces  4,300 views

Efficient Hash Tables on the GPU  4,295 views

Parallel Irradiance Caching on the GPU  4,284 views

OpenCL Parallel Programming Development Cookbook  4,284 views

Simulating Dam-Break Flooding with Floating Objects through Intricate City Layouts Using GPU-based SPH Method  4,280 views

A portable implementation of the radix sort algorithm in OpenCL  4,277 views

A Semi-Automated Tool Flow for Roofline Anaylsis of OpenCL Kernels on Accelerators  4,276 views

A Parallel Ant Colony Optimization Algorithm for the Travelling Salesman Problem: Improving Performance Using CUDA  4,267 views

GPU acceleration and performance of the particle-beam-dynamics code Elegant  4,259 views

You Can Type, but You Can’t Hide: A Stealthy GPU-based Keylogger  4,251 views

GPU Parallelization for Unstructured Sparse Matrix Problems with OpenMP 4.5 and OpenACC  4,246 views

Efficient Parallel RSA Decryption Algorithm for Many-core GPUs with CUDA  4,243 views

Uses of GPU Powered Interval Optimization for Parameter Identification in the Context of SO Fuel Cells  4,241 views

Multi-view Rendering Approach for Cloud-based Gaming Services  4,240 views

Performance Evaluation of R with Intel Xeon Phi Coprocessor  4,238 views

Efficient Inference For Neural Machine Translation  4,208 views

Real-Time Hair Simulation and Rendering with OpenCL and OpenGL  4,190 views

Learning Random Forests on the GPU  4,190 views

The Reconstruction Toolkit (RTK), an open-source cone-beam CT reconstruction toolkit based on the Insight Toolkit (ITK)  4,189 views

Deep API Learning  4,188 views

maxDNN: An Efficient Convolution Kernel for Deep Learning with Maxwell GPUs  4,183 views

MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph  4,182 views

Solving Linear Equations with Conjugate Gradient Method on OpenCL Platforms  4,172 views

A GPGPU Implementation of Approximate String Matching with Regular Expression Operators and Comparison with Its FPGA Implementation  4,165 views

Efficient Cubic B-spline Image Interpolation on a GPU  4,163 views

GPU Accelerated Vessel Segmentation Using Laplacian Eigenmaps  4,160 views

GPU Programming in Rust: Implementing High Level Abstractions in a Systems Level Language  4,149 views

Non-separable 2D, 3D and 4D filtering with CUDA  4,146 views

GPGPU-Aided 3D Staggered-grid Finite-difference Seismic Wave Modeling  4,141 views

Designing Scientific Applications on GPUs  4,140 views

Progressive Photon Mapping on GPUs  4,138 views

Bigger Buffer k-d Trees on Multi-Many-Core Systems  4,129 views

GPU-ABiSort: Optimal Parallel Sorting on Stream Architectures  4,126 views

Bilateral Filtering with CUDA  4,120 views

Multi-Platform LU-Decomposition Solution in OpenCL  4,117 views

Deep Learning on FPGAs: Past, Present, and Future  4,114 views

CUDA-OpenGL Interoperability to Visualize Electromagnetic Fields Calculated by FDTD  4,110 views

The Hitchhiker’s Guide to Cross-Platform OpenCL Application Development  4,103 views

Experiences Porting a Molecular Dynamics Code to GPUs on a Cray XK7  4,103 views

Offload Compiler Runtime for the Intel Xeon Phi Coprocessor  4,103 views

Professional CUDA C Programming  4,087 views

Parallel and Concurrent Programming in Haskell: Techniques for Multicore and Multithreaded Programming  4,082 views

Duplicate Detection on GPUs  4,069 views

Texturing and Modeling, Third Edition: A Procedural Approach (The Morgan Kaufmann Series in Computer Graphics)  4,060 views

GPU-accelerated computation for robust motion tracking using the CUDA framework  4,059 views

Accelerating Simulation Codes through the GeMTC Framework  4,057 views

A GPU Accelerated Algorithm for Compressive Sensing Based Image Super-Resolution  4,055 views

A Parallel Implementation of the Galerkin Method for Solving Partial Differential Equations on a Triangular Mesh  4,051 views

CudaRF: A CUDA-based Implementation of Random Forests  4,030 views

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks  4,027 views

High Performance Extreme Learning Machines: A Complete Toolbox for Big Data Applications  4,022 views

The Virtual OpenCL (VCL) Cluster Platform  4,021 views

Hierarchical belief propagation to reduce search space using CUDA for stereo and motion estimation  4,017 views

Accelerating Electron Tomography Reconstruction Algorithm ICON Using the Intel Xeon Phi Coprocessor on Tianhe-2 Supercomputer  4,001 views

GPU Parallel Collections For Scala  3,997 views

Hybrid strategy for stencil computations on the APU  3,992 views

State of the Art Report on Real-time Rendering with Hardware Tessellation  3,984 views

Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation  3,982 views

Benchmarking the Memory Hierarchy of Modern GPUs  3,979 views

Sparse Matrix-Vector Multiplication on GPU  3,977 views

CUDA Application Design and Development  3,977 views

Hybrid algorithms for efficient Cholesky decomposition and matrix inverse using multicore CPUs with GPU accelerators  3,973 views

Implementation of Just In Time Value Specialization for the Optimization of Data Parallel Kernels  3,973 views

GPU Implementation of a Deep Learning Network for Financial Prediction  3,968 views

 

Brief statistics for this page

Titles: 100

Total views: 423272

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org