
Views of posts on hgpu.org

An Introduction to the OpenCL Programming Model  2,878 views

GPU Accelerated Conjunction Assessment with Applications to Formation Flight and Space Debris Tracking  2,878 views

Fast Hydraulic and Thermal Erosion on GPU  2,878 views

FLASH: Randomized Algorithms Accelerated over CPU-GPU for Ultra-High Dimensional Similarity Search  2,877 views

Parallel Hashing, Compression and Encryption with OpenCL under OS X  2,877 views

Load-Balanced Multi-GPU Ambient Occlusion for Direct Volume Rendering  2,877 views

OCLoptimizer: An Iterative Optimization Tool for OpenCL  2,876 views

A Survey of Techniques For Improving Energy Efficiency in Embedded Computing Systems  2,874 views

Material Removal Simulation and Cutting Force Prediction of Multi-Axis Machining Processes on General-Purpose Graphics Processing Units  2,874 views

GPU Accelerated Keccak (SHA3) Algorithm  2,873 views

PROST: Parallel robust online simple tracking  2,870 views

Importance of Explicit Vectorization for CPU and GPU Software Performance  2,868 views

Fast BVH Construction on GPUs  2,867 views

Run-time Image and Video Resizing Using CUDA-enabled GPUs  2,867 views

Deep Dynamic Neural Networks for Gesture Segmentation and Recognition  2,867 views

FuzzyGPU: a fuzzy arithmetic library for GPU  2,862 views

Scaling Deep Learning on Multiple In-Memory Processors  2,861 views

Fast algorithm of ray tracing based on KD-tree structure  2,860 views

Fast Algorithms for Convolutional Neural Networks  2,858 views

Wilson and Domainwall Kernels on Oakforest-PACS  2,853 views

Generating Custom Code for Efficient Query Execution on Heterogeneous Processors  2,851 views

An Approach to Efficient FEM Simulations on Graphics Processing Units Using CUDA  2,846 views

Real-time Flame Rendering with GPU and CUDA  2,844 views

Efficient Model-based 3D Tracking of Hand Articulations using Kinect  2,844 views

The MOSIX Cluster Operating System for High-Performance Computing on Linux Clusters, Multi-Clusters, GPU Clusters and Clouds  2,844 views

Data access optimized applications on the GPU using NVIDIA CUDA  2,842 views

Theano: A Python framework for fast computation of mathematical expressions  2,842 views

A Single (Unified) Shader GPU Microarchitecture for Embedded Systems  2,840 views

GASPP: A GPU-Accelerated Stateful Packet Processing Framework  2,838 views

A Novel Open Source Morphology Using GPU Processing With LTU-CUDA  2,838 views

SOAP3-dp: Fast, Accurate and Sensitive GPU-based Short Read Aligner  2,836 views

Fingerprint Local Invariant Feature Extraction on GPU with CUDA  2,834 views

P-HGRMS: A Parallel Hypergraph Based Root Mean Square Algorithm for Image Denoising  2,834 views

CFD Simulation of Jet Cooling and Implementation of Flow Solvers in GPU  2,834 views

Circular Hough Transform in OpenCL  2,834 views

Performance Analysis of GPU-based SAR and Interferometric SAR image processing  2,834 views

GPU Accelerated Molecular Dynamics Simulation, Visualization, and Analysis  2,831 views

The Yin and Yang of Processing Data Warehousing Queries on GPU Devices  2,831 views

An Incompressible Navier-Stokes Equations Solver on the GPU Using CUDA  2,830 views

CUDA Fortran for Scientists and Engineers  2,829 views

Real-Time Concurrent Linked List Construction on the GPU  2,825 views

A GPU Accelerated Navier-Stokes Solver with Multi-level Granularity for Solving Sparse Implicit Systems  2,822 views

GPU Implementations of Object Detection using HOG Features and Deformable Models  2,820 views

Genetic Algorithm Modeling with GPU Parallel Computing Technology  2,820 views

Development of a GPU-accelerated MIKE 21 Solver for Water Wave Dynamics  2,819 views

GPU-accelerated HMM for Speech Recognition  2,818 views

The Future of Accelerator Programming: Abstraction, Performance or Can We Have Both?  2,818 views

Collision Detection of Triangle Meshes using GPU  2,816 views

Fast High-Quality Volume Ray Casting with Virtual Samplings  2,816 views

Optimization Techniques on GPU: A Survey  2,812 views

3D Non-Local Means denoising via multi-GPU  2,812 views

Realtime Computation of a VST Audio Effect Plugin on the Graphics Processor  2,812 views

GPU-based cellular automata simulations of laser dynamics  2,811 views

Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs  2,810 views

pocl: A Performance-Portable OpenCL Implementation  2,807 views

Image registration on GPU  2,806 views

A Complete Description of the UnPython and Jit4GPU Framework  2,804 views

High Performance Programming for Soft Computing  2,802 views

Towards GPGPU Assisted Computing in Virtualized Environments  2,802 views

Performance of OpenCL  2,802 views

OpenCL Library for Parallel Graph Search Algorithms  2,801 views

Auto-tuning a High-Level Language Targeted to GPU Codes  2,801 views

Shared Memory Multiplexing: A Novel Way to Improve GPGPU Throughput  2,801 views

PIConGPU: A Fully Relativistic Particle-in-Cell Code for a GPU Cluster  2,801 views

24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs  2,800 views

Decompilation of LLVM IR  2,800 views

A Static Load Balancing Scheme for Parallel Volume Rendering on Multi-GPU Clusters  2,799 views

PacketShader: a GPU-accelerated software router  2,798 views

Graph Processing on GPU  2,795 views

An efficient solution for hazardous geophysical flows simulation using GPUs  2,794 views

An OpenCL(TM) Deep Learning Accelerator on Arria 10  2,794 views

MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems  2,793 views

GPU accelerating the FEniCS Project  2,792 views

GPU Accelerated Pattern Matching Algorithm for DNA Sequences to Detect Cancer using CUDA  2,790 views

Speeding Up Reinforcement Learning with Graphics Processing Units  2,790 views

GAMUT: GPU accelerated microRNA analysis to uncover target genes through CUDA-miRanda  2,789 views

OCCA: A unified approach to multi-threading languages  2,789 views

GPU-based Parallel Computation Support for Stan  2,787 views

NAS Parallel Benchmarks for GPGPUs using a Directive-based Programming Model  2,787 views

Parallelizing Word2Vec in Shared and Distributed Memory  2,786 views

Minimal models for finite particles in fluctuating hydrodynamics  2,786 views

Programming CUDA and OpenCL: A Case Study Using Modern C++ Libraries  2,784 views

Integer sorting on multicores: some (experiments and) observations  2,782 views

Multicore bundle adjustment  2,781 views

Interactive Wave Simulations  2,779 views

GPU based particle system  2,779 views

A dynamically configurable coprocessor for convolutional neural networks  2,777 views

Implementing Deep Neural Networks for Financial Market Prediction on the Intel Xeon Phi  2,777 views

GPU Based Acceleration of Telegraph Equation  2,777 views

Cross-Compiling Shading Languages  2,776 views

XBOOLE-CUDA: Fast Boolean Operations on the GPU  2,775 views

Using OpenCL to Implement Median Filtering and RSA Algorithms: Two GPGPU Application Case Studies  2,775 views

Parallel execution of a parameter sweep for molecular dynamics simulations in a hybrid GPU/CPU environment  2,775 views

The MOSIX Virtual OpenCL (VCL) Cluster Platform  2,774 views

Sparse LU Factorization for Parallel Circuit Simulation on GPU  2,770 views

Optimization of Spatial Convolution in ConvNets on Intel KNL  2,769 views

OpenCL-Z Android Released on Google Play  2,767 views

Workload Analysis and Efficient OpenCL-based Implementation of SIFT Algorithm on a Smartphone  2,767 views

Hardware Implementation and Quantization of Tiny-Yolo-v2 using OpenCL  2,765 views

A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures  2,765 views

 

Brief statistics for this page

Titles: 100

Total views: 281,792

 

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
