Views of posts on hgpu.org

OpenCL-Darknet: implementation and optimization of OpenCL-based deep learning object detection framework  2,109 views

OpenCL-accelerated Point Feature Histogram and Its Application in Railway Track Point Cloud Data Processing  2,109 views

Parallel Algorithms for Hybrid Multi-core CPU-GPU Implementations of Component Labelling in Critical Phase Models  2,108 views

Fast Monte Carlo Simulation for Patient-specific CT/CBCT Imaging Dose Calculation  2,108 views

Development of a GPU-based Monte Carlo dose calculation code for coupled electron-photon transport  2,108 views

A Domain Specific Approach to Heterogeneous Computing: From Availability to Accessibility  2,108 views

Parallel Unsmoothed Aggregation Algebraic Multigrid Algorithms on GPUs  2,108 views

Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks  2,107 views

Massively parallel approximate Gaussian process regression  2,107 views

A fast Texture-by-numbers synthesis method based on texture optimization  2,107 views

Learning hash codes for efficient content reuse detection  2,107 views

DeepBach: a Steerable Model for Bach chorales generation  2,107 views

A GPU accelerated storage system  2,107 views

Sparse matrix solvers on the GPU: conjugate gradients and multigrid  2,107 views

Parallel Benefit on Different Programming Paradigms  2,107 views

Early Experiences With The OpenMP Accelerator Model  2,106 views

BEAGLE: an Application Programming Interface and High-Performance Computing Library for Statistical Phylogenetics  2,106 views

Computing finite models using free Boolean generators  2,106 views

A Survey Of Architectural Approaches for Data Compression in Cache and Main Memory Systems  2,106 views

ProGraML: Graph-based Deep Learning for Program Optimization and Analysis  2,106 views

Serial and Parallel Bayesian Spam Filtering using Aho-Corasick and PFAC  2,106 views

Lightweight modular staging: a pragmatic approach to runtime code generation and compiled DSLs  2,105 views

Real-time colouring and filtering with graphics shaders  2,105 views

CUDACLAW: a Data Parallel Solution Framework for Hyperbolic PDEs  2,105 views

Adding GPU Computing to Computer Organization Courses  2,105 views

On the Characterization of OpenCL Dwarfs on Fixed and Reconfigurable Platforms  2,105 views

GPU-based NSEC3 Hash Breaking  2,104 views

Efficient Parallel Proximity Queries and an Application to Highly Complex Motion Planning Problems with Many Narrow Passages  2,104 views

Declarative Parallel Programming for GPUs  2,104 views

Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking  2,104 views

Fast quantum Monte Carlo on a GPU  2,104 views

Performance Tradeoff Spectrum of Integer and Floating Point Applications  2,104 views

On the Use of Remote GPUs and Low-Power Processors for the Acceleration of Scientific Applications  2,104 views

Graphics Processing Units and High-Dimensional Optimization  2,104 views

Design and Storage Optimization of GPU-based Parallel Program of Image Registration for Remote Sensing  2,104 views

A GPU-Based Accelerator for Chinese Word Segmentation  2,103 views

Neural Networks through Shared Maps in Mobile Devices  2,103 views

Algorithms for Rapid Characterization and Optimization of Aperture and Reflector Antennas  2,103 views

Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation  2,103 views

Enhancing and Porting the HPC-Lab Snow Simulator to OpenCL on Mobile Platforms  2,103 views

A CUDA Back-End for the Equelle Compiler  2,103 views

A Novel Implementation of QuickHull Algorithm on the GPU  2,102 views

Ultra-fast treatment plan optimization for volumetric modulated arc therapy (VMAT)  2,102 views

Finding, Measuring, and Reducing Inefficiencies in Contemporary Computer Systems  2,102 views

Programming Abstractions and Optimization Techniques for GPU-based Heterogeneous Systems  2,102 views

Practical CFD Simulations on Programmable Graphics Hardware using SMAC  2,102 views

Real-time adaptive fluid simulation with complex boundaries  2,102 views

Accelerating Kirchhoff Migration by CPU and GPU Cooperation  2,102 views

Efficient GPU-Based Texture Interpolation using Uniform B-Splines  2,102 views

Attaining system performance points: revisiting the end-to-end argument in system design for heterogeneous many-core systems  2,102 views

Machine Learning at the Limit  2,101 views

Classification-based Financial Markets Prediction using Deep Neural Networks  2,101 views

Architecting an LTE Base Station with Graphics Processing Units  2,100 views

Astrophysical Supercomputing with GPUs: Critical Decisions for Early Adopters  2,100 views

A Code Optimization Framework for Performance Portability of GPU Kernels onto Custom Accelerators  2,100 views

FMM-based vortex method for simulation of isotropic turbulence on GPUs, compared with a spectral method  2,100 views

A compiler framework for optimization of affine loop nests for GPGPUs  2,100 views

GPU-Based Local-Dimming for Power Efficient Imaging  2,100 views

Characterization of FPGA-based High Performance Computers  2,100 views

GPU-accelerated MRF segmentation algorithm for SAR images  2,099 views

Optimizing Deep CNN-Based Queries over Video Streams at Scale  2,099 views

Parallel Graph Mining with GPUs  2,099 views

Code Optimization on Kepler GPUs and Xeon Phi  2,099 views

D5.5.3 – Design and implementation of the SIMD-MIMD GPU architecture  2,099 views

Parallel Approach for Longest Common Subsequence problem on GPU  2,099 views

An adaptive Expectation-Maximization algorithm with GPU implementation for electron cryomicroscopy  2,098 views

Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET  2,098 views

Accelerating Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation with Nvidia CUDA Compatible Devices  2,098 views

Hybrid MPI/GPU Interpolation for Grid DEM Construction  2,098 views

Finite Difference Time Domain (FDTD) Simulations Using Graphics Processors  2,098 views

MGPUSim: Enabling Multi-GPU Performance Modeling and Optimization  2,098 views

Signed distance transform using graphics hardware  2,097 views

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation  2,097 views

GPU-Mapping: Robotic Map Building with Graphical Multiprocessors  2,097 views

Learning Representation for Scene Understanding: Epitomes, CRFs, and CNNs  2,097 views

Performance and Portability of Accelerated Lattice Boltzmann Applications with OpenACC  2,097 views

A Reduction of the Elastic Net to Support Vector Machines with an Application to GPU Computing  2,097 views

Improvements to Physically Based Cloth Simulation  2,096 views

Fast Neural Network Training on General Purpose Computers  2,096 views

Visualization in the Einstein Year 2005: a case study on explanatory and illustrative visualization of relativity and astrophysics  2,096 views

Adding special-purpose processor support to the Erlang VM  2,096 views

Hermes: an integrated CPU/GPU microarchitecture for IP routing  2,096 views

Fast CT Image Processing using Parallelized Non-local Means  2,096 views

Gauge Field Generation on Large-Scale GPU-Enabled Systems  2,096 views

Volume-preserving FFD for programmable graphics hardware  2,095 views

Development of JavaScript-based deep learning platform and application to distributed training  2,095 views

Improving GPU programming models through hardware cache coherence  2,095 views

A GPU-Accelerated Framework for Image Processing and Computer Vision  2,095 views

Enabling OpenCL on a Configurable, VLIW Chip-Multiprocessor  2,095 views

Cluster-Level Tuning of a Shallow Water Equation Solver on the Intel MIC Architecture  2,095 views

A Comparative Study of Neighborhood Filters for Artifact Reduction in Iterative Low-Dose CT  2,095 views

Split tiling for GPUs: automatic parallelization using trapezoidal tiles  2,094 views

GPU-Accelerated Nearest Neighbor Search for 3D Registration  2,094 views

Grids, Clouds and Virtualization  2,094 views

Algorithms and Data Structures for Interactive Ray Tracing on Commodity Hardware  2,094 views

Benchmarking Parallel Performance on Many-Core Processors  2,094 views

Parallel Game Tree Search Using GPU  2,094 views

Real-Time Automatic Object Classification and Tracking using Genetic Programming and NVIDIA CUDA  2,094 views

Performance Evaluations of Document-Oriented Databases using GPU and Cache Structure  2,094 views

GPU-based Cloud Computing for Comparing the Structure of Protein Binding Sites  2,093 views

 

Brief statistics for this page

Titles: 100

Total views: 210,095

 

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
