Views of posts on hgpu.org
A Co-Design Framework with OpenCL Support for Low-Energy Wide SIMD Processor 2,033 views
A Framework for Profiling and Performance Monitoring of Heterogeneous Applications 2,033 views
Heterogeneous GPU&CPU cluster for High Performance Computing in cryptography 2,033 views
CULA: hybrid GPU accelerated linear algebra routines 2,033 views
A Similarity-Based Analysis Tool for Scientific Application Porting 2,033 views
Developing a CUDA solver for large sparse matrices for MARIN 2,033 views
Using Image Morphing for Memory-Efficient Impostor Rendering on GPU 2,032 views
Interactive Computer Graphics: A Top-Down Approach Using OpenGL (5th Edition) 2,032 views
Investigation of heterogeneous computing through novel parallel programming platforms 2,032 views
A GPU Implementation of Local Search Operators for Symmetric Travelling Salesman Problem 2,032 views
Kd-tree Based N-Body Simulations with Volume-Mass Heuristic on the GPU 2,032 views
Solutions For Optimizing The Radix Sort Algorithmic Function Using The Compute Unified Device Architecture 2,032 views
GIS Polygon Overlay Processing: New Parallel Algorithm and System Prototype 2,031 views
Fast, parallel implementation of particle filtering on the GPU architecture 2,031 views
Automatic Data Layout Generation and Kernel Mapping for CPU+GPU Architectures 2,031 views
Massively parallel read mapping on GPUs with PEANUT 2,031 views
A Comprehensive Performance Analysis of HSA and OpenCL 2.0 2,030 views
GPU-based Monte Carlo radiotherapy dose calculation using phase-space sources 2,030 views
A Case Study for Petascale Applications in Astrophysics: Simulating Gamma-Ray Bursts 2,030 views
Blocked All-Pairs Shortest Paths Algorithm on Intel Xeon Phi KNL Processor: A Case Study 2,030 views
Parallel Programming using OpenCL on Modern Architectures 2,030 views
Towards Path Tracing in Games 2,029 views
Optimizing Communication for Clusters of GPUs 2,029 views
The Feasibility of Using OpenCL Instead of OpenMP for Parallel CPU Programming 2,029 views
Minerva: A Scalable and Highly Efficient Training Platform for Deep Learning 2,029 views
Physically Based Rendering: Implementation of Path Tracer 2,029 views
clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library 2,029 views
Acceleration of Linear Finite-Difference Poisson-Boltzmann Methods on Graphics Processing Units 2,029 views
Ameliorating Memory Contention of OLAP operators on GPU Processors 2,029 views
A Performance Analysis Framework for Identifying Potential Benefits in GPGPU Applications 2,029 views
Flexible, high performance convolutional neural networks for image classification 2,028 views
Zero-copy I/O processing for low-latency GPU computing 2,028 views
A (Somewhat Dated) Comparative Study of Betweenness Centrality Algorithms on GPU 2,028 views
Live Migration for OpenCL FPGA Accelerators 2,028 views
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks 2,028 views
Parallelization, Scalability, and Reproducibility in Next-Generation Sequencing Analysis 2,028 views
OpenCL vs: Accelerated Finite-Difference Digital Synthesis 2,027 views
Volumetric Ambient Occlusion for Real-Time Rendering and Games 2,027 views
Effectiveness of program transformations and compilers for directive-based GPU programming models 2,027 views
Utilizing state-of-art NeuroES and GPGPU to optimize Mario AI 2,027 views
Kokkos: Enabling performance portability across manycore architectures 2,026 views
Molecular Dynamics Simulation of Macromolecules Using Graphics Processing Unit 2,026 views
GPU-based Batched Spatial Query Processing on R-Trees 2,026 views
Implementation and performance evaluation of a GPU particle-in-cell code 2,026 views
OpenCL Numerical Simulations of Two-Fluid Compressible Flows With a 2D Random Choice Method 2,025 views
GPU-ClustalW: Using Graphics Hardware to Accelerate Multiple Sequence Alignment 2,025 views
GA3C: GPU-based A3C for Deep Reinforcement Learning 2,025 views
Teaching Parallel Programming Using Java 2,025 views
Reflector Antenna Analysis using Physical Optics on Graphics Processing Units 2,024 views
Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs 2,024 views
Parallel Ray Tracing in Scientific Visualization 2,024 views
GPU-to-CPU callbacks 2,024 views
Concurrent learning of a Probabilistic Graphical Model on the GPU 2,023 views
Real-Time Tone Mapping for High-Resolution HDR Images 2,023 views
DeepSmith: Compiler Fuzzing through Deep Learning 2,023 views
A Test Drive of the NVIDIA Jetson TX1 Developer Kit for Deep Learning and Computer Vision Applications 2,023 views
Scaling CUDA for Distributed Heterogeneous Processors 2,022 views
GPU-accelerated adjoint algorithmic differentiation 2,022 views
Optimizing Web Virtual Reality 2,022 views
Relativistic Hydrodynamics on Graphic Cards 2,022 views
Efficient Bayesian multi-view deconvolution 2,022 views
Improving the Programmability of GPU Architectures 2,022 views
Analysis and Optimization Techniques for Massively Parallel Processors 2,021 views
MrBayes tgMC3: A Tight GPU Implementation of MrBayes 2,021 views
A Comparative Study of Parallel Algorithms for the Girth Problem 2,021 views
Parallelized Vlasov-Fokker-Planck solver for desktop personal computers 2,021 views
Using mobile GPU for general-purpose computing – a case study of face recognition on smartphones 2,021 views
Dynamic Task-Scheduling and Resource Management for GPU Accelerators in Medical Imaging 2,021 views
JAX, M.D.: End-to-End Differentiable, Hardware Accelerated, Molecular Dynamics in Pure Python 2,021 views
Real-Time Adaptive Image Compression 2,021 views
Parallel data mining on graphics processors 2,021 views
GPU Accelerated Inverse Photon Mapping for Real-Time Surface Reflectance Modeling 2,020 views
A comprehensive study of Dynamic Memory Management in OpenCL kernels 2,020 views
Multi-user real-time speech recognition with a GPU 2,020 views
Benchmarking Deep Learning Models on Jetson TX2 2,020 views
A performance/cost evaluation for a GPU-based drug discovery application on volunteer computing 2,020 views
Removing the Barrier for FPGA-Based OpenCL Data Center Servers 2,020 views
GPGPU accelerated optimization method of Interconnection Network Topology 2,020 views
Parallel CYK Membership Test on GPUs 2,020 views
Two improved GPU acceleration strategies for force-directed graph layout 2,020 views
Exercising high-level parallel programming on streams: a systems biology use case 2,019 views
GPU: Power vs Performance 2,019 views
Automatic Discovery of Algorithms for Multi-Agent Systems 2,019 views
OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures 2,019 views
A Package for Multi-Dimensional Monte Carlo Integration on Multi-GPUs 2,019 views
Scalable and High Performance Betweenness Centrality on the GPU 2,018 views
Bifrost: a Python/C++ Framework for High-Throughput Stream Processing in Astronomy 2,018 views
cltorch: a Hardware-Agnostic Backend for the Torch Deep Neural Network Library, Based on OpenCL 2,018 views
GPU-Accelerated Bayesian Learning and Forecasting in Simultaneous Graphical Dynamic Linear Models 2,018 views
Fast Multipole Method vs. Spectral Method for the Simulation of Isotropic Turbulence on GPUs 2,017 views
LBM based flow simulation using GPU computing processor 2,017 views
Parallel Finite Volume Algorithm on Graphic Processing Units (GPU) 2,017 views
Motion Estimation for H.264/AVC using Programmable Graphics Hardware 2,017 views
GPU computing for shallow water flow simulation based on finite volume schemes 2,017 views
Titles: 100
Total views: 202,496