Views of posts on hgpu.org

A Co-Design Framework with OpenCL Support for Low-Energy Wide SIMD Processor  2,033 views

A Framework for Profiling and Performance Monitoring of Heterogeneous Applications  2,033 views

Heterogeneous GPU&CPU cluster for High Performance Computing in cryptography  2,033 views

CULA: hybrid GPU accelerated linear algebra routines  2,033 views

A Similarity-Based Analysis Tool for Scientific Application Porting  2,033 views

Developing a CUDA solver for large sparse matrices for MARIN  2,033 views

Using Image Morphing for Memory-Efficient Impostor Rendering on GPU  2,032 views

Interactive Computer Graphics: A Top-Down Approach Using OpenGL (5th Edition)  2,032 views

QMCPACK: An open source ab initio Quantum Monte Carlo package for the electronic structure of atoms, molecules, and solids  2,032 views

Investigation of heterogeneous computing through novel parallel programming platforms  2,032 views

A GPU Implementation of Local Search Operators for Symmetric Travelling Salesman Problem  2,032 views

Kd-tree Based N-Body Simulations with Volume-Mass Heuristic on the GPU  2,032 views

Solutions For Optimizing The Radix Sort Algorithmic Function Using The Compute Unified Device Architecture  2,032 views

Multifactor dimensionality reduction for graphics processing units enables genome-wide testing of epistasis in sporadic ALS  2,032 views

GIS Polygon Overlay Processing: New Parallel Algorithm and System Prototype  2,031 views

Fast, parallel implementation of particle filtering on the GPU architecture  2,031 views

Automatic Data Layout Generation and Kernel Mapping for CPU+GPU Architectures  2,031 views

Massively parallel read mapping on GPUs with PEANUT  2,031 views

A Comprehensive Performance Analysis of HSA and OpenCL 2.0  2,030 views

GPU-based Monte Carlo radiotherapy dose calculation using phase-space sources  2,030 views

A Case Study for Petascale Applications in Astrophysics: Simulating Gamma-Ray Bursts  2,030 views

Blocked All-Pairs Shortest Paths Algorithm on Intel Xeon Phi KNL Processor: A Case Study  2,030 views

Parallel Programming using OpenCL on Modern Architectures  2,030 views

Using the Tsetlin Machine to Learn Human-Interpretable Rules for High-Accuracy Text Categorization with Medical Applications  2,030 views

Towards Path Tracing in Games  2,029 views

Optimizing Communication for Clusters of GPUs  2,029 views

The Feasibility of Using OpenCL Instead of OpenMP for Parallel CPU Programming  2,029 views

Minerva: A Scalable and Highly Efficient Training Platform for Deep Learning  2,029 views

Physically Based Rendering: Implementation of Path Tracer  2,029 views

clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library  2,029 views

Acceleration of Linear Finite-Difference Poisson-Boltzmann Methods on Graphics Processing Units  2,029 views

Ameliorating Memory Contention of OLAP operators on GPU Processors  2,029 views

A Performance Analysis Framework for Identifying Potential Benefits in GPGPU Applications  2,029 views

Flexible, high performance convolutional neural networks for image classification  2,028 views

Zero-copy I/O processing for low-latency GPU computing  2,028 views

A (Somewhat Dated) Comparative Study of Betweenness Centrality Algorithms on GPU  2,028 views

Live Migration for OpenCL FPGA Accelerators  2,028 views

Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks  2,028 views

Parallelization, Scalability, and Reproducibility in Next-Generation Sequencing Analysis  2,028 views

OpenCL vs: Accelerated Finite-Difference Digital Synthesis  2,027 views

Volumetric Ambient Occlusion for Real-Time Rendering and Games  2,027 views

Effectiveness of program transformations and compilers for directive-based GPU programming models  2,027 views

Utilizing state-of-art NeuroES and GPGPU to optimize Mario AI  2,027 views

Kokkos: Enabling performance portability across manycore architectures  2,026 views

Molecular Dynamics Simulation of Macromolecules Using Graphics Processing Unit  2,026 views

GPU-based Batched Spatial Query Processing on R-Trees  2,026 views

Implementation and performance evaluation of a GPU particle-in-cell code  2,026 views

OpenCL Numerical Simulations of Two-Fluid Compressible Flows With a 2D Random Choice Method  2,025 views

GPU-ClustalW: Using Graphics Hardware to Accelerate Multiple Sequence Alignment  2,025 views

GA3C: GPU-based A3C for Deep Reinforcement Learning  2,025 views

Teaching Parallel Programming Using Java  2,025 views

Reflector Antenna Analysis using Physical Optics on Graphics Processing Units  2,024 views

Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs  2,024 views

Parallel Ray Tracing in Scientific Visualization  2,024 views

GPU-to-CPU callbacks  2,024 views

Concurrent learning of a Probabilistic Graphical Model on the GPU  2,023 views

Real-Time Tone Mapping for High-Resolution HDR Images  2,023 views

DeepSmith: Compiler Fuzzing through Deep Learning  2,023 views

A Test Drive of the NVIDIA Jetson TX1 Developer Kit for Deep Learning and Computer Vision Applications  2,023 views

Scaling CUDA for Distributed Heterogeneous Processors  2,022 views

GPU-accelerated adjoint algorithmic differentiation  2,022 views

Optimizing Web Virtual Reality  2,022 views

Relativistic Hydrodynamics on Graphic Cards  2,022 views

Efficient Bayesian multi-view deconvolution  2,022 views

Improving the Programmability of GPU Architectures  2,022 views

Analysis and Optimization Techniques for Massively Parallel Processors  2,021 views

MrBayes tgMC3: A Tight GPU Implementation of MrBayes  2,021 views

A Comparative Study of Parallel Algorithms for the Girth Problem  2,021 views

Parallelized Vlasov-Fokker-Planck solver for desktop personal computers  2,021 views

Using mobile GPU for general-purpose computing – a case study of face recognition on smartphones  2,021 views

Dynamic Task-Scheduling and Resource Management for GPU Accelerators in Medical Imaging  2,021 views

JAX, M.D.: End-to-End Differentiable, Hardware Accelerated, Molecular Dynamics in Pure Python  2,021 views

Real-Time Adaptive Image Compression  2,021 views

Parallel data mining on graphics processors  2,021 views

GPU Accelerated Inverse Photon Mapping for Real-Time Surface Reflectance Modeling  2,020 views

A comprehensive study of Dynamic Memory Management in OpenCL kernels  2,020 views

Multi-user real-time speech recognition with a GPU  2,020 views

Interventional 4-D Motion Estimation and Reconstruction of Cardiac Vasculature without Motion Periodicity Assumption  2,020 views

Benchmarking Deep Learning Models on Jetson TX2  2,020 views

A performance/cost evaluation for a GPU-based drug discovery application on volunteer computing  2,020 views

Removing the Barrier for FPGA-Based OpenCL Data Center Servers  2,020 views

GPGPU accelerated optimization method of Interconnection Network Topology  2,020 views

Parallel CYK Membership Test on GPUs  2,020 views

Two improved GPU acceleration strategies for force-directed graph layout  2,020 views

Exercising high-level parallel programming on streams: a systems biology use case  2,019 views

GPU: Power vs Performance  2,019 views

Automatic Discovery of Algorithms for Multi-Agent Systems  2,019 views

Accelerating incompressible flow computations with a Pthreads-CUDA implementation on small-footprint multi-GPU platforms  2,019 views

OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures  2,019 views

A Package for Multi-Dimensional Monte Carlo Integration on Multi-GPUs  2,019 views

Scalable and High Performance Betweenness Centrality on the GPU  2,018 views

Bifrost: a Python/C++ Framework for High-Throughput Stream Processing in Astronomy  2,018 views

cltorch: a Hardware-Agnostic Backend for the Torch Deep Neural Network Library, Based on OpenCL  2,018 views

GPU-Accelerated Bayesian Learning and Forecasting in Simultaneous Graphical Dynamic Linear Models  2,018 views

Fast GPU-Based Seismogram Simulation from Microseismic Events in Marine Environments Using Heterogeneous Velocity Models  2,017 views

Fast Multipole Method vs. Spectral Method for the Simulation of Isotropic Turbulence on GPUs  2,017 views

LBM based flow simulation using GPU computing processor  2,017 views

Parallel Finite Volume Algorithm on Graphic Processing Units (GPU)  2,017 views

Motion Estimation for H.264/AVC using Programmable Graphics Hardware  2,017 views

GPU computing for shallow water flow simulation based on finite volume schemes  2,017 views

 

Brief statistics for this page

Titles: 100

Total views: 202,496

 

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
