2402

Views of posts on hgpu.org

Asynchronous Parallel Computing Model of Global Motion Estimation with CUDA  1,983 views

Abstraction and Implementation of Unstructured Grid Algorithms on Massively Parallel Heterogeneous Architectures  1,982 views

GPU-Based Spherical Light Field Rendering with Per-Fragment Depth Correction  1,982 views

Implications of the Turing completeness of reaction-diffusion models, informed by GPGPU simulations on an XBox 360: cardiac arrhythmias, re-entry and the Halting problem  1,982 views

Graduate Operating Systems: Project Report  1,982 views

MSTg: Cryptographically strong pseudorandom number generator and its realization  1,982 views

Distributed OpenCL Distributing OpenCL Platform on Network Scale  1,982 views

Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs  1,982 views

GPU Accelerated Computation of the ICON Model  1,982 views

Accelerating Sparse Matrix-Matrix Multiplication with GPU Tensor Cores  1,982 views

GPU Accelerated Range Trees with Applications  1,982 views

An FPGA-based processing pipeline for high definition stereo video  1,982 views

Deep Convolutional Neural Networks for Smile Recognition  1,982 views

Exploring Traditional and Emerging Parallel Programming Models using a Proxy Application  1,982 views

Fast Feature Selection in a GPU Cluster Using the Delta Test  1,982 views

A comparative benchmarking of the FFT on Fermi and Evergreen GPUs  1,982 views

Structured Orthogonal Inversion of Block p-Cyclic Matrices on Multicore with GPU Accelerators  1,982 views

Task migration of DSP application specified with a DFG and implemented with the BSP computing model on a CPU-GPU cluster  1,981 views

Understanding the ISA impact on GPU Architecture  1,981 views

Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations  1,981 views

GPU-accelerated deep shadow maps for direct volume rendering  1,981 views

GROMACS on Hybrid CPU-GPU and CPU-MIC Clusters: Preliminary Porting Experiences, Results and Next Steps  1,981 views

ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture  1,981 views

GPU-based image manipulation and enhancement techniques for dynamic volumetric medical image visualization  1,981 views

Parallel Tree Traversal for Nearest Neighbor Query on the GPU  1,981 views

Fast calculation of HELAS amplitudes using graphics processing unit (GPU)  1,981 views

Portable HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi  1,981 views

Tradeoffs in designing accelerator architectures for visual computing  1,981 views

Inter-block synchronization on a GPGPU  1,980 views

Scalable Verification Techniques for Data-Parallel Programs  1,980 views

Toward Accelerating the Matrix Inversion Computation of Symmetric Positive-Definite Matrices on Heterogeneous GPU-Based Systems  1,980 views

An improved scheme of an interactive finite element model for 3D soft-tissue cutting and deformation  1,980 views

Implementation of Filtering Beamforming Algorithms for Sonar Devices Using GPU  1,980 views

ImageCL: An Image Processing Language for Performance Portability on Heterogeneous Systems  1,980 views

SMAA: Enhanced Subpixel Morphological Antialiasing  1,979 views

Approximation of Loop Subdivision Surfaces for Fast Rendering  1,979 views

High Performance Poisson Equation Solver for Hybrid CPU/GPU Systems  1,979 views

Geospatial visualization using hardware accelerated real-time volume rendering  1,979 views

Real-time multi-agent path planning on arbitrary surfaces  1,979 views

Coherent Photon Mapping on the Intel MIC Architecture  1,979 views

Performance Analysis of Sobel Edge Detection Filter on GPU using CUDA & OpenGL  1,979 views

Streaming GPU Singular Value and Dynamic Mode Decompositions  1,979 views

Performance Comparison of GPUs with a Genetic Algorithm based on CUDA  1,979 views

SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures  1,978 views

Direct solution of the Boltzmann equation for a binary mixture on GPUs  1,978 views

Accelerating Eulerian Fluid Simulation With Convolutional Networks  1,978 views

Exploiting Data Parallelism in GPUs  1,978 views

Decreasing NAME III Solution Time Using GP-GPU  1,978 views

Semi-Global Filtering of Airborne LiDAR Data for Fast Extraction of Digital Terrain Models  1,978 views

GPU Acceleration of a Genetic Algorithm for the Synthesis of FSM-based Bimodal Predictors  1,978 views

Finite differences numerical method for two-dimensional superlattice Boltzmann transport equation and case comparison of CPU(C) and GPGPU(CUDA) implementations  1,977 views

Breaking ECC2K-130  1,977 views

Accelerating Dynamic Time Warping Subsequence Search with GPUs and FPGAs  1,977 views

Parallel GPU Implementation of Hough Transform for Circles  1,977 views

Time-stepping methods for the simulation of the self-assembly of nano-crystals in Matlab on a GPU  1,977 views

GPU phase-field lattice Boltzmann simulations of growth and motion of a binary alloy dendrite  1,977 views

A Highly Efficient GPU-CPU Hybrid Parallel Implementation of Sparse LU Factorization  1,977 views

Adjoint Algorithmic Differentiation of a GPU Accelerated Application  1,976 views

Reduction of a Symmetrical Matrix to Tridiagonal Form on GPUs  1,976 views

Accelerating linpack with CUDA on heterogenous clusters  1,976 views

Optimal loop unrolling for GPGPU programs (thesis)  1,976 views

Implementing CFD (Computational Fluid Dynamics) in OpenCL for Building Simulation  1,976 views

Efficient Ray Tracing of Dynamic Scenes on the GPU  1,976 views

The performances of R GPU implementations of the GMRES method  1,976 views

Comprehensive Evaluations of Cone-beam CT dose in Image-guided Radiation Therapy via GPU-based Monte Carlo simulations  1,976 views

OpenCL-Accelerated Computation of a 3D SPECT Projection Operator for the Content Adaptive Mesh Model  1,976 views

Searching for a counterexample of Kurepa’s Conjecture  1,976 views

Large data visualization on distributed memory multi-GPU clusters  1,976 views

Feasibility Analysis of Low Cost Graphical Processing Units for Electromagnetic Field Simulations by Finite Difference Time Domain Method  1,976 views

POMPEI: Programming with OpenMP4 for Exascale Investigations  1,975 views

Adapting Particle Filter Algorithms to Many-Core Architectures  1,975 views

Linear algebra operators for GPU implementation of numerical algorithms  1,975 views

Thread Block Compaction for Efficient SIMT Control Flow  1,975 views

Real-Time Optical Flow Calculations on FPGA and GPU Architectures: A Comparison Study  1,975 views

Systolic-CNN: An OpenCL-defined Scalable Run-time-flexible FPGA Accelerator Architecture for Accelerating Convolutional Neural Network Inference in Cloud/Edge Computing  1,975 views

Fast Simulations of Gravitational Many-body Problem on RV770 GPU  1,975 views

GPU processing of particle system animation  1,974 views

Modernizing the core quantum chemistry algorithms  1,974 views

A GPU based real-time GPS software receiver  1,974 views

Early Experiences Running the 3D Stencil Jacobi Method in Intel Xeon Phi  1,974 views

MIMD Interpretation on a GPU  1,974 views

Efficient Interleaved Batch Matrix Solvers for CUDA  1,974 views

Performance study of interference on GPU and CPU resources with multiple applications  1,974 views

Analysis of Real-Time Stereo Vision Algorithms On GPU  1,974 views

Multithreaded Transposition of Square Matrices with Common Code for Intel Xeon Processors and Intel Xeon Phi Coprocessors  1,974 views

Performance and Productivity of Parallel Python Programming: A study with a CFD Test Case  1,974 views

Visualization of Pareto Solutions by Spherical Self-Organizing Map and It’s acceleration on a GPU  1,974 views

Towards Improving Programmability of Heterogeneous Parallel Architectures  1,973 views

OpenCL Sparse Linear Solver for Circuit Simulation  1,973 views

A CUDA-Based Real Parameter Optimization Benchmark  1,973 views

A new approach to the lattice Boltzmann method for graphics processing units  1,973 views

Autotuning, Code Generation and Optimizing Compiler Technology for GPUs  1,973 views

Origami: A Convolutional Network Accelerator  1,973 views

Faster Radix Sort via Virtual Memory and Write-Combining  1,973 views

Viewpoints: A high-performance high-dimensional exploratory data analysis tool  1,973 views

CloudCL: Single-Paradigm Distributed Heterogeneous Computing for Cloud Infrastructures  1,972 views

md_poly: A Performance-Portable Polyhedral Compiler Based on Multi-Dimensional Homomorphisms  1,972 views

A Highly-Efficient Memory-Compression Scheme for GPU-Accelerated Intrusion Detection Systems  1,972 views

A Software-Based Self Test of CUDA Fermi GPUs  1,972 views

Parallelizing Exact and Approximate String Matching via Inclusive Scan on a GPU  1,972 views

 

Brief statistics for this page

Titles: 100

Total views: 197757

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: