Views of posts on hgpu.org
Asynchronous Parallel Computing Model of Global Motion Estimation with CUDA 1,983 views
GPU-Based Spherical Light Field Rendering with Per-Fragment Depth Correction 1,982 views
Graduate Operating Systems: Project Report 1,982 views
MSTg: Cryptographically strong pseudorandom number generator and its realization 1,982 views
Distributed OpenCL: Distributing OpenCL Platform on Network Scale 1,982 views
Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs 1,982 views
GPU Accelerated Computation of the ICON Model 1,982 views
Accelerating Sparse Matrix-Matrix Multiplication with GPU Tensor Cores 1,982 views
GPU Accelerated Range Trees with Applications 1,982 views
An FPGA-based processing pipeline for high definition stereo video 1,982 views
Deep Convolutional Neural Networks for Smile Recognition 1,982 views
Exploring Traditional and Emerging Parallel Programming Models using a Proxy Application 1,982 views
Fast Feature Selection in a GPU Cluster Using the Delta Test 1,982 views
A comparative benchmarking of the FFT on Fermi and Evergreen GPUs 1,982 views
Structured Orthogonal Inversion of Block p-Cyclic Matrices on Multicore with GPU Accelerators 1,982 views
Understanding the ISA impact on GPU Architecture 1,981 views
GPU-accelerated deep shadow maps for direct volume rendering 1,981 views
GROMACS on Hybrid CPU-GPU and CPU-MIC Clusters: Preliminary Porting Experiences, Results and Next Steps 1,981 views
ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture 1,981 views
GPU-based image manipulation and enhancement techniques for dynamic volumetric medical image visualization 1,981 views
Parallel Tree Traversal for Nearest Neighbor Query on the GPU 1,981 views
Fast calculation of HELAS amplitudes using graphics processing unit (GPU) 1,981 views
Portable HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi 1,981 views
Tradeoffs in designing accelerator architectures for visual computing 1,981 views
Inter-block synchronization on a GPGPU 1,980 views
Scalable Verification Techniques for Data-Parallel Programs 1,980 views
An improved scheme of an interactive finite element model for 3D soft-tissue cutting and deformation 1,980 views
Implementation of Filtering Beamforming Algorithms for Sonar Devices Using GPU 1,980 views
ImageCL: An Image Processing Language for Performance Portability on Heterogeneous Systems 1,980 views
SMAA: Enhanced Subpixel Morphological Antialiasing 1,979 views
Approximation of Loop Subdivision Surfaces for Fast Rendering 1,979 views
High Performance Poisson Equation Solver for Hybrid CPU/GPU Systems 1,979 views
Geospatial visualization using hardware accelerated real-time volume rendering 1,979 views
Real-time multi-agent path planning on arbitrary surfaces 1,979 views
Coherent Photon Mapping on the Intel MIC Architecture 1,979 views
Performance Analysis of Sobel Edge Detection Filter on GPU using CUDA & OpenGL 1,979 views
Streaming GPU Singular Value and Dynamic Mode Decompositions 1,979 views
Performance Comparison of GPUs with a Genetic Algorithm based on CUDA 1,979 views
SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures 1,978 views
Direct solution of the Boltzmann equation for a binary mixture on GPUs 1,978 views
Accelerating Eulerian Fluid Simulation With Convolutional Networks 1,978 views
Exploiting Data Parallelism in GPUs 1,978 views
Decreasing NAME III Solution Time Using GP-GPU 1,978 views
Semi-Global Filtering of Airborne LiDAR Data for Fast Extraction of Digital Terrain Models 1,978 views
GPU Acceleration of a Genetic Algorithm for the Synthesis of FSM-based Bimodal Predictors 1,978 views
Breaking ECC2K-130 1,977 views
Accelerating Dynamic Time Warping Subsequence Search with GPUs and FPGAs 1,977 views
Parallel GPU Implementation of Hough Transform for Circles 1,977 views
Time-stepping methods for the simulation of the self-assembly of nano-crystals in Matlab on a GPU 1,977 views
GPU phase-field lattice Boltzmann simulations of growth and motion of a binary alloy dendrite 1,977 views
A Highly Efficient GPU-CPU Hybrid Parallel Implementation of Sparse LU Factorization 1,977 views
Adjoint Algorithmic Differentiation of a GPU Accelerated Application 1,976 views
Reduction of a Symmetrical Matrix to Tridiagonal Form on GPUs 1,976 views
Accelerating linpack with CUDA on heterogenous clusters 1,976 views
Optimal loop unrolling for GPGPU programs (thesis) 1,976 views
Implementing CFD (Computational Fluid Dynamics) in OpenCL for Building Simulation 1,976 views
Efficient Ray Tracing of Dynamic Scenes on the GPU 1,976 views
The performances of R GPU implementations of the GMRES method 1,976 views
OpenCL-Accelerated Computation of a 3D SPECT Projection Operator for the Content Adaptive Mesh Model 1,976 views
Searching for a counterexample of Kurepa’s Conjecture 1,976 views
Large data visualization on distributed memory multi-GPU clusters 1,976 views
POMPEI: Programming with OpenMP4 for Exascale Investigations 1,975 views
Adapting Particle Filter Algorithms to Many-Core Architectures 1,975 views
Linear algebra operators for GPU implementation of numerical algorithms 1,975 views
Thread Block Compaction for Efficient SIMT Control Flow 1,975 views
Real-Time Optical Flow Calculations on FPGA and GPU Architectures: A Comparison Study 1,975 views
Fast Simulations of Gravitational Many-body Problem on RV770 GPU 1,975 views
GPU processing of particle system animation 1,974 views
Modernizing the core quantum chemistry algorithms 1,974 views
A GPU based real-time GPS software receiver 1,974 views
Early Experiences Running the 3D Stencil Jacobi Method in Intel Xeon Phi 1,974 views
MIMD Interpretation on a GPU 1,974 views
Efficient Interleaved Batch Matrix Solvers for CUDA 1,974 views
Performance study of interference on GPU and CPU resources with multiple applications 1,974 views
Analysis of Real-Time Stereo Vision Algorithms On GPU 1,974 views
Performance and Productivity of Parallel Python Programming: A study with a CFD Test Case 1,974 views
Visualization of Pareto Solutions by Spherical Self-Organizing Map and It’s acceleration on a GPU 1,974 views
Towards Improving Programmability of Heterogeneous Parallel Architectures 1,973 views
OpenCL Sparse Linear Solver for Circuit Simulation 1,973 views
A CUDA-Based Real Parameter Optimization Benchmark 1,973 views
A new approach to the lattice Boltzmann method for graphics processing units 1,973 views
Autotuning, Code Generation and Optimizing Compiler Technology for GPUs 1,973 views
Origami: A Convolutional Network Accelerator 1,973 views
Faster Radix Sort via Virtual Memory and Write-Combining 1,973 views
Viewpoints: A high-performance high-dimensional exploratory data analysis tool 1,973 views
CloudCL: Single-Paradigm Distributed Heterogeneous Computing for Cloud Infrastructures 1,972 views
md_poly: A Performance-Portable Polyhedral Compiler Based on Multi-Dimensional Homomorphisms 1,972 views
A Highly-Efficient Memory-Compression Scheme for GPU-Accelerated Intrusion Detection Systems 1,972 views
A Software-Based Self Test of CUDA Fermi GPUs 1,972 views
Parallelizing Exact and Approximate String Matching via Inclusive Scan on a GPU 1,972 views
Titles: 100
Total views: 197,757