Views of posts on hgpu.org
Adaptive GPU Array Layout Auto-Tuning 2,174 views
A Parallel Recursive Approach for Solving All Pairs Shortest Path Problem on GPU using OpenCL 2,173 views
Flexible, Fast and Accurate Sequence Alignment Profiling on GPGPU with PaSWAS 2,173 views
A Financial Benchmark for GPGPU Compilation 2,173 views
GPU volume rendering in 3D echocardiography: Real-time pre-processing and ray-casting 2,173 views
3D Registration Based on Normalized Mutual Information: Performance of CPU vs. GPU Implementation 2,173 views
Patch-Based Image Vectorization with Automatic Curvilinear Feature Alignment 2,172 views
Porting Large HPC Applications to GPU Clusters: The Codes GENE and VERTEX 2,172 views
Extending OmpSs for OpenCL kernel co-execution in heterogeneous systems 2,171 views
Performance comparison of GPU and FPGA architectures for the SVM training problem 2,171 views
liquidSVM: A Fast and Versatile SVM package 2,171 views
A tutorial on the implementations of linear image filters in CPU and GPU 2,170 views
Relax-Miracle: GPU Parallelization of Semi-Analytic Fourier-Domain solvers for Earthquake Modeling 2,170 views
Drug Drug Interaction Extraction from Biomedical Literature Using Syntax Convolutional Neural Network 2,170 views
Tactics to Directly Map CNN graphs on Embedded FPGAs 2,170 views
Accelerating Fully Homomorphic Encryption on GPUs 2,170 views
A New Sparse Matrix Vector Multiplication GPU Algorithm Designed for Finite Element Problems 2,169 views
FFT-SPA Non-Binary LDPC Decoding on GPU 2,169 views
Parallel computing with graphics processing units for high-speed Monte Carlo simulation of photon migration 2,169 views
Artifact-Free Decompression and Zooming of JPEG Compressed Images with Total Generalized Variation 2,168 views
Accelerating Large Graph Algorithms on the GPU Using CUDA 2,168 views
GIST: an interactive, GPU-based level set segmentation tool for 3D medical images 2,168 views
GPU-Accelerated High-Accuracy Molecular Docking using Guided Differential Evolution 2,168 views
Orchestrating Multiple Data-Parallel Kernels on Multiple Devices 2,168 views
Portable Programming Models for Heterogeneous Platforms 2,168 views
Improved GPU Co-processor Sorting Algorithm with Barrier Synchronization 2,167 views
GPUMCD: a new GPU-oriented Monte Carlo dose calculation platform 2,167 views
Framework for Parallel Kernels Auto-tuning 2,167 views
Automatic Tuning of Local Memory Use on GPGPUs 2,167 views
GPU-based Low-dose 4DCT Reconstruction via Temporal Non-local Means 2,166 views
IODA: an Input/Output Deep Architecture for image labeling 2,166 views
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Reconfigurable Computing 2,166 views
High Performance GPU-based Fourier Volume Rendering 2,166 views
GAMER-2: a GPU-accelerated adaptive mesh refinement code — accuracy, performance, and scalability 2,165 views
Optimized MFCC Feature Extraction on GPU 2,165 views
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves 2,164 views
High-Dimensional Adaptive Particle Swarm Optimization on Heterogeneous Systems 2,164 views
Local Histogram Modification Based Contrast Enhancement with GPU Acceleration 2,164 views
Loo.py: transformation-based code generation for GPUs and CPUs 2,163 views
accULL: An User-directed Approach to Heterogeneous Programming 2,163 views
Efficient Implementation of RLS-Based Adaptive Filters on nVIDIA GeForce Graphics Processing Unit 2,163 views
An Extension of the StarSs Programming Model for Platforms with Multiple GPUs 2,163 views
Architecture-Adaptive Code Variant Tuning 2,162 views
GPU architecture overview 2,162 views
A Comparison of FPGA and GPU for Real-Time Phase-based Optical Flow, Stereo, and Local Image Features 2,162 views
3D vision of electromagnetic fields in antenna and microwave technique 2,161 views
C-DAC’s Efforts – Application Kernels on HPC Cluster with GPU Accelerators 2,161 views
K-Means on GPU: A Review 2,161 views
Real-time Kd-tree Based Importance Sampling of Environment Maps 2,161 views
Heat Load Modelling for District Heating Plants Using an OpenCL-based Algorithm 2,160 views
Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS 2,160 views
A Modular Framework for Deformation and Fracture using GPU Shaders 2,159 views
Image segmentation using CUDA implementations of the Runge-Kutta-Merson and GMRES methods 2,159 views
Solving the Boltzmann equation on GPUs 2,159 views
A Study of Successive Over-relaxation Method Parallelization Over Modern HPC Languages 2,159 views
Power Management Techniques for Data Centers: A Survey 2,158 views
A Memory Efficient and Fast Sparse Matrix Vector Product on a GPU 2,158 views
Efficient fMRI Analysis and Clustering on GPUs 2,158 views
An Efficient Parallel GPU Evaluation of Small Angle X-Ray Scattering Profiles 2,158 views
A Comparison of GPU Execution Time Prediction using Machine Learning and Analytical Modeling 2,158 views
A Novel CPU/GPU Simulation Environment for Large-Scale Biologically-Realistic Neural Modeling 2,158 views
Optimising Purely Functional GPU Programs 2,158 views
Analyzing Use of OpenCL on the Cell Broadband Engine and a Proposal for OpenCL Extensions 2,157 views
MapSQ: A MapReduce-based Framework for SPARQL Queries on GPU 2,157 views
Portable Mapping of Data Parallel Programs to OpenCL for Heterogeneous Systems 2,157 views
Overlapping computation and communication of three-dimensional FDTD on a GPU cluster 2,156 views
CPU and GPU Co-processing for Sound 2,156 views
Augur: a Modeling Language for Data-Parallel Probabilistic Inference 2,156 views
Research on Parallel DVH Statistic Based on CUDA 2,155 views
AI Benchmark: Running Deep Neural Networks on Android Smartphones 2,155 views
A Case Study of SWIM: Optimization of Memory Intensive Application on GPGPU 2,155 views
Accelerating distance matrix calculations utilizing GPU 2,155 views
Design Exploration of AES Accelerators on FPGAs and GPUs 2,155 views
A Scalable graph-cut algorithm for N-D grids 2,155 views
5.6: GPU enhancement of FDTD-PIC plasma-wave simulations 2,154 views
A Survey Of Techniques for Managing and Leveraging Caches in GPUs 2,154 views
Dynamic Buffer Overflow Detection for GPGPUs 2,154 views
Efficient Sparse Matrix-Vector Multiplication on CUDA 2,154 views
Towards Enhancing Performance, Programmability, and Portability in Heterogeneous Computing 2,154 views
SystemC simulation on GP-GPUs: CUDA vs. OpenCL 2,153 views
High-Level Energy Model of Embedded GPU for Real-Time Graphic Rendering 2,153 views
MAP-based Brain Tissue Segmentation using Manifold Learning and Hierarchical Max-Flow regularization 2,153 views
Optimization of real-time ultrasound PCIe data streaming and OpenCL processing for SAFT imaging 2,153 views
Measuring the Performance of Realtime DSP Using Pure Data and GPU 2,153 views
Kernelet: High-Throughput GPU Kernel Executions with Dynamic Slicing and Scheduling 2,153 views
Formalizing Address Spaces with application to Cuda, OpenCL, and beyond 2,152 views
CST: Constructive Solid Trimming for Rendering BReps and CSG 2,151 views
GPU & CPU implementation of Young – Van Vliet’s Recursive Gaussian Smoothing Filter 2,151 views
Titles: 100
Total views: 216214
- Programming - 186,130 views
- Login - 164,407 views
- User dashboard - 90,761 views
- Paper titles list - 70,158 views
- Add new event - 64,597 views
- Add new post - 59,378 views
- Register - 49,236 views
- Statistics - 36,633 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,167 views
- Books on OpenCL and CUDA - 28,824 views