
Views of posts on hgpu.org

Adaptive GPU Array Layout Auto-Tuning  2,174 views

A Parallel Recursive Approach for Solving All Pairs Shortest Path Problem on GPU using OpenCL  2,173 views

Flexible, Fast and Accurate Sequence Alignment Profiling on GPGPU with PaSWAS  2,173 views

A Financial Benchmark for GPGPU Compilation  2,173 views

GPU volume rendering in 3D echocardiography: Real-time pre-processing and ray-casting  2,173 views

3D Registration Based on Normalized Mutual Information: Performance of CPU vs. GPU Implementation  2,173 views

Patch-Based Image Vectorization with Automatic Curvilinear Feature Alignment  2,172 views

Porting Large HPC Applications to GPU Clusters: The Codes GENE and VERTEX  2,172 views

Extending OmpSs for OpenCL kernel co-execution in heterogeneous systems  2,171 views

Performance comparison of GPU and FPGA architectures for the SVM training problem  2,171 views

liquidSVM: A Fast and Versatile SVM package  2,171 views

Efficient parallel implementation of the lattice Boltzmann method on large clusters of graphic processing units  2,170 views

A tutorial on the implementations of linear image filters in CPU and GPU  2,170 views

Relax-Miracle: GPU Parallelization of Semi-Analytic Fourier-Domain solvers for Earthquake Modeling  2,170 views

Drug Drug Interaction Extraction from Biomedical Literature Using Syntax Convolutional Neural Network  2,170 views

Tactics to Directly Map CNN graphs on Embedded FPGAs  2,170 views

Accelerating Fully Homomorphic Encryption on GPUs  2,170 views

A New Sparse Matrix Vector Multiplication GPU Algorithm Designed for Finite Element Problems  2,169 views

Parallel numerical simulation of two-phase flow model in porous media using distributed and shared memory architectures  2,169 views

FFT-SPA Non-Binary LDPC Decoding on GPU  2,169 views

Parallel computing with graphics processing units for high-speed Monte Carlo simulation of photon migration  2,169 views

High Performance and Scalable Radix Sorting: A case study of implementing dynamic parallelism for GPU computing  2,168 views

Artifact-Free Decompression and Zooming of JPEG Compressed Images with Total Generalized Variation  2,168 views

Accelerating Large Graph Algorithms on the GPU Using CUDA  2,168 views

GIST: an interactive, GPU-based level set segmentation tool for 3D medical images  2,168 views

GPU-Accelerated High-Accuracy Molecular Docking using Guided Differential Evolution  2,168 views

Orchestrating Multiple Data-Parallel Kernels on Multiple Devices  2,168 views

Portable Programming Models for Heterogeneous Platforms  2,168 views

Improved GPU Co-processor Sorting Algorithm with Barrier Synchronization  2,167 views

GPUMCD: a new GPU-oriented Monte Carlo dose calculation platform  2,167 views

Framework for Parallel Kernels Auto-tuning  2,167 views

Automatic Tuning of Local Memory Use on GPGPUs  2,167 views

GPU-based Low-dose 4DCT Reconstruction via Temporal Non-local Means  2,166 views

IODA: an Input/Output Deep Architecture for image labeling  2,166 views

HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Reconfigurable Computing  2,166 views

High Performance GPU-based Fourier Volume Rendering  2,166 views

Toward a Multi-level Parallel Framework on GPU Cluster with PetSC-CUDA for PDE-based Optical Flow Computation  2,165 views

GAMER-2: a GPU-accelerated adaptive mesh refinement code — accuracy, performance, and scalability  2,165 views

Optimized MFCC Feature Extraction on GPU  2,165 views

A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves  2,164 views

High-Dimensional Adaptive Particle Swarm Optimization on Heterogeneous Systems  2,164 views

A parallel implementation of a derivative pricing model incorporating SABR calibration and probability lookup tables  2,164 views

Local Histogram Modification Based Contrast Enhancement with GPU Acceleration  2,164 views

Loo.py: transformation-based code generation for GPUs and CPUs  2,163 views

accULL: An User-directed Approach to Heterogeneous Programming  2,163 views

Efficient Implementation of RLS-Based Adaptive Filters on nVIDIA GeForce Graphics Processing Unit  2,163 views

Simulating the Cardinal Movements of Childbirth Using Finite Element Analysis on the Graphics Processing Unit  2,163 views

The CUBLAS and CULA based GPU acceleration of adaptive finite element framework for bioluminescence tomography  2,163 views

An Extension of the StarSs Programming Model for Platforms with Multiple GPUs  2,163 views

Architecture-Adaptive Code Variant Tuning  2,162 views

GPU architecture overview  2,162 views

A Comparison of FPGA and GPU for Real-Time Phase-based Optical Flow, Stereo, and Local Image Features  2,162 views

3D vision of electromagnetic fields in antenna and microwave technique  2,161 views

C-DAC’s Efforts – Application Kernels on HPC Cluster with GPU Accelerators  2,161 views

K-Means on GPU: A Review  2,161 views

Real-time Kd-tree Based Importance Sampling of Environment Maps  2,161 views

Heat Load Modelling for District Heating Plants Using an OpenCL-based Algorithm  2,160 views

Parallel Implementation of the Finite Element Method on Graphics Processors for the Solution of Incompressible Flows  2,160 views

Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS  2,160 views

A Modular Framework for Deformation and Fracture using GPU Shaders  2,159 views

Image segmentation using CUDA implementations of the Runge-Kutta-Merson and GMRES methods  2,159 views

Performance analysis of multi-core CPUs and GPU computing on SF-FDTD scheme for third order nonlinear materials and periodic media  2,159 views

Solving the Boltzmann equation on GPUs  2,159 views

A Study of Successive Over-relaxation Method Parallelization Over Modern HPC Languages  2,159 views

Power Management Techniques for Data Centers: A Survey  2,158 views

A Memory Efficient and Fast Sparse Matrix Vector Product on a GPU  2,158 views

Efficient fMRI Analysis and Clustering on GPUs  2,158 views

An Efficient Parallel GPU Evaluation of Small Angle X-Ray Scattering Profiles  2,158 views

A Comparison of GPU Execution Time Prediction using Machine Learning and Analytical Modeling  2,158 views

A Novel CPU/GPU Simulation Environment for Large-Scale Biologically-Realistic Neural Modeling  2,158 views

Optimising Purely Functional GPU Programs  2,158 views

Analyzing Use of OpenCL on the Cell Broadband Engine and a Proposal for OpenCL Extensions  2,157 views

MapSQ: A MapReduce-based Framework for SPARQL Queries on GPU  2,157 views

Portable Mapping of Data Parallel Programs to OpenCL for Heterogeneous Systems  2,157 views

Overlapping computation and communication of three-dimensional FDTD on a GPU cluster  2,156 views

On the Development and Implementation of High-Order Flux Reconstruction Schemes for Computational Fluid Dynamics  2,156 views

CPU and GPU Co-processing for Sound  2,156 views

Augur: a Modeling Language for Data-Parallel Probabilistic Inference  2,156 views

Research on Parallel DVH Statistic Based on CUDA  2,155 views

AI Benchmark: Running Deep Neural Networks on Android Smartphones  2,155 views

A Case Study of SWIM: Optimization of Memory Intensive Application on GPGPU  2,155 views

Accelerating distance matrix calculations utilizing GPU  2,155 views

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition  2,155 views

Design Exploration of AES Accelerators on FPGAs and GPUs  2,155 views

A Scalable graph-cut algorithm for N-D grids  2,155 views

5.6: GPU enhancement of FDTD-PIC plasma-wave simulations  2,154 views

A Survey Of Techniques for Managing and Leveraging Caches in GPUs  2,154 views

Dynamic Buffer Overflow Detection for GPGPUs  2,154 views

Efficient Sparse Matrix-Vector Multiplication on CUDA  2,154 views

Towards Enhancing Performance, Programmability, and Portability in Heterogeneous Computing  2,154 views

SystemC simulation on GP-GPUs: CUDA vs. OpenCL  2,153 views

High-Level Energy Model of Embedded GPU for Real-Time Graphic Rendering  2,153 views

MAP-based Brain Tissue Segmentation using Manifold Learning and Hierarchical Max-Flow regularization  2,153 views

Optimization of real-time ultrasound PCIe data streaming and OpenCL processing for SAFT imaging  2,153 views

Measuring the Performance of Realtime DSP Using Pure Data and GPU  2,153 views

Kernelet: High-Throughput GPU Kernel Executions with Dynamic Slicing and Scheduling  2,153 views

Formalizing Address Spaces with application to Cuda, OpenCL, and beyond  2,152 views

Performance analysis of SSE instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems  2,152 views

CST: Constructive Solid Trimming for Rendering BReps and CSG  2,151 views

GPU & CPU implementation of Young – Van Vliet’s Recursive Gaussian Smoothing Filter  2,151 views

 

Brief statistics for this page

Titles: 100

Total views: 216,214

 


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
