Views of posts on hgpu.org
Dynamical simulations of extrasolar planetary systems with debris disks using a GPU accelerated N-body code 2,196 views
Formal Semantics of Heterogeneous CUDA-C: A Modular Approach with Applications 2,196 views
Evolutionary Simulation of Life Using CUDA 2,196 views
Acceleration of Deep Learning on FPGA 2,196 views
Approximate dynamic programming with post-decision states as a solution method for dynamic economic models 2,195 views
Lattice Boltzmann Simulations of Multiphase Flows 2,195 views
Analysis & Design of Efficient Cryptographic Systems 2,195 views
GPU-MEME: Using Graphics Hardware to Accelerate Motif Finding in DNA Sequences 2,195 views
The GPU vs Phi Debate: Risk Analytics Using Many-Core Computing 2,194 views
Optimal Configuration of GPU Cache Memory to Maximize the Performance 2,194 views
Interactive GPU active contours for segmenting inhomogeneous objects 2,194 views
Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning 2,194 views
Fast Estimation of Gaussian Mixture Model Parameters on GPU using CUDA 2,194 views
A GPU-based Large-scale Monte Carlo Simulation Method for Systems with Long-range Interactions 2,194 views
MAGMA Batched: A Batched BLAS Approach for Small Matrix Factorizations and Applications on GPUs 2,194 views
Parallel Viewshed Analysis on GPU Using CUDA 2,194 views
Efficient Convolutional Neural Networks for Pixelwise Classification on Heterogeneous Hardware Systems 2,194 views
Parallelization techniques of the x264 video encoder 2,193 views
A First Order Primal-Dual Algorithm for Nonconvex TV^q Regularization 2,193 views
A dataflow-like programming model for future hybrid clusters 2,192 views
BFROST: Binary Features from Robust Orientation Segment Tests accelerated on the GPU 2,192 views
Computational Modelling of Galaxy Formation using FLAME GPU 2,192 views
Brute force de-shredding algorithm using the GPU 2,192 views
Quadratic Pseudo-Boolean Optimization for Scene Analysis using CUDA 2,191 views
Efficient molecular dynamics simulations with many-body potentials on graphics processing units 2,191 views
Performance of PETSc GPU Implementation with Sparse Matrix Storage Schemes 2,191 views
Parallelizing General Histogram Application for CUDA Architectures 2,190 views
GeauxDock: Accelerating Structure-Based Virtual Screening with Heterogeneous Computing 2,190 views
A Straightforward Preprocessing Approach for Accelerating Convex Hull Computations on the GPU 2,190 views
A Survey On Parallelization Of Data Mining Techniques 2,190 views
Optimizing Xeon Phi for Interactive Data Analysis 2,189 views
An Efficient Parallel Algorithm for Graph Isomorphism on GPU using CUDA 2,189 views
Variants of Jump Flooding Algorithm for Computing Discrete Voronoi Diagrams 2,188 views
KD-tree acceleration structures for a GPU raytracer 2,188 views
Algorithm 9xx: Sparse QR Factorization on the GPU 2,188 views
Methods for Accelerating Machine Learning in High Performance Computing 2,188 views
Object Oriented Framework for CUDA based Pyramidal Image Blending 2,188 views
Portable GPU-Based Artificial Neural Networks for Accelerated Data-Driven Modeling 2,187 views
A Parallel PSO Algorithm for a Watermarking Application on a GPU 2,187 views
A GPU-accelerated Direct-sum Boundary Integral Poisson-Boltzmann Solver 2,187 views
Sort-First Parallel Volume Rendering 2,187 views
Parallel Genetic Algorithm Solving 0/1 Knapsack Problem Running on the GPU 2,187 views
StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines 2,186 views
Stable fluids 2,186 views
Efficient Implementation of MrBayes on multi-GPU 2,186 views
Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical Volumetric Image Segmentation 2,186 views
Understanding the Costs of Many-Task Computing Workloads on Intel Xeon Phi Coprocessors 2,186 views
Dense photometric stereo reconstruction on many core GPUs 2,186 views
High Quality Elliptical Texture Filtering on GPU 2,186 views
SOCL: An OpenCL Implementation with Automatic Multi-Device Adaptation Support 2,185 views
Metamorphic Testing for (Graphics) Compilers 2,184 views
Application of the Mean Field Methods to MRF Optimization in Computer Vision 2,184 views
FPGA-Accelerated Image Processing Using High Level Synthesis with OpenCL 2,184 views
Somoclu: An Efficient Distributed Library for Self-Organizing Maps 2,184 views
Scaling High Performance Domain-Specific Language Implementation with Delite 2,183 views
Power and Performance Analysis of GPU-Accelerated Systems 2,182 views
Precision and Performance: Floating Point and IEEE 754 Compliance for NVIDIA GPUs 2,182 views
Solving 3D viscous incompressible Navier-Stokes equations using CUDA 2,182 views
Artificial neural network computation on graphic process unit 2,182 views
The discrete dipole approximation code DDscat.C++: features, limitations and plans 2,181 views
Graphics processing unit (GPU) programming strategies and trends in GPU computing 2,181 views
A Locality-Aware Memory Hierarchy for Energy-Efficient GPU Architectures 2,181 views
A Bi-objective Optimization Framework for Query Plans 2,181 views
A Hybrid-parallel Architecture for Applications in Bioinformatics 2,181 views
Research on DSP-GPU Heterogeneous Computing System 2,181 views
Framework for Batched and GPU-resident Factorization Algorithms Applied to Block Householder Transformations 2,181 views
A GPU implementation of EGSnrc’s Monte Carlo photon transport for imaging applications 2,181 views
Task-based FMM for heterogeneous architectures 2,181 views
Memory-Efficient Implementation of DenseNets 2,181 views
SPOC: GPGPU Programming Through Stream Processing With OCaml 2,180 views
Using P System with GPU Model to Design and Implement a Public Key Cryptography 2,180 views
Efficient implementation for MD5-RC4 encryption using GPU with CUDA 2,180 views
Scalable Multi-GPU Simulation of Long-Range Molecular Dynamics 2,180 views
Modelling, simulating and visualising the Cahn-Hilliard-Cook field equation 2,180 views
Fast Sparse Matrix Multiplication on GPU 2,180 views
Compiler Optimizations for SIMD/GPU/Multicore Architectures 2,180 views
OpenCL-ready High Speed FPGA Network for Reconfigurable High Performance Computing 2,179 views
Initial condition for efficient mapping of level set algorithms on many-core architectures 2,179 views
Improving the Performance of OpenCL-based FPGA Accelerator for Convolutional Neural Network 2,179 views
Efficient Target and Application Specific Selection and Ordering of Compiler Passes 2,179 views
Comparative Analysis of OpenACC, OpenMP and CUDA using Sequential and Parallel Algorithms 2,177 views
Bayesian State-Space Modelling on High-Performance Hardware Using LibBi 2,177 views
High Performance Computing via High Level Synthesis 2,176 views
Accelerating calculations of RNA secondary structure partition functions using GPUs 2,176 views
Performance modeling of atomic additions on GPU scratchpad memory 2,176 views
Optimizing Communication by Compression for Multi-GPU Scalable Breadth-First Searches 2,175 views
SAGA: SystemC Acceleration on GPU Architectures 2,175 views
VirtCL: a framework for OpenCL device abstraction and management 2,175 views
High-speed volume ray casting with CUDA 2,175 views
FPGA vs. multi-core CPUs vs. GPUs: hands-on experience with a sorting application 2,175 views
Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units 2,175 views
Experience with Intel’s Many Integrated Core architecture in ATLAS software 2,174 views
Adaptive GPU Array Layout Auto-Tuning 2,174 views
GPU accelerated pathfinding 2,174 views
Graphics Processing Unit (GPU) Implementation Methodology of AERMOD Model 2,174 views
Titles: 100
Total views: 218,515
- Programming - 186,131 views
- Login - 164,409 views
- User dashboard - 90,767 views
- Paper titles list - 70,168 views
- Add new event - 64,599 views
- Add new post - 59,379 views
- Register - 49,237 views
- Statistics - 36,639 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,167 views
- Books on OpenCL and CUDA - 28,826 views