2402

Views of posts on hgpu.org

Dynamical simulations of extrasolar planetary systems with debris disks using a GPU accelerated N-body code  2,196 views

Formal Semantics of Heterogeneous CUDA-C: A Modular Approach with Applications  2,196 views

Evolutionary Simulation of Life Using CUDA  2,196 views

Acceleration of Deep Learning on FPGA  2,196 views

Approximate dynamic programming with post-decision states as a solution method for dynamic economic models  2,195 views

Lattice Boltzmann Simulations of Multiphase Flows  2,195 views

Analysis & Design of Efficient Cryptographic Systems  2,195 views

GPU-MEME: Using Graphics Hardware to Accelerate Motif Finding in DNA Sequences  2,195 views

The GPU vs Phi Debate: Risk Analytics Using Many-Core Computing  2,194 views

Optimal Configuration of GPU Cache Memory to Maximize the Performance  2,194 views

Interactive GPU active contours for segmenting inhomogeneous objects  2,194 views

Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning  2,194 views

Fast Estimation of Gaussian Mixture Model Parameters on GPU using CUDA  2,194 views

A GPU-based Large-scale Monte Carlo Simulation Method for Systems with Long-range Interactions  2,194 views

MAGMA Batched: A Batched BLAS Approach for Small Matrix Factorizations and Applications on GPUs  2,194 views

Parallel Viewshed Analysis on GPU Using CUDA  2,194 views

Efficient Convolutional Neural Networks for Pixelwise Classification on Heterogeneous Hardware Systems  2,194 views

Towards accelerating Smoothed Particle Hydrodynamics simulations for free-surface flows on multi-GPU clusters  2,193 views

Parallelization techniques of the x264 video encoder  2,193 views

A First Order Primal-Dual Algorithm for Nonconvex TV^q Regularization  2,193 views

A dataflow-like programming model for future hybrid clusters  2,192 views

BFROST: Binary Features from Robust Orientation Segment Tests accelerated on the GPU  2,192 views

Computational Modelling of Galaxy Formation using FLAME GPU  2,192 views

Brute force de-shredding algorithm using the GPU  2,192 views

Quadratic Pseudo-Boolean Optimization for Scene Analysis using CUDA  2,191 views

Efficient molecular dynamics simulations with many-body potentials on graphics processing units  2,191 views

Performance of PETSc GPU Implementation with Sparse Matrix Storage Schemes  2,191 views

Parallelizing General Histogram Application for CUDA Architectures  2,190 views

GeauxDock: Accelerating Structure-Based Virtual Screening with Heterogeneous Computing  2,190 views

A Straightforward Preprocessing Approach for Accelerating Convex Hull Computations on the GPU  2,190 views

A Survey On Parallelization Of Data Mining Techniques  2,190 views

Optimizing Xeon Phi for Interactive Data Analysis  2,189 views

An Efficient Parallel Algorithm for Graph Isomorphism on GPU using CUDA  2,189 views

Variants of Jump Flooding Algorithm for Computing Discrete Voronoi Diagrams  2,188 views

KD-tree acceleration structures for a GPU raytracer  2,188 views

Algorithm 9xx: Sparse QR Factorization on the GPU  2,188 views

Methods for Accelerating Machine Learning in High Performance Computing  2,188 views

Object Oriented Framework for CUDA based Pyramidal Image Blending  2,188 views

Portable GPU-Based Artificial Neural Networks for Accelerated Data-Driven Modeling  2,187 views

A Parallel PSO Algorithm for a Watermarking Application on a GPU  2,187 views

A GPU-accelerated Direct-sum Boundary Integral Poisson-Boltzmann Solver  2,187 views

Sort-First Parallel Volume Rendering  2,187 views

Parallel Genetic Algorithm Solving 0/1 Knapsack Problem Running on the GPU  2,187 views

StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines  2,186 views

Technical Report about Tiramisu: a Three-Layered Abstraction for Hiding Hardware Complexity from DSL Compilers  2,186 views

Stable fluids  2,186 views

Efficient Implementation of MrBayes on multi-GPU  2,186 views

Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical Volumetric Image Segmentation  2,186 views

Understanding the Costs of Many-Task Computing Workloads on Intel Xeon Phi Coprocessors  2,186 views

Dense photometric stereo reconstruction on many core GPUs  2,186 views

High Quality Elliptical Texture Filtering on GPU  2,186 views

SOCL: An OpenCL Implementation with Automatic Multi-Device Adaptation Support  2,185 views

Metamorphic Testing for (Graphics) Compilers  2,184 views

Application of the Mean Field Methods to MRF Optimization in Computer Vision  2,184 views

FPGA-Accelerated Image Processing Using High Level Synthesis with OpenCL  2,184 views

Somoclu: An Efficient Distributed Library for Self-Organizing Maps  2,184 views

Scaling High Performance Domain-Specific Language Implementation with Delite  2,183 views

Power and Performance Analysis of GPU-Accelerated Systems  2,182 views

Precision and Performance: Floating Point and IEEE 754 Compliance for NVIDIA GPUs  2,182 views

Solving 3D viscous incompressible Navier-Stokes equations using CUDA  2,182 views

Artificial neural network computation on graphic process unit  2,182 views

The discrete dipole approximation code DDscat.C++: features, limitations and plans  2,181 views

Graphics processing unit (GPU) programming strategies and trends in GPU computing  2,181 views

A Locality-Aware Memory Hierarchy for Energy-Efficient GPU Architectures  2,181 views

A Bi-objective Optimization Framework for Query Plans  2,181 views

A Hybrid-parallel Architecture for Applications in Bioinformatics  2,181 views

Research on DSP-GPU Heterogeneous Computing System  2,181 views

Framework for Batched and GPU-resident Factorization Algorithms Applied to Block Householder Transformations  2,181 views

Hetero-DB: Next Generation High-Performance Database Systems by Best Utilizing Heterogeneous Computing and Storage Resources  2,181 views

A GPU implementation of EGSnrc’s Monte Carlo photon transport for imaging applications  2,181 views

Task-based FMM for heterogeneous architectures  2,181 views

Memory-Efficient Implementation of DenseNets  2,181 views

SPOC: GPGPU Programming Through Stream Processing With OCaml  2,180 views

Using P System with GPU Model to Design and Implement a Public Key Cryptography  2,180 views

Efficient implementation for MD5-RC4 encryption using GPU with CUDA  2,180 views

Scalable Multi-GPU Simulation of Long-Range Molecular Dynamics  2,180 views

Modelling, simulating and visualising the Cahn-Hilliard-Cook field equation  2,180 views

Fast Sparse Matrix Multiplication on GPU  2,180 views

Compiler Optimizations for SIMD/GPU/Multicore Architectures  2,180 views

OpenCL-ready High Speed FPGA Network for Reconfigurable High Performance Computing  2,179 views

Initial condition for efficient mapping of level set algorithms on many-core architectures  2,179 views

Improving the Performance of OpenCL-based FPGA Accelerator for Convolutional Neural Network  2,179 views

An efficient MPI/OpenMP parallelization of the Hartree-Fock method for the second generation of Intel Xeon Phi processor  2,179 views

Efficient Target and Application Specific Selection and Ordering of Compiler Passes  2,179 views

Parallel implementation of the wideband DOA algorithm on single core, multicore, GPU and IBM cell BE processor  2,178 views

Comparative Analysis of OpenACC, OpenMP and CUDA using Sequential and Parallel Algorithms  2,177 views

Bayesian State-Space Modelling on High-Performance Hardware Using LibBi  2,177 views

High Performance Computing via High Level Synthesis  2,176 views

Accelerating calculations of RNA secondary structure partition functions using GPUs  2,176 views

Performance modeling of atomic additions on GPU scratchpad memory  2,176 views

Optimizing Communication by Compression for Multi-GPU Scalable Breadth-First Searches  2,175 views

SAGA: SystemC Acceleration on GPU Architectures  2,175 views

VirtCL: a framework for OpenCL device abstraction and management  2,175 views

High-speed volume ray casting with CUDA  2,175 views

FPGA vs. multi-core CPUs vs. GPUs: hands-on experience with a sorting application  2,175 views

Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units  2,175 views

Experience with Intel’s Many Integrated Core architecture in ATLAS software  2,174 views

Adaptive GPU Array Layout Auto-Tuning  2,174 views

GPU accelerated pathfinding  2,174 views

Graphics Processing Unit (GPU) Implementation Methodology of AERMOD Model  2,174 views

 

Brief statistics for this page

Titles: 100

Total views: 218515

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: