2402

Views of posts on hgpu.org

GPIC – GPU Power Iteration Cluster  1,338 views

Accelerating Deep Neural Networks implementation: A survey  1,338 views

A small-world network model for distributed storage of semantic metadata  1,338 views

Many-threaded implementation of differential evolution for the CUDA platform  1,338 views

Programming Challenges for the Implementation of Numerical Quadrature in Atomic Physics on FPGA and GPU Accelerators  1,338 views

A Performance Study for Iterative Stencil Loops on GPUs with Ghost Zone Optimizations  1,338 views

An Optimized Multiple Right-Hand Side Dslash Kernel for Intel Xeon Phi  1,338 views

Automatic Termination Analysis for GPU Kernels  1,338 views

Anytime Algorithms for GPU Architectures  1,338 views

A survey of BRDF models for computer graphics  1,337 views

Graphics processing unit implementation of lattice Boltzmann models for flowing soft systems  1,337 views

Experiments with Single Core, Multi-core, and GPU Based Computation of Cellular Automata  1,337 views

GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition  1,337 views

Studies on CUDA Offloading for Real-Time Simulation and Visualization  1,337 views

Analysis of periodic structures with GPU accelerating  1,337 views

Performance Analysis of the OP2 Framework on Many-core Architectures  1,337 views

Improving the Performance of CA-GMRES on Multicores with Multiple GPUs  1,337 views

Block based Singular Value Decomposition approach to matrix factorization for recommender systems  1,337 views

Optimizing GPU to GPU Communication on Cray XK7  1,337 views

Accelerating linear system solutions using randomization techniques  1,337 views

Quantifying NUMA and contention effects in multi-GPU systems  1,336 views

Performance in GPU Architectures: Potentials and Distances  1,336 views

Physically-Based Interactive Flow Visualization Based on Schlieren and Interferometry Experimental Techniques  1,336 views

GPU-based real-time acoustical occlusion modeling  1,336 views

Increasing predictability of GPU’s  1,336 views

Online Adaptive Code Generation and Tuning  1,336 views

An Efficient Stream Buffer Mechanism for Dataflow Execution on Heterogeneous Platforms with GPUs  1,336 views

Acceleration of a Locally Tuned Sine Non Linear Video Enhancement Algorithm on GPGPU  1,336 views

Hardware Accelerators for Cartesian Genetic Programming  1,335 views

A GPU-tailored approach for training kernelized SVMs  1,335 views

Auto-Generation of Parallel Finite-Differencing Code for MPI, TBB and CUDA  1,335 views

Decoupled Access/Execute Metaprogramming for GPU-Accelerated Systems  1,335 views

SLATE port to AMD and Intel platforms  1,335 views

Co-processor acceleration of an unmodified parallel solid mechanics code with FEASTGPU  1,335 views

FusionAccel: A General Re-configurable Deep Learning Inference Accelerator on FPGA for Convolutional Neural Networks  1,335 views

Real-time Medical Image Volume Rendering Based on GPU Accelerated Method  1,335 views

Efficient Intranode Communication in GPU-Accelerated Systems  1,335 views

Automatically Tuned Dense Linear Algebra for Multicore+GPU  1,335 views

Toward efficient GPU-accelerated N-body simulations  1,335 views

Deforming a High-Resolution Mesh in Real-Time by Mapping onto a Low-Resolution Physical Model  1,334 views

Line-art Illustration of Dynamic and Specular Surfaces  1,334 views

Feature Generation for Quantification of Visual Similarity  1,334 views

GPGPU supported cooperative acceleration in molecular dynamics  1,334 views

Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System  1,334 views

Hera-JVM: a runtime system for heterogeneous multi-core architectures  1,334 views

Towards systematic exploration of tradeoffs for medical image registration on heterogeneous platforms  1,334 views

GPU accelerated FDTD solver and its application in MRI  1,334 views

Task and Data Distribution in Hybrid Parallel Systems  1,334 views

Two-Level Approach to Efficient Visualization of Protein Dynamics  1,333 views

CANSCID-CUDA  1,333 views

Load Balancing Utilizing Data Redundancy in Distributed Volume Rendering  1,333 views

Dependable Embedded Systems  1,333 views

Program Optimization of Array-Intensive SPEC2k Benchmarks on Multithreaded GPU Using CUDA and Brook+  1,333 views

Development of Krylov and AMG linear solvers for large-scale sparse matrices on GPUs  1,333 views

Petascale visualization: Approaches and initial results  1,333 views

GPUQT: An efficient linear-scaling quantum transport code fully implemented on graphics processing units  1,333 views

Directives Based Programming of GPU Accelerated Systems  1,333 views

GPU Rigid Skinning based on a Refined Skeletonization Method  1,333 views

Realtime phase-based optical flow on the GPU  1,333 views

High throughput multiple-precision GCD on the CUDA architecture  1,333 views

An Explicit Algorithm for Porous Media Flow Simulation using GPUs  1,332 views

High Performance Data Mining Using R on Heterogeneous Platforms  1,332 views

Real time ultrasound image denoising  1,332 views

Accelerating Stochastic Simulations on GPUs Using OpenCL  1,332 views

Achieving High Throughput Sequencing with Graphics Processing Units  1,332 views

Accelerating data mining workloads: current approaches and future challenges in system architecture design  1,332 views

Linear Algebra Algorithms for Hybrid Architectures with XKaapi  1,332 views

A framework for lab-based real-time video analysis on distributed camera networks  1,332 views

Parallel Streaming Intra Prediction for Full HD H.264 Encoding  1,332 views

An events based algorithm for distributing concurrent tasks on multi-core architectures  1,332 views

Building a Real-Time Multi-GPU Platform: Robust Real-Time Interrupt Handling Despite Closed-Source Drivers  1,332 views

GEVO: GPU Code Optimization using Evolutionary Computation  1,332 views

Stadium Hashing: Scalable and Flexible Hashing on GPUs  1,332 views

GPU-based multi-view rendering for spatial-multiplex autostereoscopic displays  1,332 views

Multi-core programming with OpenCL: performance and portability: OpenCL in a memory bound scenario  1,332 views

Power performance analysis of 3-D finite element mesh refinement with tetrahedra by CUDA/MPI on multi-core and GPU architecture  1,332 views

Accelerating Beam Dynamics Simulations with GPUs  1,331 views

Sub-seasonal forecasting with a large ensemble of deep-learning weather prediction models  1,331 views

Parallel Cycle Based Logic Simulation Using Graphics Processing Units  1,331 views

Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications  1,331 views

Fine-grained parallelization of a Vlasov-Poisson application on GPU  1,331 views

A Self-Optimizing Framework for Developing Metrology Software on Massive Parallel Processor Architectures  1,331 views

Acceleration of a Full-scale Industrial CFD Application with OP2  1,331 views

Using of New Possibilities of Fermi Architecture by Development of GPGPU Programs  1,331 views

Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms  1,331 views

Balancing locality and concurrency: solving sparse triangular systems on GPUs  1,331 views

Anisotropic noise  1,331 views

GPU-based implementation of a cerebellar spiking network model for realtime robot control  1,331 views

Automatic Scheduling of Compute Kernels Across Heterogeneous Architectures  1,331 views

Optimized GPU Framework for Ultrasound Color Flow Imaging  1,330 views

Experimentation Procedure for Offloaded Mini-Apps Executed on Cluster Architectures with Xeon Phi Accelerators  1,330 views

Leveraging Binary Translation for Heterogeneous Profiling  1,330 views

Designing Fast Architecture Sensitive Tree Search on Modern Multi-Core/Many-Core Processors  1,330 views

Cloudlet-screen computing: A multi-core-based, cloud-computing-oriented, traditional-computing-compatible parallel computing Paradigm for the masses  1,330 views

GPU-Enabled AI  1,330 views

Rendering of 3D Dynamic Virtual Environments  1,330 views

FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow  1,330 views

Importance of Data Loading Pipeline in Training Deep Neural Networks  1,329 views

Performance analysis and optimization of a CFD application  1,329 views

Efficient nearest-neighbor computation for GPU-based motion planning  1,329 views

 

Brief statistics for this page

Titles: 100

Total views: 133366

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: