2402

Views of posts on hgpu.org

Implementing AES on GPU: Final Report  1,917 views

ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers  1,917 views

Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs  1,916 views

Arioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space  1,916 views

Activity recognition from videos with parallel hypergraph matching on GPUs  1,916 views

Algorithms acceleration of pattern-matching in multi-core architectures  1,916 views

A novel parallel Tier-1 coder for JPEG2000 using GPUs  1,916 views

A framework for efficient execution on GPU and CPU+GPU systems  1,916 views

Implementing an architecture for efficient network traffic processing on modern graphics hardware  1,916 views

Parallel programming on GPU using Intel Array Building Blocks  1,915 views

Unified Particle Physics for Real-Time Applications  1,915 views

Implementing a Preconditioned Iterative Linear Solver Using Massively Parallel Graphics Processing Units  1,915 views

Learning Sparse Recurrent Neural Networks in Language Modeling  1,915 views

Multi-agent traffic simulation with CUDA  1,915 views

Interactive Collision Detection for Deformable Models Using Streaming AABBs  1,915 views

MATLAB Parallelization through Scalarization  1,914 views

Interactive BRDF Estimation for Mixed-Reality Applications  1,914 views

MapCG: writing parallel program portable between CPU and GPU  1,914 views

How to Correctly Deal With Pseudorandom Numbers in Manycore Environments – Application to GPU programming with Shoverand  1,914 views

A Lattice Boltzmann Method Simulator for Microfluidics on GPU Cluster  1,913 views

A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs  1,913 views

Sequential Consistency for Heterogeneous-Race-Free: Programmer-centric Memory Models for Heterogeneous Platforms  1,913 views

Automatic NUMA Characterization using Cbench  1,913 views

GPU-based Space Situational Awareness Simulation utilising parallelism for enhanced multi-sensor management  1,913 views

The International Exascale Software Project roadmap  1,913 views

A High Performance Framework for Coupled Urban Microclimate Models  1,913 views

Self-calibration of geometric and radiometric parameters for cone-beam computed tomography  1,913 views

Axel: a heterogeneous cluster with FPGAs and GPUs  1,913 views

Massively Parallel Lossless Compression of Medical Images Using Least-Squares Prediction and Arithmetic Coding  1,913 views

CUDA Implementation of a Lattice Boltzmann Method and Code Optimization  1,913 views

Acceleration of finite-difference time-domain (FDTD) using graphics processor units (GPU)  1,913 views

A New Software Based GPU Framework  1,913 views

Nonlinear dynamic finite element analysis with GPU  1,912 views

Computational wave optics library for C++: CWO++ library  1,912 views

Efficient Shallow Water Simulations on GPUs  1,912 views

Accelerating Iterative SpMV for Discrete Logarithm Problem using GPUs  1,912 views

Enabling New Uses for GPUs  1,912 views

Investigating Half Precision Arithmetic to Accelerate Dense Linear System Solvers  1,911 views

Accelerating Protein Sequence Search in a Heterogeneous Computing System  1,911 views

Performance Portability Study of Linear Algebra Kernels in OpenCL  1,911 views

An Optimized Parallel IDCT on Graphics Processing Units  1,911 views

Processing OLTP Workloads on Hybrid CPU/GPU Systems  1,911 views

Heterogenous Acceleration for Linear Algebra in Multi-Coprocessor Environments  1,911 views

Dynamic Instrumentation and Optimization for GPU Applications  1,911 views

Caffeine: Towards Uniformed Representation and Acceleration for Deep Convolutional Neural Networks  1,911 views

Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison  1,911 views

GPU Accelerated framework for financial nested simulations  1,910 views

An Improved Image Segmentation Algorithm Based on GPU Parallel Computing  1,910 views

Multi-fragment effects on the GPU using the k-buffer  1,910 views

Multicore architecture and cache optimization techniques for solving graph problems  1,910 views

ParadisEO-MO-GPU: a Framework for Parallel GPU-based Local Search Metaheuristics  1,910 views

GPU Computing for Meshfree Particle Method  1,910 views

Global Point Mascon Models for Simple, Accurate and Parallel Geopotential Computation  1,909 views

Extending Scala with General Purpose GPU Programming  1,909 views

Design Space Exploration of OpenCL Applications on Heterogeneous Parallel Platforms  1,909 views

Flexible N-Way MIMO Detector on GPU  1,909 views

A Novel GPU Implementation of Eigen Analysis for Risk Management  1,909 views

The application of GPU particle tracing to diffusion tensor field visualization  1,909 views

Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems  1,909 views

A new parallel video understanding and retrieval system  1,909 views

Vector and Line Quantization for Billion-scale Similarity Search on GPUs  1,909 views

Differential Evolution with parallelised objective functions using CUDA  1,909 views

A Practical Quicksort Algorithm for Graphics Processors  1,908 views

ElastiFace: Matching and Blending Textured Faces  1,908 views

A Unified Runtime System for Heterogeneous Multi-core Architectures  1,908 views

Efficient implementation of multiuser precoding algorithms on GPU for MIMO-OFDM systems  1,908 views

Specification and Verification of GPGPU Programs using Permission-Based Separation Logic  1,908 views

Methods and Metrics for Fair Server Assessment under Real-Time Financial Workloads  1,908 views

Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications  1,907 views

Sorting On A Graphics Processing Unit (GPU)  1,907 views

CAVE-CL: An OpenCL version of the package for detection and quantitative analysis of internal cavities in a system of overlapping balls: application to proteins  1,907 views

PFunc: modern task parallelism for modern high performance computing  1,907 views

Parallel, distributed and GPU computing technologies in single-particle electron microscopy  1,907 views

Platform 2012, a Many-Core Computing Accelerator for Embedded SoCs: Performance Evaluation of Visual Analytics Applications  1,907 views

Accelerating wavelet-based video coding on graphics hardware using CUDA  1,907 views

Hybrid Monte Carlo with Wilson Dirac operator on the Fermi GPU  1,907 views

Frameworks for multi-core architectures: a comprehensive evaluation using 2D/3D image registration  1,907 views

Accelerated Combinatorial Optimization using Graphics Processing Units and C++ AMP  1,907 views

SWPS3 – fast multi-threaded vectorized Smith-Waterman for IBM Cell/B.E. and x86/SSE2  1,907 views

SkePU: a multi-backend skeleton programming library for multi-GPU systems  1,906 views

Neurokernel: An Open Scalable Software Framework for Emulation and Validation of Drosophila Brain Models on Multiple GPUs  1,906 views

A new ray-tracing scheme for 3D diffuse radiation transfer on highly parallel architectures  1,906 views

Stealing Webpages Rendered on Your Browser by Exploiting GPU Vulnerabilities  1,906 views

CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms  1,905 views

Accelerating Mean Shift Segmentation Algorithm on Hybrid CPU/GPU Platforms  1,905 views

A Scalable Multi-Path Microarchitecture for Efficient GPU Control Flow  1,905 views

Vortex Methods for Fluid Simulation in Computer Graphics  1,905 views

Adapting data processing methods to modern GPU architecture  1,905 views

Processing Hard Sphere Collisions on a GPU Using OpenCL  1,905 views

An Improved CUDA-Based Implementation of Differential Evolution on GPU  1,905 views

Fine-Grained Parallel Incomplete LU Factorization  1,905 views

Solving prime-field ECDLPs on GPUs with OpenCL  1,905 views

High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation  1,905 views

DjiNN and Tonic: DNN as a Service and Its Implications for Future Warehouse Scale Computers  1,905 views

Hardware/Software Co-design for Energy-Efficient Seismic Modeling  1,905 views

Optimized Parallel Implementation of Gillespie’s First Reaction Method on Graphics Processing Units  1,905 views

Speedup of Micromagnetic Simulations with C++ AMP On Graphics Processing Units  1,904 views

Facial Expression Recognition – Review  1,904 views

An Optimization for Fast Generation of Digital Hologram  1,904 views

Bacon: A GPU Programming System With Just in Time Specialization  1,904 views

 

Brief statistics for this page

Titles: 100

Total views: 191000

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: