Views of posts on hgpu.org
Implementing AES on GPU: Final Report 1,917 views
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers 1,917 views
Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs 1,916 views
Arioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space 1,916 views
Activity recognition from videos with parallel hypergraph matching on GPUs 1,916 views
Algorithms acceleration of pattern-matching in multi-core architectures 1,916 views
A novel parallel Tier-1 coder for JPEG2000 using GPUs 1,916 views
A framework for efficient execution on GPU and CPU+GPU systems 1,916 views
Implementing an architecture for efficient network traffic processing on modern graphics hardware 1,916 views
Parallel programming on GPU using Intel Array Building Blocks 1,915 views
Unified Particle Physics for Real-Time Applications 1,915 views
Implementing a Preconditioned Iterative Linear Solver Using Massively Parallel Graphics Processing Units 1,915 views
Learning Sparse Recurrent Neural Networks in Language Modeling 1,915 views
Multi-agent traffic simulation with CUDA 1,915 views
Interactive Collision Detection for Deformable Models Using Streaming AABBs 1,915 views
MATLAB Parallelization through Scalarization 1,914 views
Interactive BRDF Estimation for Mixed-Reality Applications 1,914 views
MapCG: writing parallel program portable between CPU and GPU 1,914 views
A Lattice Boltzmann Method Simulator for Microfluidics on GPU Cluster 1,913 views
A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs 1,913 views
Automatic NUMA Characterization using Cbench 1,913 views
GPU-based Space Situational Awareness Simulation utilising parallelism for enhanced multi-sensor management 1,913 views
The International Exascale Software Project roadmap 1,913 views
A High Performance Framework for Coupled Urban Microclimate Models 1,913 views
Self-calibration of geometric and radiometric parameters for cone-beam computed tomography 1,913 views
Axel: a heterogeneous cluster with FPGAs and GPUs 1,913 views
CUDA Implementation of a Lattice Boltzmann Method and Code Optimization 1,913 views
Acceleration of finite-difference time-domain (FDTD) using graphics processor units (GPU) 1,913 views
A New Software Based GPU Framework 1,913 views
Nonlinear dynamic finite element analysis with GPU 1,912 views
Computational wave optics library for C++: CWO++ library 1,912 views
Efficient Shallow Water Simulations on GPUs 1,912 views
Accelerating Iterative SpMV for Discrete Logarithm Problem using GPUs 1,912 views
Enabling New Uses for GPUs 1,912 views
Investigating Half Precision Arithmetic to Accelerate Dense Linear System Solvers 1,911 views
Accelerating Protein Sequence Search in a Heterogeneous Computing System 1,911 views
Performance Portability Study of Linear Algebra Kernels in OpenCL 1,911 views
An Optimized Parallel IDCT on Graphics Processing Units 1,911 views
Processing OLTP Workloads on Hybrid CPU/GPU Systems 1,911 views
Heterogenous Acceleration for Linear Algebra in Multi-Coprocessor Environments 1,911 views
Dynamic Instrumentation and Optimization for GPU Applications 1,911 views
Caffeine: Towards Uniformed Representation and Acceleration for Deep Convolutional Neural Networks 1,911 views
Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison 1,911 views
GPU Accelerated framework for financial nested simulations 1,910 views
An Improved Image Segmentation Algorithm Based on GPU Parallel Computing 1,910 views
Multi-fragment effects on the GPU using the k-buffer 1,910 views
Multicore architecture and cache optimization techniques for solving graph problems 1,910 views
ParadisEO-MO-GPU: a Framework for Parallel GPU-based Local Search Metaheuristics 1,910 views
GPU Computing for Meshfree Particle Method 1,910 views
Global Point Mascon Models for Simple, Accurate and Parallel Geopotential Computation 1,909 views
Extending Scala with General Purpose GPU Programming 1,909 views
Design Space Exploration of OpenCL Applications on Heterogeneous Parallel Platforms 1,909 views
Flexible N-Way MIMO Detector on GPU 1,909 views
A Novel GPU Implementation of Eigen Analysis for Risk Management 1,909 views
The application of GPU particle tracing to diffusion tensor field visualization 1,909 views
Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems 1,909 views
A new parallel video understanding and retrieval system 1,909 views
Vector and Line Quantization for Billion-scale Similarity Search on GPUs 1,909 views
Differential Evolution with parallelised objective functions using CUDA 1,909 views
A Practical Quicksort Algorithm for Graphics Processors 1,908 views
ElastiFace: Matching and Blending Textured Faces 1,908 views
A Unified Runtime System for Heterogeneous Multi-core Architectures 1,908 views
Efficient implementation of multiuser precoding algorithms on GPU for MIMO-OFDM systems 1,908 views
Specification and Verification of GPGPU Programs using Permission-Based Separation Logic 1,908 views
Methods and Metrics for Fair Server Assessment under Real-Time Financial Workloads 1,908 views
Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications 1,907 views
Sorting On A Graphics Processing Unit (GPU) 1,907 views
PFunc: modern task parallelism for modern high performance computing 1,907 views
Parallel, distributed and GPU computing technologies in single-particle electron microscopy 1,907 views
Accelerating wavelet-based video coding on graphics hardware using CUDA 1,907 views
Hybrid Monte Carlo with Wilson Dirac operator on the Fermi GPU 1,907 views
Frameworks for multi-core architectures: a comprehensive evaluation using 2D/3D image registration 1,907 views
Accelerated Combinatorial Optimization using Graphics Processing Units and C++ AMP 1,907 views
SWPS3 – fast multi-threaded vectorized Smith-Waterman for IBM Cell/B.E. and x86/SSE2 1,907 views
SkePU: a multi-backend skeleton programming library for multi-GPU systems 1,906 views
A new ray-tracing scheme for 3D diffuse radiation transfer on highly parallel architectures 1,906 views
Stealing Webpages Rendered on Your Browser by Exploiting GPU Vulnerabilities 1,906 views
CaffePresso: An Optimized Library for Deep Learning on Embedded Accelerator-based platforms 1,905 views
Accelerating Mean Shift Segmentation Algorithm on Hybrid CPU/GPU Platforms 1,905 views
A Scalable Multi-Path Microarchitecture for Efficient GPU Control Flow 1,905 views
Vortex Methods for Fluid Simulation in Computer Graphics 1,905 views
Adapting data processing methods to modern GPU architecture 1,905 views
Processing Hard Sphere Collisions on a GPU Using OpenCL 1,905 views
An Improved CUDA-Based Implementation of Differential Evolution on GPU 1,905 views
Fine-Grained Parallel Incomplete LU Factorization 1,905 views
Solving prime-field ECDLPs on GPUs with OpenCL 1,905 views
DjiNN and Tonic: DNN as a Service and Its Implications for Future Warehouse Scale Computers 1,905 views
Hardware/Software Co-design for Energy-Efficient Seismic Modeling 1,905 views
Optimized Parallel Implementation of Gillespie’s First Reaction Method on Graphics Processing Units 1,905 views
Speedup of Micromagnetic Simulations with C++ AMP On Graphics Processing Units 1,904 views
Facial Expression Recognition – Review 1,904 views
An Optimization for Fast Generation of Digital Hologram 1,904 views
Bacon: A GPU Programming System With Just in Time Specialization 1,904 views
Titles: 100
Total views: 191000
- Programming - 186,133 views
- Login - 164,567 views
- User dashboard - 91,310 views
- Paper titles list - 71,318 views
- Add new event - 64,811 views
- Add new post - 59,607 views
- Register - 49,319 views
- Statistics - 37,171 views
- Modification of self-organizing migration algorithm for OpenCL framework - 34,190 views
- Books on OpenCL and CUDA - 28,900 views