Views of posts on hgpu.org

Fast Greeks: Case of Credit Valuation Adjustments  2,003 views

Thousand core chips: a technology perspective  2,003 views

Speckle Reduction with Trained Nonlinear Diffusion Filtering  2,002 views

GPU-based Implementation of 128-bit Secure Eta Pairing Over a Binary Field  2,002 views

Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems  2,002 views

Deep Learning For Smile Recognition  2,002 views

Analysis of illumination conditions at the lunar south pole using parallel computing techniques  2,002 views

Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures  2,002 views

A Unified Optimizing Compiler Framework for Different GPGPU Architectures  2,002 views

Acceleration of Coarse Grain Molecular Dynamics on GPU Architectures  2,002 views

Exploring Design Space of 3D NVM and eDRAM Caches Using DESTINY Tool (open-source code)  2,002 views

dMath: A Scalable Linear Algebra and Math Library for Heterogeneous GP-GPU Architectures  2,001 views

A TBB-CUDA Implementation for Background Removal in a video-based Fire Detection System  2,001 views

Image-Space Caustics and Curvatures  2,001 views

Graphics processor unit (GPU) acceleration of finite-difference time-domain (FDTD) algorithm  2,000 views

Boosting GPU Virtualization Performance with Hybrid Shadow Page Tables  2,000 views

Arbitrary dimension Reed-Solomon coding and decoding for extended RAID on GPUs  2,000 views

Parallel Quadtree Coding of Large-Scale Raster Geospatial Data on GPGPUs  2,000 views

OpenMP to GPGPU: a compiler framework for automatic translation and optimization  2,000 views

Fast and Robust Pyramid-based Image Processing  2,000 views

Solving the Boltzmann Equation on GPU  2,000 views

Poster: GPU-accelerated artificial neural network for QSAR modeling  2,000 views

Optimization of Molecular Dynamics Simulation Code and Applications to Biomolecular Systems  1,999 views

Medusa: A Parallel Graph Processing System on Graphics Processors  1,999 views

Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born  1,999 views

A Coarse Grain Reconfigurable Architecture for sequence alignment problems in bio-informatics  1,999 views

Action Spotting and Recognition Based on a Spatiotemporal Orientation Analysis  1,999 views

GPU-Accelerated Face Detection Algorithm  1,999 views

Hybrid OpenCL: Enhancing OpenCL for Distributed Processing  1,999 views

Efficiently Mapping the AES Encryption Algorithm on GPUs  1,999 views

On the numerical solution of chaotic dynamical systems using extend precision floating point arithmetic and very high order numerical methods  1,998 views

Fine-Granular Parallel EBCOT and Optimization with CUDA for Digital Cinema Image Compression  1,998 views

Motion Compensation and Reconstruction of H.264/AVC Video Bitstreams using the GPU  1,998 views

Parallel k-Means Image Segmentation Using Sort, Scan & Connected Components on a GPU  1,998 views

GPU Programming – Speeding Up the 3D Surface Generator VESTA  1,998 views

A GPU-Based Parallel Algorithm for Design Structure Matrix (DSM) Partition  1,998 views

Performance Optimization Using Partitioned SpMV on GPUs and Multicore CPUs  1,998 views

A CUDA Implementation of Independent Component Analysis in the Time-Frequency Domain  1,997 views

Real Time Background Subtraction On GPU Using CUDA  1,997 views

Stellar-mass black holes in star clusters: implications for gravitational wave radiation  1,997 views

Accelerating Workloads on FPGAs via OpenCL: A Case Study with OpenDwarfs  1,997 views

Accelerating High-Order Stencils on GPUs  1,997 views

Analyzing the CUDA Applications with its Latency and Bandwidth Tolerance  1,997 views

Parallel Zigzag Scanning and Huffman Coding for a GPU-based MPEG-2 Encoder  1,997 views

GPU accelerated Trotter-Suzuki solver for quantum spin dynamics  1,997 views

An MPI-CUDA Implementation for the Compression of DEM  1,997 views

GPU Computing for Parallel Local Search Metaheuristics  1,997 views

Fast multipole methods on a cluster of GPUs for the meshless simulation of turbulence  1,997 views

Torchnet: An Open-Source Platform for (Deep) Learning Research  1,997 views

GPU-based simulation of the long-range Potts model via parallel tempering  1,996 views

On the Parallelization of Integer Polynomial Multiplication  1,996 views

Case Studies in Acceleration of Heston’s Stochastic Volatility Financial Engineering Model: GPU, Cloud and FPGA Implementations  1,996 views

Tracking 3d Pose of Rigid Object by Sparse Template Matching  1,996 views

General purpose molecular dynamics simulations fully implemented on graphics processing units  1,996 views

Multi-GPU Parallel Computing and Task Scheduling under Virtualization  1,996 views

GPU Multiple Sequence Alignment Fourier-Space Cross-Correlation Alignment  1,996 views

Kite: Braided Parallelism for Heterogeneous Systems  1,995 views

Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes  1,995 views

LLVM to PTX Backend  1,995 views

Implementation and Optimization of Image Processing Algorithms on Embedded GPU  1,995 views

Teaching Parallel Programming Models on a Shallow-Water Code  1,995 views

GPU computing with OpenCL to model 2D elastic wave propagation: exploring memory usage  1,994 views

Reduced Vlasov-Maxwell simulations  1,994 views

The Architecture and Evolution of CPU-GPU Systems for General Purpose Computing  1,994 views

Fine-Tuning Vectorization and Memory Traffic on Intel Xeon Phi Coprocessors: LU Decomposition of Small Matrices  1,994 views

A performance study of general-purpose applications on graphics processors using CUDA  1,994 views

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations  1,993 views

From OpenCL to Gates: the FFT  1,993 views

Cryptanalysis of the Full AES Using GPU-Like Special-Purpose Hardware  1,993 views

Performance of a GPU-based Direct Summation Algorithm for Computation of Small Angle Scattering Profile  1,993 views

Accelerating the D3Q19 Lattice Boltzmann Model with OpenACC and MPI  1,992 views

Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline  1,992 views

Multi-GPGPU Cellular Automata Simulations using OpenACC  1,992 views

A Tensor Compiler for Unified Machine Learning Prediction Serving  1,992 views

Applying Object Oriented Design Patterns to CUDA based Pyramidal Image Blending – An Experience  1,992 views

Computing Spectral Transforms Used in Digital Logic on the GPU  1,992 views

Accelerating the Critical Line Algorithm for Portfolio Optimization Using GPUs  1,991 views

HashGraph – Scalable Hash Tables Using A Sparse Graph Data Structure  1,991 views

Comprehensive Analysis of High-Performance Computing Methods for Filtered Back-Projection  1,991 views

Handbook of open source tools  1,991 views

Long Timestep Molecular Dynamics on the Graphical Processing Unit  1,991 views

A New Architecture for Optimization Modeling Frameworks  1,991 views

Novel implementations of recursive discrete wavelet transform for real time computation with multicore systems on chip (SOC)  1,991 views

On-the-fly elimination of dynamic irregularities for GPU computing  1,991 views

Heterogeneous GPU and CPU acceleration of a finite volume compressible flow solver for multiblock structured grids  1,991 views

Parallel view-dependent refinement of progressive meshes  1,990 views

GPU Gems: Programming Techniques, Tips and Tricks for Real-Time Graphics  1,990 views

Performance and Scalability of GPU-Based Convolutional Neural Networks  1,990 views

Evolutionary Clustering on CUDA  1,990 views

NBODY6++GPU: Ready for the gravitational million-body problem  1,990 views

OpenCL-accelerated object classification in video streams using Spatial Pooler of Hierarchical Temporal Memory  1,990 views

Particle Filters on Multi-Core Processors  1,989 views

Hierarchical Partitioning Algorithm for Scientific Computing on Highly Heterogeneous CPU + GPU Clusters  1,989 views

Point Rendering in CUDA Path Tracer  1,989 views

CLort: High Throughput and Low Energy Network Intrusion Detection on IoT Devices with Embedded GPUs  1,989 views

Towards Good Practices for Very Deep Two-Stream ConvNets  1,989 views

Parallel technologies for solving system of the linear equations by the conjugate gradient method  1,989 views

PRAND: GPU accelerated parallel random number generation library: Using most reliable algorithms and applying parallelism of modern GPUs and CPUs  1,989 views

Vlasov on GPU (VOG Project)  1,988 views

Real-Time 3D Face Identification from a Depth Camera  1,988 views

 

Brief statistics for this page

Titles: 100

Total views: 199,568

 


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
