Views of posts on hgpu.org

SOMGPU: An unsupervised pattern classifier on Graphical Processing Unit  1,988 views

A Survey of FPGA Based Deep Learning Accelerators: Challenges and Opportunities  1,988 views

Accelerating Lagrangian Particle Dispersion in the Atmosphere with OpenCL across Multiple Platforms  1,988 views

FPGA Accelerated Simulation of Biologically Plausible Spiking Neural Networks  1,988 views

Implementation and Evaluation of Recurrence Equation Solvers on GPGPU systems using Rearrangement of Array Configurations  1,988 views

EFFEX: an embedded processor for computer vision based feature extraction  1,988 views

GPU Accelerated Face Detection  1,988 views

A Parallelizing Matlab Compiler Framework and Run time for Heterogeneous Systems  1,988 views

Towards a Portable and Future-proof Particle-in-Cell Plasma Physics Code  1,987 views

Graphics Processing Unit Utilization in Circuit Simulation  1,987 views

Automatic Parallelization of Tiled Loop Nests with Enhanced Fine-Grained Parallelism on GPUs  1,987 views

A Fast Batched Cholesky Factorization on a GPU  1,987 views

Yang-Mills lattice on CUDA  1,987 views

A GPU implementation for improved granular simulations with LAMMPS  1,987 views

A New Parallel Method of Smith-Waterman Algorithm on a Heterogeneous Platform  1,987 views

Solving quadratic assignment problems by genetic algorithms with GPU computation: a case study  1,987 views

Analyzing CUDA’s Compiler through the Visualization of Decoded GPU Binaries  1,987 views

Is GPGPU CCL worth it? A performance comparison between some GPU and CPU algorithms for solving connected components labeling on binary images  1,987 views

On GPU Fourier Transformations  1,987 views

G-NetMon: A GPU-accelerated Network Performance Monitoring System  1,986 views

GPU-based ultra-fast direct aperture optimization for online adaptive radiation therapy  1,986 views

The future of microprocessors  1,986 views

Study on GPU-accelerated extraction of interconnects parasitic using CUDA and MPI  1,986 views

OpenCL JIT Compilation for Dynamic Programming Languages  1,986 views

Distributed learning of CNNs on heterogeneous CPU/GPU architectures  1,986 views

High performance bioinformatics and computational biology on general-purpose graphics processing units  1,985 views

Displacement Mapping on the GPU – State of the Art  1,985 views

Improving Numerical Accuracy for Non-Negative Matrix Multiplication on GPUs using Recursive Algorithms  1,985 views

AXC: A new format to perform the SpMV oriented to Intel Xeon Phi architecture in OpenCL  1,985 views

Overlapping Computation and Communication for Advection on Hybrid Parallel Computers  1,985 views

Fast LZW compression using a GPU  1,984 views

CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions  1,984 views

Validation of the PyGBe code for Poisson-Boltzmann equation with boundary element methods  1,984 views

Implementation and Evaluation of Scientific Simulations on High Performance Computing Architectures  1,984 views

Parallel Neutrino Triggers using GPUs for an underwater telescope  1,984 views

High-Performance Neural Networks for Visual Object Classification  1,984 views

A Way For Accelerating The DNA Sequence Reconstruction Problem By CUDA  1,984 views

Dual-RBF based surface reconstruction  1,984 views

Many-Core Algorithms for Combinatorial Optimization  1,984 views

Streaming Data from HDD to GPUs for Sustained Peak Performance  1,984 views

GPU Join Processing Revisited  1,983 views

Implementing QR Factorization Updating Algorithms on GPUs  1,983 views

General Purpose Computing on Low-Power Embedded GPUs: Has It Come of Age?  1,983 views

Coupling between Meshless FEM Modeling and Rendering on GPU for Real-time Physically-based Volumetric Deformation  1,983 views

A History-Based Performance Prediction Model with Profile Data Classification for Automatic Task Allocation in Heterogeneous Computing Systems  1,983 views

An Automatic Input-Sensitive Approach for Heterogeneous Task Partitioning  1,983 views

SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing  1,983 views

OMP2HMPP: HMPP Source Code Generation from Programs with Pragma Extensions  1,983 views

GPU histogram computation  1,982 views

Accelerated rescaling of single Monte Carlo simulation runs with the Graphics Processing Unit (GPU)  1,982 views

Performance models for CUDA streams on NVIDIA GeForce series  1,982 views

GPU-accelerated Bernstein-Bezier discontinuous Galerkin methods for wave problems  1,982 views

Liszt: A Domain Specific Language for Building Portable Mesh-based PDE Solvers  1,982 views

Iterative CT Reconstruction on the GPU  1,982 views

Effective Extensible Programming: Unleashing Julia on GPUs  1,982 views

An implementation for quad-tree based solid object coloring using CUDA  1,982 views

SparkJNI: A Reference Design for a Heterogeneous Apache Spark Framework  1,982 views

Accelerator: using data parallelism to program GPUs for general-purpose uses  1,982 views

Multilevel Tile Load Map on Massive Terrain Visualization  1,981 views

GPU-Disasm: A GPU-based x86 Disassembler  1,981 views

PTask: Operating System Abstractions To Manage GPUs as Compute Devices  1,981 views

Parallel Spectral Graph Partitioning on CUDA  1,980 views

The Potential of the Intel Xeon Phi for Supervised Deep Learning  1,980 views

Diffusion Curves: A Vector Representation for Smooth-Shaded Images  1,979 views

The GASPI API specification and its implementation GPI 2.0  1,979 views

Accelerating Boosting-based Face Detection on GPUs  1,979 views

A Fast Mixed-Band Lifting Wavelet Transform on the GPU  1,979 views

Strain Visualization of Ultra Sound Signals Processed by General Purpose Graphic Process Unit  1,979 views

Efficient GPU implementation of the integral histogram  1,979 views

An efficient implementation of Smith Waterman algorithm on GPU using CUDA, for massively parallel scanning of sequence databases  1,979 views

Survey and Benchmarking of Machine Learning Accelerators  1,978 views

Lightweight Modular Staging and Embedded Compilers: Abstraction Without Regret for High-Level High-Performance Programming  1,978 views

Realtime Two-Way Coupling of Meshless Fluids and Nonlinear FEM  1,978 views

Fast 3D Structure Localization in Medical Volumes using CUDA-enabled GPUs  1,978 views

Fast Speaker Diarization Using a Specialization Framework for Gaussian Mixture Model Training  1,978 views

A Performance Analysis Framework for Optimizing OpenCL Applications on FPGAs  1,978 views

GPU Simulation and Rendering of Volumetric Effects for Computer Games and Virtual Environments  1,977 views

Optimization Techniques for Mapping Algorithms and Applications onto CUDA GPU Platforms and CPU-GPU Heterogeneous Platforms  1,977 views

A Qualitative Comparison Study Between Common GPGPU Frameworks  1,977 views

A GPU Algorithm for 3D Convex Hull  1,977 views

Implementation of 3D FFTs Across Multiple GPUs in Shared Memory Environments  1,977 views

Vectorized Higher Order Finite Difference Kernels  1,977 views

Achieving high-performance with a sparse direct solver on Intel KNL  1,977 views

Accelerating cellular automata simulations using AVX and CUDA  1,977 views

GPU Accelerated Parallel Occupancy Voxel Based ICP for Position Tracking  1,977 views

Decoding with Finite-State Transducers on GPUs  1,977 views

A GPU-based Algorithm-specific Optimization for High-performance Background Subtraction  1,977 views

Visual Simulation of Heat Shimmering and Mirage  1,976 views

PATUS: A Code Generation and Autotuning Framework For Parallel Iterative Stencil Computations on Modern Microarchitectures  1,976 views

GPU Computing for Particle Tracking  1,976 views

GRATER: An Approximation Workflow for Exploiting Data-Level Parallelism in FPGA Acceleration  1,976 views

Parallelization of Hierarchical Text Clustering on Multi-core CUDA Architecture  1,976 views

Database Operation Development on the GPU  1,976 views

Implementation of Smith-Waterman algorithm in OpenCL for GPUs  1,976 views

Comparison based sorting for systems with multiple GPUs  1,976 views

Comparison of Different Parallel Implementations of the 2+1-Dimensional KPZ Model and the 3-Dimensional KMC Model  1,975 views

Texture Cache Approximation on GPUs  1,975 views

Simulation of reaction-diffusion processes in three dimensions using CUDA  1,975 views

Learning to Detect Roads in High-Resolution Aerial Images  1,975 views

Exploring Traditional and Emerging Parallel Programming Models using a Proxy Application  1,975 views

 

Brief statistics for this page

Titles: 100

Total views: 198,180

 

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
