2402

Views of posts on hgpu.org

Characterization and Performance Analysis for 3D Benchmarks  2,218 views

OpenCL Accelerated Multi-GPU Cone-Beam Reconstruction  2,218 views

Implementing Parallel SMO to Train SVM on CUDA-Enabled Systems  2,218 views

Dense point trajectories by GPU-accelerated large displacement optical flow  2,218 views

Implementation of a Power Efficient Synthetic Aperture Radar Back Projection Algorithm on FPGAs Using OpenCL  2,217 views

Effects of Dynamic Voltage and Frequency Scaling on a K20 GPU  2,216 views

A Survey of General-Purpose Computation on Graphics Hardware  2,216 views

Using OpenCL for image analysis  2,216 views

CVPI: A Computer Vision Library For Mobile and Embedded Platforms  2,216 views

CUDA implementation of the solution of a system of linear equations arising in an hp-Finite Element code  2,216 views

linalg: Matrix Computations in Apache Spark  2,215 views

GPUstore: Harnessing GPU Computing for Storage Systems in the OS Kernel  2,215 views

SSLPV: subsurface light propagation volumes  2,215 views

Depth-First Search versus Jurema Search on GPU Branch-and-Bound Algorithms: a case study  2,214 views

Evaluating the Power of GPU Acceleration for IDW Interpolation Algorithm  2,214 views

DeepBE: Learning Deep Binary Encoding for Multi-Label Classification  2,214 views

Heterogeneous parallel algorithms for Computational Fluid Dynamics on unstructured meshes  2,214 views

Performance analysis of GPGPU and CPU On AES Encryption  2,213 views

Estimating GPU Speedups for Programs Without Writing a Single Line of GPU Code  2,213 views

Generalized Voronoi Diagram Computation on GPU  2,213 views

Parallelization of the Local Threshold and Boolean Function Based Edge Detection Algorithm Using CUDA  2,213 views

Direct GPU/FPGA Communication Via PCI Express  2,213 views

AeminiumGPU: An Intelligent Framework for GPU Programming  2,212 views

A capabilities-aware framework for using computational accelerators in data-intensive computing  2,212 views

Living Flows: Enhanced Exploration of Edge-Bundled Graphs Based on GPU-Intensive Edge Rendering  2,212 views

Design of an FPGA-Based FDTD Accelerator Using OpenCL  2,212 views

Parallel Solving Massive Linear Equations with CUDA  2,211 views

Automatic Performance Tuning of Pipeline Patterns for Heterogeneous Parallel Architectures  2,211 views

Direct N-body simulations of globular clusters: (I) Palomar 14  2,211 views

A Performance Comparison of Sort and Scan Libraries for GPUs  2,210 views

Regular Expression Matching on Graphics Hardware for Intrusion Detection  2,210 views

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures  2,210 views

Accelerating simultaneous algebraic reconstruction technique with motion compensation using CUDA-enabled GPU  2,209 views

On Performance of GPU and DSP Architectures for Computationally Intensive Applications  2,209 views

Deep neural networks for direct, featureless learning through observation: the case of 2d spin models  2,209 views

K-nearest neighbor search: Fast GPU-based implementations and application to high-dimensional feature matching  2,209 views

Accelerating Cost Aggregation for Real-Time Stereo Matching  2,209 views

High performance conjugate gradient solver on multi-GPU clusters using hypergraph partitioning  2,208 views

JCudaMP: OpenMP/Java on CUDA  2,208 views

Parallel AES algorithm for fast Data Encryption on GPU  2,207 views

Using graphics devices in reverse: GPU-based Image Processing and Computer Vision  2,207 views

GLoP: Enabling Massively Parallel Incident Response Through GPU Log Processing  2,207 views

Molecular Simulations using CUDA  2,206 views

Towards High Speed Aerial Tracking of Agile Targets  2,206 views

Android Malware Classification Using Parallelized Machine Learning Methods  2,206 views

Improving the usability of hierarchical representations for interactively labeling large image data sets  2,206 views

A Parallel Cellular Automaton Simulation Framework using CUDA  2,205 views

Bundled depth-map merging for multi-view stereo  2,205 views

Fully 3D list-mode time-of-flight PET image reconstruction on GPUs using CUDA  2,204 views

Analysis and Implementation of eSTREAM and SHA-3 Cryptographic Algorithms  2,204 views

Developing Performance-Portable Molecular Dynamics Kernels in OpenCL  2,203 views

Compiling a high-level language for GPUs: (via language support for architectures and compilers)  2,203 views

Efficient implementation of data flow graphs on multi-gpu clusters  2,203 views

Dataflow-driven GPU performance projection for multi-kernel transformations  2,202 views

GEMM on a GPU  2,202 views

Video coding on multicore graphics processors (GPUs)  2,202 views

An Adaptive Step Size GPU ODE Solver for Simulating the Electric Cardiac Activity  2,202 views

CAPRI: Prediction of Compaction-Adequacy for Handling Control-Divergence in GPGPU Architectures  2,202 views

Advanced Video Coding on CPUs and GPUs: Parallelization and RD Analysis  2,202 views

Asynchronous OpenCL/MPI numerical simulations of conservation laws  2,202 views

Interactive Ray Tracing with Data Locality Optimizations  2,202 views

TensorFlow Doing HPC  2,200 views

JCUDA: A Programmer-Friendly Interface for Accelerating Java Programs with CUDA  2,200 views

Parallel Implementation of Dynamic Programming Algorithm Using Graphics Processing Unit  2,200 views

Performance Analysis of Sobel Edge Filter on Heterogeneous System Using OpenCL  2,199 views

A GPU-based Simulation for Stochastic Computing  2,199 views

ICNet for Real-Time Semantic Segmentation on High-Resolution Images  2,199 views

Parallel local search on GPU and CPU with OpenCL  2,199 views

Offload Annotations: Bringing Heterogeneous Computing to Existing Libraries and Workloads  2,198 views

Efficient Pattern-Based Time Series Classification on GPU  2,198 views

Massive Exploration of Neural Machine Translation Architectures  2,198 views

All-pairs Shortest Path Algorithm based on MPI+CUDA Distributed Parallel Programming Model  2,198 views

3D visualization of astronomy data cubes using immersive displays  2,197 views

Unfolding and Shrinking Neural Machine Translation Ensembles  2,197 views

Improving 3D Lattice Boltzmann Method stencil with asynchronous transfers on many-core processors  2,197 views

Tiled Shading  2,197 views

Implementing Efficient, Portable Computations for Machine Learning  2,196 views

MSA-CUDA: Multiple Sequence Alignment on Graphics Processing Units with CUDA  2,196 views

Using GPU to exploit parallelism on cryptography  2,196 views

A Stochastic-based Optimized Schwarz Method for the Gravimetry Equations on GPU Clusters  2,196 views

Heterogeneous CPU/(GP) GPU Memory Hierarchy Analysis and Optimization  2,195 views

Fast tridiagonal solvers on the GPU  2,195 views

clSpMV: A Cross-Platform OpenCL SpMV Framework on GPUs  2,194 views

Analysis of Parallel Sorting Algorithms on Heterogeneous Processors with OpenCL  2,194 views

Fast parallel surface and solid voxelization on GPUs  2,194 views

Speeding Up Model Building for ECGA on CUDA Platform  2,194 views

A GaBP-GPU Algorithm of Solving Large-Scale Sparse Linear Systems  2,194 views

A Collective Knowledge workflow for collaborative research into multi-objective autotuning and machine learning techniques  2,193 views

Automated Generation of OpenCL Programs Based on Algebra-Algorithmic Approach  2,193 views

Computational advances in gravitational microlensing: a comparison of CPU, GPU, and parallel, large data codes  2,193 views

A New Digital Repository for Hyperspectral Imagery with Unmixing-Based Retrieval Functionality Implemented on GPUs  2,193 views

Voxelized Minkowski sum computation on the GPU with robust culling  2,193 views

Parallel Computing Methods For Particle Accelerator Design  2,193 views

A Machine-Learning Framework for Design for Manufacturability  2,192 views

GPU Accelerated Discrete Element Method (DEM) Molecular Dynamics for Conservative, Faceted Particle Simulations  2,192 views

Real-Time Grasp Detection Using Convolutional Neural Networks  2,192 views

Formal Semantics of Heterogeneous CUDA-C: A Modular Approach with Applications  2,192 views

Parallelization of DIRA and CTmod using OpenMP and OpenCL  2,192 views

Dynamical simulations of extrasolar planetary systems with debris disks using a GPU accelerated N-body code  2,191 views

Efficient Energyminimization in Finite-Difference Micromagnetics: Speeding up Hysteresis Computations  2,191 views

 

Brief statistics for this page

Titles: 100

Total views: 220415

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: