2402

Views of posts on hgpu.org

Real time ultrasound image denoising  1,660 views

Cloudlet-screen computing: A multi-core-based, cloud-computing-oriented, traditional-computing-compatible parallel computing Paradigm for the masses  1,660 views

Fast generating of a digital hologram using general-purpose computation on graphics processing units  1,660 views

A general tridiagonal solver for coprocessors: Adapting g-Spike for the Intel Xeon Phi  1,660 views

Parallel cross-layer optimization of high-level synthesis and physical design  1,659 views

Distributed Training of Deep Neuronal Networks: Theoretical and Practical Limits of Parallel Scalability  1,659 views

Kernelized Renyi distance for speaker recognition  1,659 views

Linear Algebra Algorithms for Hybrid Architectures with XKaapi  1,659 views

Performance Considerations When Using a Dedicated Ray Traversal Engine  1,659 views

High Performance Data Mining Using R on Heterogeneous Platforms  1,658 views

Accelerating Database Query Processing on OpenCL-based FPGAs  1,658 views

Fine-grained parallelization of a Vlasov-Poisson application on GPU  1,658 views

Direct Volume Editing  1,658 views

Automatic Termination Analysis for GPU Kernels  1,658 views

Optimizations and Performance of a Robotics Grasping Algorithm Described in Geometric Algebra  1,658 views

Believe it or Not! Multi-core CPUs Can Match GPU Performance for FLOP-intensive Application!  1,658 views

Experiments with Single Core, Multi-core, and GPU Based Computation of Cellular Automata  1,657 views

Exploiting Task-Parallelism on GPU Clusters via OmpSs and rCUDA Virtualization  1,657 views

Efficiency analysis of a physical problem: Different parallel computational approaches for a dynamical integrator evolution  1,657 views

The Future in Mobile Multicore Computing  1,657 views

Automatic Synthesis of Heterogeneous CPU-GPU Embedded Applications from a UML Profile  1,657 views

People detection method using graphics processing units for a mobile robot with an omnidirectional camera  1,657 views

Programming Challenges for the Implementation of Numerical Quadrature in Atomic Physics on FPGA and GPU Accelerators  1,657 views

A Personal Surround Environment: Projective Display with Correction for Display Surface Geometry and Extreme Lens Distortion  1,657 views

Vector Quantization: A Many-Core Approach  1,657 views

A novel stereo camera based collision warning system for automotive applications  1,657 views

Fast and Efficient FPGA-Based Feature Detection Employing the SURF Algorithm  1,657 views

Combustion Simulations Using Graphic Processing Units  1,656 views

Adaptive Data Migration in Load-Imbalanced HPC Applications  1,656 views

Evolving a CUDA kernel from an nVidia template  1,656 views

DeepfakeUCL: Deepfake Detection via Unsupervised Contrastive Learning  1,656 views

Frequent itemset mining on graphics processors  1,656 views

Top ten ways to make formal methods for HPC practical  1,656 views

LDetector: A Low Overhead Race Detector For GPU Programs  1,655 views

Graphics Processor Clusters for High Speed Backpropagation  1,655 views

Accelerating image registration of MRI by GPU-based parallel computation  1,655 views

Energy-aware Task Scheduling with Deadline Constraint in DVFS-enabled Heterogeneous Clusters  1,655 views

GPU-based surface oriented interslice directional interpolation for volume visualization  1,655 views

Vortex methods for incompressible flow simulations on the GPU  1,655 views

Parallel Computation for Discrete Orthogonal Moments of Images Using Graphic Processing Unit  1,654 views

Continuous Level of Detail on Graphics Hardware  1,654 views

Enabling active storage on parallel I/O software stacks  1,654 views

Visibility Sampling on GPU and Applications  1,654 views

Efficient Intranode Communication in GPU-Accelerated Systems  1,654 views

An approach of tool paths generation for CNC machining based on CUDA  1,654 views

GPU-accelerated hierarchical dense correspondence for real-time aerial video processing  1,654 views

Edge Stream Oriented LDPC Decoding  1,654 views

A general relativistic evolution code on CUDA architectures  1,654 views

When HLS Meets FPGA HBM: Benchmarking and Bandwidth Optimization  1,654 views

Faster Upper Body Pose Estimation Using CUDA  1,654 views

FC_ACCEL: Enabling Efficient, Low-Latency and Flexible Inference in DNN Fully Connected Layers, using Optimized Checkerboard Block matrix decomposition, fast scheduling, and a resource efficient 1D PE array with a custom HBM2 memory subsystem  1,653 views

Mixed-Tool Performance Analysis on Hybrid Multicore Architectures  1,653 views

Numerical solution of PDEs with hybrid and heterogeneous computing models  1,653 views

An error correction solver for linear systems: Evaluation of mixed precision implementations  1,653 views

Improving performance portability for GPU-specific OpenCL kernels on multi-core/many-core CPUs by analysis-based transformations  1,653 views

Solving knapsack problems on GPU  1,653 views

GPU-Enabled AI  1,653 views

Co-processor acceleration of an unmodified parallel solid mechanics code with FEASTGPU  1,653 views

An Adaptative Multi-GPU based Branch-and-Bound. A Case Study: the Flow-Shop Scheduling Problem  1,653 views

Realtime Simulation of Burning Solids on GPU with CUDA  1,653 views

A framework for parallel unstructured grid applications on GPUs  1,652 views

A Restructuring Algorithm for CUDA  1,652 views

Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications  1,652 views

Hera-JVM: a runtime system for heterogeneous multi-core architectures  1,652 views

GPU Rigid Skinning based on a Refined Skeletonization Method  1,652 views

Scalable Software Defined FM-radio receiver running on desktop computers  1,652 views

Comparing the Treecode with FMM on GPUs for vortex particle simulations of a leapfrogging vortex ring  1,652 views

Acceleration of a Locally Tuned Sine Non Linear Video Enhancement Algorithm on GPGPU  1,652 views

The case for VOS: the vector operating system  1,651 views

Physical modeling and high-performance GPU computing for characterization, interception, and disruption of hazardous near-Earth objects  1,651 views

Record Setting Software Implementation of DES Using CUDA  1,651 views

Ray Tracing on Graphics Hardware  1,651 views

Parallel Streaming Intra Prediction for Full HD H.264 Encoding  1,651 views

Collaborative diffusion: programming antiobjects  1,651 views

Online Adaptive Code Generation and Tuning  1,651 views

Accelerating Multi-Sensor Image Fusion Using Graphics Hardware  1,650 views

High Performance Computing on GPU for Electromagnetic Logging  1,650 views

Electric potential and field calculation of charged BEM triangles and rectangles by Gaussian cubature  1,650 views

Implicit Boundary Control of Vector Field Based Shape Deformations  1,650 views

Cryptanalysis of the McEliece Cryptosystem on GPGPUs  1,650 views

Hierarchical Line Integration  1,650 views

GPU-based implementation of a cerebellar spiking network model for realtime robot control  1,650 views

GPU-accelerated real-time 3D tracking for humanoid locomotion and stair climbing  1,650 views

Real-Time Animating and Rendering of Large Scale Grass Scenery on GPU  1,649 views

Visualization and Analysis of GPU Summer School Applicants and Participants  1,649 views

Balancing locality and concurrency: solving sparse triangular systems on GPUs  1,649 views

Accelerating Particle Swarm Algorithm with GPGPU  1,649 views

Using GPU to Accelerate Cache Simulation  1,649 views

Cg in Two Pages  1,649 views

Serpent encryption algorithm implementation on Compute Unified Device Architecture (CUDA)  1,649 views

Fast Code Exploration for Pipeline Processing in FPGA Accelerators  1,649 views

Lessons learned in a decade of research software engineering GPU applications  1,649 views

CRAC: Checkpoint-Restart Architecture for CUDA with Streams and UVM  1,648 views

Extensions and Limitations of the Neural GPU  1,648 views

Jitter analysis of PLL-generated clock propagation using Jitter Mitigation techniques with laser voltage probing  1,648 views

Parallel multi-level analytical global placement on graphics processing units  1,648 views

Curracurrong: a stream processing system for distributed environments  1,648 views

Practical Symmetric Key Cryptography on Modern Graphics Hardware  1,648 views

Enabling Traceability in MDE to Improve Performance of GPU Applications  1,648 views

D5.5.2 – Architectural Techniques to exploit SLACK & ACCURACY trade-offs  1,648 views

 

Brief statistics for this page

Titles: 100

Total views: 165356

 

Most viewed items:

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org