
Posts

Apr, 25

A new way in few-body scattering calculations: discretized Faddeev equations solved on GPU

A new approach towards very fast and economical few-body scattering calculations is described. The general method is realized in three steps: (i) reformulation of the scattering equations using a convenient analytical form for the channel resolvent operator; (ii) complete few-body continuum discretization and projection of all operators and wave functions onto the $L_2$ type […]
Apr, 25

Integrating multi-threading and accelerators into DUNE-ISTL

A major challenge in PDE software is the balance between user-level flexibility and performance on heterogeneous hardware. We discuss our ideas on how this challenge can be tackled, exemplarily for the DUNE framework and in particular its linear algebra and solver components. We demonstrate how the former MPI-only implementation is modified to support MPI+[CPU/GPU] threading […]
Apr, 25

One weird trick for parallelizing convolutional neural networks

I present a new way to parallelize the training of convolutional neural networks across multiple GPUs. The method scales significantly better than all alternatives when applied to modern convolutional neural networks.
Apr, 25

Neural Decoding using a Parallel Sequential Monte Carlo method on Point Processes with Ensemble Effect

Sequential Monte Carlo estimation on point processes has been successfully applied to predict movement from neural activity. However, this method has some issues, such as an oversimplified tuning model and high computational complexity. In this paper, we attempt to address these issues and improve its decoding performance. Firstly, a […]
Apr, 24

Automating a Labour Performance Measurement and Risk Assessment: An Evaluation of Methods for a Computer Vision based System

This thesis brings together productivity and risk assessments through innovative design, development and evaluation of a unique system for retrieving and analysing data. In the past, although the link between them is well-documented, these assessments have largely been dealt with as separate antagonist entities. A broad evaluation of the existing traditional and technological support systems […]
Apr, 24

Parallel Computational Intelligence-Based Multi-Camera Surveillance System

In this work, we present a multi-camera surveillance system based on the use of self-organizing neural networks to represent events on video. The system processes several tasks in parallel using GPUs (graphic processor units). It addresses multiple vision tasks at various levels, such as segmentation, representation or characterization, analysis and monitoring of the movement. These […]
Apr, 24

Discrete Planning Unit Look-ahead Velocity Control Strategy and Parallelization Research based on GPU

High velocity and high accuracy are the development directions of numerical control technology. During the machining of complicated curves and surfaces, CAD/CAM software generates a massive number of micro-segments. These micro-segments are then input into the numerical control system (CNC) for velocity planning and high-velocity interpolation. This whole procedure is the core algorithm of CNC. […]
Apr, 24

A GPU Framework for Sparse Matrix Vector Multiplication

The hardware and software evolutions related to Graphics Processing Units (GPUs), for general purpose computations, have changed the way the parallel programming issues are addressed. Many applications are being ported onto GPU for achieving performance gain. The GPU execution time is continuously optimized by the GPU programmers while optimizing pre-GPU computation overheads attracted the research […]
Apr, 24

Galerkin-based multi-scale time integration for nonlinear structural dynamics

This paper deals with a Galerkin-based multi-scale time integration of a viscoelastic rope model. Using Hamilton’s dynamical formulation, Newton’s equation of motion as a second-order partial differential equation is transformed into two coupled first order partial differential equations in time. The considered finite viscoelastic deformations are described by means of a deformation-like internal variable determined […]
Apr, 22

Faster Maliciously Secure Two-Party Computation Using the GPU

We present a new protocol for maliciously secure two-party computation based on cut-and-choose of garbled circuits using the recent idea of "forge-and-loose" which eliminates around a factor 3 of garbled circuits that need to be constructed and evaluated. Our protocol introduces a new way to realize the "forge-and-loose" approach which avoids an auxiliary secure two-party computation […]
Apr, 22

Reflector Antenna Analysis using Physical Optics on Graphics Processing Units

The Physical Optics approximation is a widely used asymptotic method for calculating the scattering from electrically large bodies. It requires significant computational work and little memory, and is thus well suited for application on a Graphics Processing Unit. Here, we investigate the performance of an implementation and demonstrate that while there are some implementational pitfalls, […]
Apr, 22

Parallel In-Memory Distance Threshold Queries on Trajectory Databases

Spatiotemporal databases are utilized in many applications to store the trajectories of moving objects. In this context, we focus on in-memory distance threshold queries that return all trajectories found within a distance d of a fixed or moving object over a time interval. We present performance results for a sequential query processing algorithm that uses […]

* * *


HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors
