1607

Posts

Nov, 16

High performance conjugate gradient solver on multi-GPU clusters using hypergraph partitioning

Motivated by high computation power and low price per performance ratio of GPUs, GPU accelerated clusters are being built for high performance scientific computing. In this work, we propose a scalable implementation of a Conjugate Gradient (CG) solver for unstructured matrices on a GPU-extended cluster, where each cluster node has multiple GPUs. Basic computations of […]
Nov, 16

GPU-Accelerated Robotic Intra-operative Laparoscopic 3D Reconstruction

In this paper we present a real-time intra-operative reconstruction system for laparoscopic surgery. The system builds upon a surgical robot for laparoscopy that has previously been developed by us. Such a system is valuable for surgeons, who can get a three dimensional visualization of the scene online, without having to postprocess data. We gain a […]
Nov, 13

A configurable simulation environment for the efficient simulation of large-scale spiking neural networks on graphics processors

Neural network simulators that take into account the spiking behavior of neurons are useful for studying brain mechanisms and for various neural engineering applications. Spiking Neural Network (SNN) simulators have been traditionally simulated on large-scale clusters, super-computers, or on dedicated hardware architectures. Alternatively, Compute Unified Device Architecture (CUDA) Graphics Processing Units (GPUs) can provide a […]
Nov, 13

Graphic processors to speed-up simulations for the design of high performance solar receptors

Graphics Processing Units (GPUs) are now powerful and flexible systems adapted and used for other purposes than graphics calculations (General Purpose computation on GPU — GPGPU). We present here a prototype to be integrated into simulation codes that estimate temperature, velocity and pressure to design next generations of solar receptors. Such codes will delegate to […]
Nov, 13

High Performance Direct Gravitational N-body Simulations on Graphics Processing Unit I: An implementation in Cg

We present the results of gravitational direct $N$-body simulations using the commercial graphics processing units (GPU) NVIDIA Quadro FX1400 and GeForce 8800GTX, and compare the results with GRAPE-6Af special purpose hardware. The force evaluation of the $N$-body problem was implemented in Cg using the GPU directly to speed-up the calculations. The integration of the equations […]
Nov, 13

A multiphysics and multiscale software environment for modeling astrophysical systems

We present MUSE, a software framework for combining existing computational tools for different astrophysical domains into a single multiphysics, multiscale application. MUSE facilitates the coupling of existing codes written in different languages by providing inter-language tools and by specifying an interface between each module and the framework that represents a balance between generality and computational […]
Nov, 13

Time dependent simulation of the Driven Lid Cavity at High Reynolds Number

In this work, numerical solutions of the two dimensional time dependent incompressible flow, in a driven cavity at high Reynolds number Re, are presented. At high Re, there is a controversy. Some studies predicted that the flow is steady, others found time dependent non-steady flow, either periodic or aperiodic. In this study, the driven lid […]
Nov, 13

Fast GPU Implementation of Sparse Signal Recovery from Random Projections

We consider the problem of sparse signal recovery from a small number of random projections (measurements). This is a well known NP-hard to solve combinatorial optimization problem. A frequently used approach is based on greedy iterative procedures, such as the Matching Pursuit (MP) algorithm. Here, we discuss a fast GPU implementation of the MP algorithm, […]
Nov, 13

Developing and Deploying Advanced Algorithms to Novel Supercomputing Hardware

The objective of our research is to demonstrate the practical usage and orders of magnitude speedup of real-world applications by using alternative technologies to support high performance computing. Currently, the main barrier to the widespread adoption of this technology is the lack of development tools and case studies that typically impede non-specialists that might otherwise […]
Nov, 13

Fast recursive filters for simulating nonlinear dynamic systems

A fast and accurate computational scheme for simulating nonlinear dynamic systems is presented. The scheme assumes that the system can be represented by a combination of components of only two different types: first-order low-pass filters and static nonlinearities. The parameters of these filters and nonlinearities may depend on system variables, and the topology of the […]
Nov, 13

Parallel Algorithm for Solving Kepler’s Equation on Graphics Processing Units: Application to Analysis of Doppler Exoplanet Searches

We present the results of a highly parallel Kepler equation solver using the Graphics Processing Unit (GPU) on a commercial nVidia GeForce 280GTX and the “Compute Unified Device Architecture” programming environment. We apply this to evaluate a goodness-of-fit statistic (e.g., chi^2) for Doppler observations of stars potentially harboring multiple planetary companions (assuming negligible planet-planet interactions). […]
Nov, 13

Parallel GPU Implementation of Iterative PCA Algorithms

Principal component analysis (PCA) is a key statistical technique for multivariate data analysis. For large data sets the common approach to PCA computation is based on the standard NIPALS-PCA algorithm, which unfortunately suffers from loss of orthogonality, and therefore its applicability is usually limited to the estimation of the first few components. Here we present […]

Recent source codes

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org