Posts

Nov, 19

CUDA-enabled Optimisation of Technical Analysis Parameters

The optimisation of Technical Trading parameters is a computationally intensive exercise. Models comprising a modest number of Technical Indicators require many thousands of simulations to be executed over a sample period of data, with the best performing sets of parameters employed to generate future trading signals. The purpose of this research is to investigate the […]
Nov, 19

Modern GPGPU Frameworks and their Application to the Physical Core of the ASUCA Weather Prediction Model

One of today’s biggest challenges in the field of high performance computing is the efficient exploitation of the heavily increasing parallelism on socket level, especially when both CPU and GPU resources are to be applied – a challenge becoming very real for the physical processes of ASUCA. ASUCA is the Japan Meteorological Agency’s next-generation weather […]
Nov, 19

Parallel Search of k-Nearest Neighbors with Synchronous Operations

We present a new study of parallel algorithms for locating k-nearest neighbors (kNN) of each single query in a high dimensional (feature) space on a many-core processor or accelerator that favors synchronous operations, such as on a graphics processing unit. Exploiting the intimate relationships between two primitive operations, select and sort, we introduce a cohort […]
Nov, 19

Criticality of the XY model in complex topologies

The critical behavior of the O(2) model on dilute Levy graphs built on a 2D square lattice is analyzed. Different qualitative cases are probed, varying the exponent rho governing the dependence on the distance of the connectivity probability distribution. The mean-field regime, as well as the long-range and short-range non-mean-field regimes are investigated by means […]
Nov, 19

Accelerated molecular dynamics force evaluation on graphics processing units for thermal conductivity calculations

In this paper, we develop a highly efficient molecular dynamics code fully implemented on graphics processing units for thermal conductivity calculations using the Green-Kubo formula. We compare two different schemes for force evaluation, a previously used thread-scheme where a single thread is used for one particle and each thread calculates the total force for the […]
Nov, 18

Auto-tunable GPU BLAS (thesis)

In this paper, we present our implementation of an auto-tuning system, written in C++, which incorporates the use of OpenCL kernels. We deploy this approach on different GPU architectures and evaluate its performance. Our main focus is to easily generate tuned code that would otherwise require a large amount of empirical testing, […]
Nov, 18

Facial Recognition Using Neural Networks over GPGPU

This article introduces a parallel neural network approach implemented on Graphics Processing Units (GPUs) to solve a facial recognition problem, which consists of deciding which direction a person’s face is pointing in a given image. The proposed method uses the parallel capabilities of GPUs to train and evaluate a neural network used […]
Nov, 18

Real-Time Hair Simulation and Rendering with OpenCL and OpenGL

In computer graphics, human hair simulation represents a challenging issue, and is still an active research subject nowadays. The problem comprises two complementary dimensions: the physical simulation and the rendering. While both aspects must be treated individually for each strand, they must also be treated globally due to interactions between hair strands. Because of the […]
Nov, 18

Using Graphics Processing Units to Parallelize the FDK Algorithm for Tomographic Image Reconstruction

The paper presents the implementation of a parallel version of the FDK (Feldkamp, Davis and Kress) algorithm using graphics processing units. Some elements of the computed tomographic scan and the FDK algorithm are briefly discussed, and some ideas about GPUs (Graphics Processing Units) and their use in general-purpose computing are presented. The paper shows a computational implementation […]
Nov, 18

GPU-Based Airway Tree Segmentation and Centerline Extraction

Lung cancer is one of the deadliest and most common types of cancer in Norway. Early and precise diagnosis is crucial for improving the survival rate. Diagnosis is often done by extracting a tissue sample in the lung through the mouth and throat. It is difficult to navigate to the tissue because of the complexity of the airways […]
Nov, 18

High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation

The utilization of Graphical Processing Units (GPUs) for the element-by-element (EbE) finite element method (FEM) is demonstrated. EbE FEM is a long known technique, by which a conjugate gradient (CG) type iterative solution scheme can be entirely decomposed into computations on the element level, i.e., without assembling the global system matrix. In our implementation, NVIDIA’s […]
Nov, 18

Lattice Boltzmann Simulations on a GPU: An optimization approach using C++ AMP

The lattice Boltzmann method has become a valuable tool in computational fluid dynamics, in part because of the simplicity of its coding. To maximize the performance potential of today’s computers, code has to be optimized for parallel execution. In order to achieve parallel execution of the lattice Boltzmann method, the […]

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors