9375

Posts

Apr, 27

Efficient Computation of the Kleene Star in Max-Plus Algebra using a CUDA GPU

This research aims to accelerate the computation of the Kleene star in max-plus algebra using CUDA technology on graphics processing units (GPUs). The target module is the Kleene star of a weighted adjacency matrix for directed acyclic graphs (DAGs), which plays an essential role in calculating the earliest and/or latest schedule for a class of […]
Apr, 27

Modeling and Optimization of Parallel Matrix-based Computations on GPU

As graphics processing units (GPUs) are continually being utilized as coprocessors, the demand for optimally utilizing them for various applications continues to grow. This work narrows the gap between programmers and minimum execution time for matrix-based computations on a GPU. To minimize execution time, computation and communication time must be considered. For computation, the placement […]
Apr, 27

Orchestrated Scheduling and Prefetching for GPGPUs

In this paper, we present techniques that coordinate the thread scheduling and prefetching decisions in a General Purpose Graphics Processing Unit (GPGPU) architecture to better tolerate long memory latencies. We demonstrate that existing warp scheduling policies in GPGPU architectures are unable to effectively incorporate data prefetching. The main reason is that they schedule consecutive warps, […]
Apr, 27

H.264 Parallel Optimization on Graphics Processors

Multimedia applications are present in most mobile hand-held devices. The H.264 standard currently dominates the video compression world. H.264 has high computational complexity, requiring a large amount of processing resources. Many techniques have emerged that optimize H.264 using parallelization on multicore systems, ranging from groups of pictures down to the smallest block of pixels. We propose a […]
Apr, 26

Using High Performance Computing for Optimizing Credit Risk Calculation

The volume of bank data calculations is increasing each year at an extraordinary scale, and with that, new forms of computation are needed. High performance computing is a very attractive field for optimizing such bank calculations and can give promising results. This paper shows an implementation of a known model for assessing the credit risk of a […]
Apr, 26

A method for speeding up beam-tracing simulation using thread-level parallelization

In recent years, the computational power of modern processors has been increasing mainly because of the increase in the number of processor cores. Computationally intensive applications can gain from this trend only if they employ parallelism, such as thread-level parallelization. Geometric simulations can employ thread-level parallelization because the main part of a geometric simulation can […]
Apr, 26

Parallel Variable Distribution Algorithm for Constrained Optimization with Nonmonotone Technique

A modified parallel variable distribution (PVD) algorithm for solving large-scale constrained optimization problems is developed, which modifies quadratic subproblem QPl at each iteration instead of the QPl of the SQP-type PVD algorithm proposed by C. A. Sagastizabal and M. V. Solodov in 2002. The algorithm can circumvent the difficulties associated with the possible inconsistency of […]
Apr, 26

Design and Performance Analysis of Parallel Processing of SRTP Packets

Encryption of real-time multimedia data transfers is one of the tasks for telecommunication infrastructure that should be considered in order to reach an essential level of security. The execution time of a ciphering algorithm could play a fundamental role in packet delay; it therefore presents an interesting challenge in terms of optimization methods. This work focuses on parallelization […]
Apr, 26

Accelerating Population Balance Model-based particulate process simulations via parallel computing

The use of Population Balance Models (PBMs) for simulating the dynamics of particulate systems is inevitably limited at some point by the demands they place on computational resources. PBMs are widely used to describe the time evolutions and distributions of many industrial particulate processes, and their efficient and quick simulation would certainly be beneficial for process […]
Apr, 26

Introduction into CUDA architecture of parallel computing webinar (in Russian)

The webinar will present CUDA, NVIDIA's parallel computing architecture for graphics accelerators. It will focus on practical aspects of using the technology. The webinar will be conducted by staff of the company APC, supported by NVIDIA. The webinar will be held Thursday, May 16, 2013 at 11:00 am Moscow time. Duration – 1 […]
Apr, 25

Numerical Modeling of Atmospheric Vortices

Tornadoes such as Joplin, MO (2011), Spencer, SD (1998), and Dallas, Texas (1957) induce extremely high wind velocities that devastate structures and lift off large objects in their path. Typically, a tornado takes up to an hour to materialize in the form of a narrow axisymmetric vortex and sustains that structure for 10-20 minutes after […]
Apr, 25

Modeling the spatio-temporal evolution of fracture networks and fluid-rock interactions in GPU: Applications to lithospheric geodynamics

In this thesis, I present the theory and modeling of poro-elasto-plastic rheology coupled to a non-linear diffusion equation with a step increase in permeability at the onset of slip. This theoretical model is implemented on the graphics processing unit (GPU) architecture and programmed using the nVidia CUDA programming language. The numerical models are benchmarked by […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
