Posts
Apr, 26
A method for speeding up beam-tracing simulation using thread-level parallelization
In recent years, the computational power of modern processors has been increasing mainly because of the increase in the number of processor cores. Computationally intensive applications can gain from this trend only if they employ parallelism, such as thread-level parallelization. Geometric simulations can employ thread-level parallelization because the main part of a geometric simulation can […]
Apr, 26
Parallel Variable Distribution Algorithm for Constrained Optimization with Nonmonotone Technique
A modified parallel variable distribution (PVD) algorithm for solving large-scale constrained optimization problems is developed, which modifies quadratic subproblem QPl at each iteration instead of the QPl of the SQP-type PVD algorithm proposed by C. A. Sagastizabal and M. V. Solodov in 2002. The algorithm can circumvent the difficulties associated with the possible inconsistency of […]
Apr, 26
Design and Performance Analysis of Parallel Processing of SRTP Packets
Encryption of real-time multimedia data transfers is one of the tasks for telecommunication infrastructure which should be considered in order to reach essential level of security. Execution time of ciphering algorithm could play fundamental role in delay of the packets, therefore, it provides interesting challenge in terms of optimization methods. This work focuses on parallelization […]
Apr, 26
Accelerating Population Balance Model-based particulate process simulations via parallel computing
The use of Population Balance Models (PBM) for simulating dynamics of particulate systems are inevitably limited at some point by the demands they place on computational resources. PBMs are widely used to describe the time evolutions and distributions of many industrial particulate processes, and its efficient and quick simulation would certainly be beneficial for process […]
Apr, 26
Introduction into CUDA architecture of parallel computing webinar (in Russian)
The webinar will present CUDA – parallel computing architecture for graphics accelerators NVIDIA. It will focus on practical aspects of the use of technology. The webinar will be conducted by the staff of the company APC, supported by NVIDIA. The webinar will be held Thursday, May 16, 2013 at 11:00 am Moscow time. Duration – 1 […]
Apr, 25
Numerical Modeling of Atmospheric Vortices
Tornadoes such as Joplin, Mo (2011), Spencer, SD (1998), and Dallas, Texas (1957) induce extremely high wind velocities that devastate structures and lift off large objects in their path. Typically, a tornado takes up to an hour to materialize in the form of a narrow axisymmetric vortex and sustains that structure for 10-20 minutes after […]
Apr, 25
Modeling the spatio-temporal evolution of fracture networks and fluid-rock interactions in GPU: Applications to lithospheric geodynamics
In this thesis, I present the theory and modeling of poro-elasto-plastic rheology coupled to a non-linear diffusion equation with a step increase in permeability at the onset of slip. This theoretical model is implemented in the graphic processing unit (GPU) architecture and programmed using the nVidia CUDA programming language. The numerical models are benchmarked by […]
Apr, 25
High Performance Computing with GPUs
A project was undertaken to improve the performance of a traditional CPU-based sequential program by modifying it for parallel execution in a GPU environment. A speedup of at least 1.5x and the preservation of the program’s accuracy and integrity were outlined as the two key goals of the project. Deal.II, a differential applications analysis library, […]
Apr, 25
Directive-based Approach to Heterogeneous Computing
The main result of my Ph.D dissertation was accULL, an implementation of the OpenACC standard. This implementation is based on two pieces of software I designed, YaCF (Yet Another Compiler Framework) and Frangollo. YaCF is basically a Python StS toolkit, heavily based on the pycparser project. It uses the C99 Frontend with some extensions to […]
Apr, 25
Faster Upper Body Pose Estimation and Recognition Using CUDA
Image processing techniques can be very time consuming when applied linearly on the Central Processing Unit (CPU). Many applications require processing to take place in real-time. The Upper Body Pose Estimation and Recognition system developed by Achmed and Connan has shown to be 88% accurate, but operates at less than real-time on the CPU. This […]
Apr, 25
3rd International Conference on High Performance Computing, HPC-UA 2013
Prospective authors are invited to submit extended abstracts, full papers or poster presentations on topics related to: HPC systems developing, benchmarking and administrating. Programming of HPC systems, scalability of algorithms and programs, heterogeneous programming with GP-GPU, FPGA, and other accelerators. HPC in GRIDs, Clouds, and distributed computing systems. Application of HPC in science and industry. […]
Apr, 23
Reduce, Reuse, Recycle (R^3): a Design Methodology for Sparse Matrix Vector Multiplication on Reconfigurable Platforms
Sparse Matrix Vector Multiplication (SpMV) is an important computational kernel in many scientific computing applications. Pipelining multiply-accumulate operations shifts SpMV from a computationally bounded kernel to an I/O bounded kernel. In this paper, we propose a design methodology and hardware architecture for SpMV that seeks to utilize system memory bandwidth as efficiently as possible, by […]

