Posts
Feb, 28
A high performance computing for AOM stock trading order matching using GPU
Matching trading orders in financial markets is very challenging because of the speed at which requests arrive. In this paper, GPU technology and CUDA programming are explored as a potential way to accelerate this task. The trading method in Automatic Order Matching (AOM) of the Stock Exchange of Thailand (SET) […]
Feb, 28
Performance Assessment of A Multi-block Incompressible Navier-Stokes Solver using Directive-based GPU Programming in a Cluster Environment
OpenACC, a directive-based GPU programming standard, is emerging as a promising technology for massively-parallel accelerators, such as General-purpose computing on graphics processing units (GPGPU), Accelerated Processing Unit (APU) and Many Integrated Core Architecture (MIC). The heterogeneous nature of these accelerators calls for careful designs of parallel algorithms and data management, which imposes a great hurdle […]
Feb, 28
Heterogenous Acceleration for Linear Algebra in Multi-Coprocessor Environments
We present an efficient and scalable programming model for the development of linear algebra in heterogeneous multi-coprocessor environments. The model incorporates some of the current best design and implementation practices for the heterogeneous acceleration of dense linear algebra (DLA). Examples are given as the basis for solving linear systems’ algorithms – the LU, QR, and […]
Feb, 28
Scheduling data flow program in xkaapi: A new affinity based Algorithm for Heterogeneous Architectures
Efficient implementations of parallel applications on heterogeneous hybrid architectures require a careful balance between computations and communications with accelerator devices. Even if most of the communication time can be overlapped by computations, it is essential to reduce the total volume of communicated data. The literature therefore abounds with ad-hoc methods to reach that balance, but […]
Feb, 27
2014 3rd International Conference on Knowledge Discovery, ICKD 2014
All papers of ICKD 2014 will be published in the International Journal of Computer Theory and Engineering (IJCTE)(ISSN: 1793-8201), and will be indexed by Electronic Journals Library, EBSCO, Engineering & Technology Digital Library, Google Scholar, INSPEC, Ulrich’s Periodicals Directory, Crossref, ProQuest, WorldCat, and EI (INSPEC, IET). 2014-04-05 T1. Novel Algorithms T2. Association Rules T3. Knowledge […]
Feb, 27
2014 3rd International Conference on Computing and Computer Vision, ICCCV 2014
All papers for the ICCCV 2014 will be published in the Journal of Image and Graphics (JOIG, ISSN: 2301-3699) as one volume, and will be indexed by Ulrich’s Periodicals Directory, Google Scholar, EBSCO, Engineering & Technology Digital Library and Electronic Journals Digital Library. 2014-04-01 Machine Vision, Image Processing, and Pattern Analysis Imaging Sensors Color and […]
Feb, 27
Parallel dual tree traversal on multi-core and many-core architectures for astrophysical N-body simulations
In astrophysical N-body simulations, Dehnen’s algorithm, implemented in the serial falcON code and based on a dual tree traversal, is faster than serial Barnes-Hut tree-codes, but outperformed by parallel CPU and GPU tree-codes. In this paper, we present a parallel dual tree traversal, implemented in the pfalcON code, targeting multi-core CPUs and manycore architectures (Xeon […]
Feb, 27
Face Recognition Using OpenCL
Face recognition is the biometric identification of a human face by matching its image against a library of known faces. The algorithm used here is the Eigenfaces algorithm, and the proposed implementation uses OpenCL. OpenCL (Open Computing Language) is an open standard for general purpose parallel programming […]
Feb, 27
G-Heart: A GPU-based System for Electrophysiological Simulation and Multi-modality Cardiac Visualization
Cardiac electrophysiological simulation and multi-modality visualization are computationally intensive and valuable in studying the structure, mechanism, and dynamics of the heart. The existing multi-CPU based approaches can reduce the calculation time, but suffer from hardware and communication costs and are inefficient for 3D data visualization. Compared with multi-CPU, the highly parallel and multi-core properties […]
Feb, 27
Exploitation of GPUs for the Parallelisation of Probably Parallel Legacy Code
General purpose GPUs provide massive compute power, but are notoriously difficult to program. In this paper we present a complete compilation strategy to exploit GPUs for the parallelisation of sequential legacy code. Using hybrid data dependence analysis combining static and dynamic information, our compiler automatically detects suitable parallelism and generates parallel OpenCL code from sequential […]
Feb, 27
Extending the Generalized Fermat Prime Number Search Beyond One Million Digits Using GPUs
Great strides have been made in recent years in the search for ever larger prime Generalized Fermat Numbers (GFN). We briefly review the history of the GFN prime search, and describe new implementations of the ‘Genefer’ software (now available as open source) using CUDA and optimised CPU assembler which have underpinned this unprecedented progress. The […]
Feb, 26
2014 3rd International Conference on Software and Computer Applications, ICSCA 2014
All papers for the ICSCA 2014 will be published in the Journal of Lecture Notes on Software Engineering (LNSE, ISSN: 2301-3559) as one volume, and will be indexed by DOAJ, Electronic Journals Library, Engineering & Technology Digital Library, EBSCO, Ulrich’s Periodicals Directory, International Computer Science Digital Library (ICSDL), ProQuest and Google Scholar. Software Engineering Artificial […]