Posts
Feb, 7
CFP: The 2017 International Workshop on Embedded Multicore Systems (ICPPEMS), 2017
The 2017 International Workshop on Embedded Multicore Systems to be held in conjunction with the 46th International Conference on Parallel Processing (ICPP 2017) https://sites.google.com/view/icppems2017 Embedded systems with multicore designs are of major focuses from both industry and academia. While embedded multicore systems will look to play an important role ahead for system designs, many challenging […]
Feb, 7
Machines and Algorithms
I discuss the evolution of computer architectures with a focus on QCD and with reference to the interplay between architecture, engineering, data motion and algorithms. New architectures are discussed and recent performance results are displayed. I also review recent progress in multilevel solver and integation algorithms.
Feb, 7
Recurrent Neural Networks for anomaly detection in the Post-Mortem time series of LHC superconducting magnets
This paper presents a model based on Deep Learning algorithms of LSTM and GRU for facilitating an anomaly detection in Large Hadron Collider superconducting magnets. We used high resolution data available in Post Mortem database to train a set of models and chose the best possible set of their hyper-parameters. Using Deep Learning approach allowed […]
Feb, 7
Introduction to the Special Issue on Digital Signal Processing in Radio Astronomy
Advances in astronomy are intimately linked to advances in digital signal processing (DSP). This special issue is focused upon advances in DSP within radio astronomy. The trend within that community is to use off-the-shelf digital hardware where possible and leverage advances in high performance computing. In particular, graphics processing units (GPUs) and field programmable gate […]
Feb, 7
A Dynamic Programming Model To Solve Optimisation Problems Using GPUs
This thesis presents a parallel, dynamic programming based model which is deployed on the GPU of a system to accelerate the solving of optimisation problems. This is achieved by simultaneously running GPU based computations, and memory transactions, allowing computation to never pause, and overcoming the memory constraints of solving large problem instances. Due to this […]
Feb, 7
Advanced Concurrency Control Algorithm Design and GPU System Support for High Performance In-Memory Data Management
The design and implementation of data management systems have been significantly affected by application demands and hardware advancements. On one hand, with the emerging of various new applications, the traditional one-size-fits-all data management system has evolved into domain specific systems optimized for each application (e.g., OLTP, OLAP, streaming, etc.). On the other hand, with increasing […]
Feb, 5
Critical Comparison of the Classification Ability of Deep Convolutional Neural Network Frameworks with Support Vector Machine Techniques in the Image Classification Process
Recently, a number of new image classification models have been developed to diversify the number of options available to prospective machine learning classifiers, such as Deep Learning. This is particularly important in the field of medical image classification as a misdiagnosis could have a severe impact on the patient. However, an assessment on the level […]
Feb, 5
Spark-GPU: An Accelerated In-Memory Data Processing Engine on Clusters
Apache Spark is an in-memory data processing system that supports both SQL queries and advanced analytics over large data sets. In this paper, we present our design and implementation of Spark-GPU that enables Spark to utilize GPU’s massively parallel processing ability to achieve both high performance and high throughput. Spark-GPU transforms a general-purpose data processing […]
Feb, 5
Fast Fourier Transforms over Prime Fields of Large Characteristic and their Implementation on Graphics Processing Units
Prime field arithmetic plays a central role in computer algebra and supports computation in Galois fields which are essential to coding theory and cryptography algorithms. The prime fields that are used in computer algebra systems, in particular in the implementation of modular methods, are often of small characteristic, that is, based on prime numbers that […]
Feb, 5
Clustering Throughput Optimization on the GPU
Large datasets in astronomy and geoscience often require clustering and visualizations of phenomena at different densities and scales in order to generate scientific insight. We examine the problem of maximizing clustering throughput for concurrent dataset clustering in spatial dimensions. We introduce a novel hybrid approach that uses GPUs in conjunction with multicore CPUs for algorithmic […]
Feb, 5
GraviDy: a GPU modular, parallel N-body integrator
A wide variety of outstanding problems in astrophysics involve the motion of a large number of particles ($Ngtrsim 10^{6}$) under the force of gravity. These include the global evolution of globular clusters, tidal disruptions of stars by a massive black hole, the formation of protoplanets and the detection of sources of gravitational radiation. The direct-summation […]
Feb, 2
Analysis and implementation of a BLAST-Like algorithm for MIC architectures
Sequence alignment is becoming increasingly important in our current day and age, and with the rise of coprocessors, it is important to adapt sequence alignment algorithms to the new architecture. Parallelization using SIMD technology has previously been achieved that implement alignment algorithms e efficiently such as SWIPE, described by Rognes in 2011. The Intel Xeon […]