Posts
Jan, 19
A framework for efficient execution on GPU and CPU+GPU systems
Technological limitations faced by the semi-conductor manufacturers in the early 2000’s restricted the increase in performance of the sequential computation units. Nowadays, the trend is to increase the number of processor cores per socket and to progressively use the GPU cards for highly parallel computations. Complexity of the recent architectures makes it difficult to statically […]
Jan, 18
The 2nd International Conference on Control, Automation and Robotics (ICCAR), 2016
ICCAR 2016 conference proceedings will be published by IEEE Conference Publication, which would be indexed by. ★ ICCAR 2016 is in the IEEE conference list. http://www.ieee.org/conferences_events/conferences/conferencedetails/index.html?Conf_ID=38085 ★Publication and Indexing History of ICCAR: ICCAR 2015, Singapore, May 20-22, 2015. Publication: IEEE Conference Proceedings Online: http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?reload=true&punumber=7153096 ★Keynote &Plenary Speakers Prof. Wei-Hsin Liao, The Chinese University of Hong […]
Jan, 16
Homomorphic Autocomplete
With the rapid progress in fully homomorpic encryption (FHE) and somewhat homomorphic encryption (SHE) schemes, we are witnessing renewed efforts to revisit privacy preserving protocols. Several works have already appeared in the literature that provide solutions to these problems by employing FHE or SHE techniques. These applications range from cloud computing to computation over confidential […]
Jan, 16
Performance Analysis of Roberts Edge Detection Using CUDA and OpenGL
The evolution of high-performance and programmable graphics processing units (GPUs) has generated considerable advancements in graphics and parallel computing. In this paper we present a Roberts filter based on edge detection algorithm using CUDA and OpenGL architectures. The basic idea is to use the Pixel Buffer Object (PBO) to create images with CUDA on a […]
Jan, 16
LHCb GPU acceleration project
The LHCb detector is due to be upgraded for processing high-luminosity collisions, which will increase data bandwidth to the event filter farm from 100 GB/s to 4 TB/s, encouraging us to look for new ways of accelerating Online reconstruction. The Coprocessor Manager is a new framework for integrating LHCb’s existing computation pipelines with massively parallel […]
Jan, 16
Contributions of hybrid architectures to depth imaging: a CPU, APU and GPU comparative study
In an exploration context, Oil and Gas (O&G) companies rely on HPC to accelerate depth imaging algorithms. Solutions based on CPU clusters and hardware accelerators are widely embraced by the industry. The Graphics Processing Units (GPUs), with a huge compute power and a high memory bandwidth, had attracted significant interest. However, deploying heavy imaging workflows, […]
Jan, 16
International Conference on Optoelectronics and Image Processing (ICOIP), 2016
=Publication= ICOIP 2016 conference Proceedings by IEEE *Reviewed by IEEE Xplore and Ei Compendex. =Keynote Speakers= Prof. Ryszard Tadeusiewicz AGH University of Science and Technology, Poland Prof. Alexander Balinsky Cardiff University, UK Prof. BEN-OTHMAN Jalel University of Paris 13, France =Agenda= June 10, 2016 – Registration & Conference Materials Collection June 11, 2016 – Keynote […]
Jan, 14
Odyssey: A Public GPU-Based Code for General-Relativistic Radiative Transfer in Kerr Spacetime
General-relativistic radiative transfer (GRRT) calculations coupled with the calculation of geodesics in the Kerr spacetime are an essential tool for determining the images, spectra and light curves from matter in the vicinity of black holes. Such studies are especially important for ongoing and upcoming millimeter/submillimeter (mm/sub-mm) Very Long Baseline Interferometry (VLBI) observations of the supermassive […]
Jan, 14
Aging in the three-dimensional Random Field Ising Model
We studied the nonequilibrium aging behavior of the Random Field Ising Model in three dimensions for various values of the disorder strength. This allowed us to investigate how the aging behavior changes across the ferromagnetic-paramagnetic phase transition. We investigated a large system size of $N=256^3$ spins and up to $10^8$ Monte Carlo sweeps. To reach […]
Jan, 14
A Case for Work-stealing on FPGAs with OpenCL Atomics
We provide a case study of work-stealing, a popular method for run-time load balancing, on FPGAs. Following the Cederman-Tsigas implementation for GPUs, we synchronize workitems not with locks, mutexes or critical sections, but instead with the atomic operations provided by Altera’s OpenCL SDK. We evaluate work-stealing for FPGAs by synthesizing a K-means clustering algorithm on […]
Jan, 14
Classification of Higgs Boson Tau-Tau decays using GPU accelerated Neural Networks
In particle physics, Higgs Boson to tau-tau decay signals are notoriously difficult to identify due to the presence of severe background noise generated by other decaying particles. Our approach uses neural networks to classify events as signals or background noise.
Jan, 14
A Survey Of Techniques for Approximate Computing
Approximate computing trades off computation quality with the effort expended and as rising performance demands confront with plateauing resource budgets, approximate computing has become, not merely attractive, but even imperative. In this paper, we present a survey of techniques for approximate computing (AC). We discuss strategies for finding approximable program portions and monitoring output quality, […]