Posts
Feb, 27
Parallel Shooting and Bouncing Ray Method on GPU Clusters for Analysis of Electro-Magnetic Scattering
This paper proposes an efficient parallel shooting and bouncing ray (SBR) method on a graphics processing unit (GPU) cluster for solving electromagnetic scattering problems. At each incident direction, the parallel SBR method partitions the virtual aperture into sub-apertures and distributes the computation for each sub-aperture over GPU nodes. As ray tubes in the […]
Feb, 27
Utilizing Graphics Processing Units for Network Anomaly Detection
This research explores the benefits of using commonly available graphics processing units (GPUs) to perform classification of network traffic using supervised machine learning algorithms. Two full factorial experiments are conducted using an NVIDIA GeForce GTX 280 graphics card. The goal of the first experiment is to create a baseline for the relative performance of the CPU […]
Feb, 25
A Wrapper of OpenCL library for gVirtus Framework
Integration of GPUs in a Cloud Computing environment is an emerging research area. GPUs are used for processing that is extremely demanding in terms of computing power, and Cloud computing has the potential to become a major tool for scientists and engineers to access high performance resources and storage. The proposed work has been […]
Feb, 25
Advanced Video Coding on CPUs and GPUs: Parallelization and RD Analysis
The increasing need for high quality video communication and video streaming, and the tremendous growth of video content on the Internet, have stimulated the development of highly efficient compression methods. H.264/AVC is the newest international video coding standard, which achieves a compression gain of about 50% compared with the previous standards while keeping the same quality of reconstructed video [1]. However, such compression […]
Feb, 25
SOAP3-dp: Fast, Accurate and Sensitive GPU-based Short Read Aligner
To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most of the existing short-read aligners can be configured to favor speed at the expense of accuracy and sensitivity. SOAP3-dp, by leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, […]
Feb, 25
Graphics Card as a Cheap Supercomputer
The current powerful graphics cards, providing stunning real-time visual effects for computer-based entertainment, have to accommodate powerful hardware components that are able to deliver the photo-realistic simulation to the end-user. Given the vast computing power of the graphics hardware, its producers very often offer a programming interface that makes it possible to use the computational […]
Feb, 25
Future of GPGPU Micro-Architectural Parameters
As graphics processing units (GPUs) are becoming increasingly popular for general purpose workloads (GPGPU), the question arises how such processors will evolve architecturally in the near future. In this work, we identify and discuss tradeoffs for three GPU architecture parameters: active thread count, compute-memory ratio, and cluster and warp sizing. For each parameter, we propose […]
Feb, 23
Multi-GPU Computing for Achieving Speedup in Real-time Aggregate Risk Analysis
Stochastic simulation techniques employed for portfolio risk analysis, often referred to as Aggregate Risk Analysis, can benefit from exploiting state-of-the-art high-performance computing platforms. In this paper, we propose parallel methods to speed up aggregate risk analysis for supporting real-time pricing. To achieve this, an algorithm for analysing aggregate risk is proposed and implemented in C and […]
Feb, 23
Can PCM Benefit GPU? Reconciling Hybrid Memory Design with GPU Massive Parallelism for Energy Efficiency
In recent studies, phase changing memory (PCM) has shown promising energy efficiency for systems with a modest level of parallelism. But it remains an open question whether it can benefit GPU-like massively parallel systems. This work conducts the first systematic investigation into this question. It empirically shows that contrary to the promising results shown before […]
Feb, 23
Efficient Parallel and External Matching
We show that a simple algorithm for computing a matching on a graph runs in a logarithmic number of phases incurring work linear in the input size. The algorithm can be adapted to provide efficient algorithms in several models of computation, such as PRAM, External Memory, MapReduce and distributed memory models. Our CREW PRAM algorithm […]
Feb, 23
Adaptive Hardware-accelerated Terrain Tessellation
In this master's thesis report, a scheme for adaptive hardware tessellation is presented. The scheme uses an offline processing approach where a height map is analyzed in terms of curvature and the result is stored in a resource called a density map. This density map is then bound as a resource to the hardware tessellation stage […]
Feb, 23
Parallel Computer Vision: Person Data Extraction
Face recognition has been established in many environments these days. It is used in security systems, social media platforms, and digital cameras to support the user. In addition, the rapidly rising number of CPU cores in modern PCs and handhelds lets us do more complex work on a single machine. The central question of […]