Posts
Feb, 6
Parallel k-Means Image Segmentation Using Sort, Scan & Connected Components on a GPU
Image segmentation is required to run fast and without supervision to speed up subsequent processes such as object recognition or other high level tasks. General purpose computing on the GPU is a powerful tool to perform efficient image processing and has been applied to the image segmentation problem. However, state-of-the-art approaches still perform parts of […]
Feb, 6
Implementation of Fast Artificial Neural Network for Pattern Classification on Heterogeneous System
Neural networks have been part of an attempt to emulate the learning curve of the human nervous system. Graphics Processing Units (GPUs) that come with a Graphics card have hundreds of processing cores, and have highly parallel architecture. Because of the highly parallel architecture of GPUs, it suits very well for parallel architecture such as […]
Feb, 6
A Multi-GPU Sources Reconstruction Method for Imaging Applications
A profile reconstruction method using a surface inverse currents technique implemented on GPU is presented. The method makes use of the internal fields radiated by an equivalent currents distribution retrieved from scattered field information that is collected from multiple incident fields. Its main advantage over other inverse source-based techniques is the use of surface formulation […]
Feb, 5
Grex: An efficient MapReduce framework for graphics processing units
In this paper, we present a new MapReduce framework, called Grex, designed to leverage general purpose graphics processing units (GPUs) for parallel data processing. Grex provides several new features. First, it supports a parallel split method to tokenize input data of variable sizes, such as words in e-books or URLs in web documents, in parallel […]
Feb, 5
GPU-Accelerated High-Accuracy Molecular Docking using Guided Differential Evolution
The objective in molecular docking is to determine the best binding mode of two molecules in silico. A common application of molecular docking is in drug discovery where a large number of ligands are docked into a protein to identify potential drug candidates. This is a computationally intensive problem especially if the flexibility of the […]
Feb, 5
Vortex Methods for Fluid Simulation in Computer Graphics
Fluid simulations for computer graphics applications have attracted the attention of many researchers and practitioners due to the enhanced realism that natural phenomena simulation adds to graphical applications. Vortex methods are receiving increasing attention from the computer graphics community for simple and direct modeling of complex flow phenomena such as turbulence. However, vortex methods have […]
Feb, 5
A Comparison of CPU and OpenCL Parallelization Methods for Correlation and Graph Layout Algorithms used in the Network Analysis of High Dimensional Data
MOTIVATION: Many algorithms used in analysis of high dimensional data require significant processing time due to the sheer number of values compared. We describe the results of the parallelization of two algorithms central to the functionality of the network analysis tool BioLayout Express 3D; the calculation of correlation (Pearson, Spearman Rank) coefficient matrices used to […]
Feb, 5
Real-Time Phase Masks for Interactive Stimulation of Optogenetic Neurons
Experiments with networks of optogenetically altered neurons require stimulation with high spatio-temporal selectivity. Computer-assisted holography is an energy-efficient method for robust and reliable addressing of single neurons on the millisecond-timescale inherent to biologial information processing. We show that real-time control of neurons can be achieved by a CUDA-based hologram computation.
Feb, 3
Parallelization of the QR Decomposition with Column Pivoting Using Column Cyclic Distribution on Multicore and GPU Processors
The QR decomposition with column pivoting (QRP) of a matrix is widely used for numerical rank revealing in applications. The performance of LAPACK implementation (DGEQP3) of the Householder QRP algorithm is limited by Level 2 BLAS operations required for updating the column norms. In this paper, we propose an implementation of the QRP algorithm using […]
Feb, 3
Hybrid CPU-GPU Distributed Framework for Large Scale Mobile Networks Simulation
Most of the existing packet-level simulation tools are designed to perform experiments modeling a small to medium scale networks. The main reason of this limitation is the amount of available computation power and memory in quasi mono-process simulation environment. To enable efficient packet-level simulation for large scale scenario, we introduce a new CPUGPU co-simulation framework […]
Feb, 3
JPEG 2000 Wireless Image Transmission System using Encryption Domain Authentication
In this paper, we propose a wireless high resolution video transmission system with encryption and authentication. The proposed system is implemented by JPEG 2000 coding. We implement JPEG 2000 coder by GPU in CUDA which is an integrated development environment for GPU, or by JPEG 2000 codec LSI. Moreover, the authentication system can check the […]
Feb, 3
Fast and Maliciously Secure Two-Party Computation Using the GPU
We describe, and implement, a maliciously secure protocol for secure two-party computation, based on Yao’s garbled circuit and an efficient OT extension, in a parallel computational model. The implementation is done using CUDA and yields the fastest results for maliciously secure two-party computation in a realistic and practical setting by using a simple consumer grade […]