Posts
Apr, 30
The 2013 International Conference on Network Computing and Information Security and the 2013 International Conference on Multimedia and Signal Processing, NCIS’13- CMSP’13
The 2013 International Conference on Network Computing and Information Security (NCIS’13) and the 2013 International Conference on Multimedia and Signal Processing (CMSP’13) will be jointly held at Guiyang, China in September 20-22, 2013. NCIS’13- CMSP’13 aims to provide a high-level international forum for scientists and researchers to present the state of the art of Network […]
Apr, 30
Automatic Compilation for Heterogeneous Architectures with Single Assignment C
In recent years, we have witnessed an increasing heterogeneity of computing resources. A typical laptop today combines at least one multicore processor with one general purpose graphics processing unit (GPGPU), while supercomputer nodes typically have several of each. Exploiting all these available computing resources effectively is very important, but also still very challenging. In this […]
Apr, 30
Improving Numerical Accuracy for Non-Negative Matrix Multiplication on GPUs using Recursive Algorithms
Scientific computing is only bound by the limits of Moore’s Law and the scalability of high performance mathematical library implementations. Most mathematical libraries however tend to focus only on general inputs, limiting their potential performance and scalability by not tailoring their implementation to specific inputs, such as non-negative inputs. By removing this limitation it is […]
Apr, 30
MPI Derived Datatypes Processing on Noncontiguous GPU-resident Data
Driven by the goals of efficient and generic communication of noncontiguous data layouts in GPU memory, for which solutions do not currently exist, we present a parallel, noncontiguous data-processing methodology through the MPI datatypes specification. Our processing algorithm utilizes a kernel on the GPU to pack arbitrary noncontiguous GPU data by enriching the datatypes encoding […]
Apr, 30
High Performance Data Leak Detection
We describe a novel deep packet inspection technique that provides precise quantitative measures for detecting data exfiltration. We point out the fundamental differences between our data leak detection and the conventional intrusion detection systems (IDS). The key to our solution is a powerful sampling algorithm and a sophisticated local alignment algorithm. Our sampling method has […]
Apr, 30
Mr. Scan: Extreme Scale Density-Based Clustering using a Tree-Based Network of GPGPU Nodes
Density-based clustering algorithms are a widely-used class of data mining techniques that can find irregularly shaped clusters and cluster data without prior knowledge of the number of clusters it contains. DBSCAN is the most well-known density-based clustering algorithm. We introduce our version of DBSCAN, called Mr. Scan, which uses a hybrid parallel implementation that combines […]
Apr, 29
RealTime GPU-Based Motion Planning for Task Executions
We present a realtime GPU-based motion planning algorithm for robot task executions. Many task execution strategies break down a high-level task planning problem into multiple low-level motion planning problems, and it is essential to solve those problems at interactive rates. In order to achieve high performance for the planning, our method exploits a high number […]
Apr, 29
Multigrid Optimization Methods for High Performance Computing
The aim of this work was the investigation of implementability and efficiency of an algorithm for solving optimal control problems on a new hardware architecture. For an academic test problem the collective smoothing multigrid method (CSMG) was realized on a commodity graphics card (GPU) and the performance in term of elapsed time compared to those […]
Apr, 29
Analysis of Multicore CPU and GPU Toward Parallelization of Total Focusing Method Ultrasound Reconstruction
Ultrasonic imaging and reconstruction tools are com-monly used to detect, identify and measure defects in different mechanical parts. Due to the complexity of the underlying physics, and due to the evergrowing quantity of acquired data, computation time is becoming a limitation to the opti-mal inspection of a mechanical part. This article presents the performances of […]
Apr, 29
Local Histogram Modification Based Contrast Enhancement with GPU Acceleration
This paper presents a novel local contrast enhancement algorithm based on local histogram modification. The computation of local contrast enhancement operators is usually slow though they produce better local contrast and details. We have addressed this issue by subtly designing a highly parallel algorithm, which could be easily implemented on Graphics Processing Units (GPU) to […]
Apr, 29
Split tiling for GPUs: automatic parallelization using trapezoidal tiles
Tiling is a key technique to enhance data reuse. For computations structured as one sequential outer "time" loop enclosing a set of parallel inner loops, tiling only the parallel inner loops may not enable enough data reuse in the cache. Tiling the inner loops along with the outer time loop enhances data locality but may […]
Apr, 27
CUDA Based CAMshift Algorithm for Object Tracking Systems
In this paper, we present an image object tracking system for GPGPU based CAMshift algorithm. For image object tracking, we use the parallel CAMshift tracking algorithm based on the HSV color image distribution of detected moving objects. In this, RGB-to-HSV color conversion, image masking such as open and close operation for image morphology, and computing […]