Posts
Mar, 9
GPU accelerated maximum cardinality matching algorithms for bipartite graphs
We design, implement, and evaluate GPU-based algorithms for the maximum cardinality matching problem in bipartite graphs. Such algorithms have a variety of applications in computer science, scientific computing, bioinformatics, and other areas. To the best of our knowledge, ours is the first study that focuses on GPU implementations of maximum cardinality matching algorithms. We […]
Mar, 7
Solutions For Optimizing The Radix Sort Algorithmic Function Using The Compute Unified Device Architecture
In this paper, we have researched and developed solutions for optimizing the radix sort algorithmic function using the Compute Unified Device Architecture (CUDA). The radix sort is a common parallel primitive, an essential building block for many data processing algorithms, whose optimization improves the performance of a wide class of parallel algorithms useful in data […]
Mar, 7
GPU based Eulerian Assembly of Genomes
Advances in sequencing technologies have revolutionized the field of genomics by providing cost-effective, high-throughput solutions. In this paper, we develop a parallel sequence assembler implemented on general-purpose graphics processing units (GPUs). Our work was largely motivated by a growing need in the genomic community for sequence assemblers and increasing use of […]
Mar, 7
GPU-Accelerated Standard and Multi-Population Cultural Algorithms
In this paper, we present three parallel cultural algorithms using CUDA-enabled GPUs. Firstly, we used the GPU to accelerate an expensive fitness function. Next, the parallel versions of both standard and multi-population CAs were presented. Experiments show that the standard CA with an expensive fitness function was made more than 600 times faster. On lightweight […]
Mar, 7
Using Graphical Processing Units for Deterministic Single Machine Scheduling Problems
This paper introduces how graphical processing units can be used for non-graphics problems and tasks. First, a history of the GPU is provided. The next part focuses on GPU programming, with a brief description of the available hardware facilities and programming languages. As an initial result of the project […]
Mar, 5
Accelerating Simulation of Agent-Based Models on Heterogeneous Architectures
The wide usage of GPGPU programming models and compiler techniques enables the optimization of data-parallel programs on commodity GPUs. However, mapping GPGPU applications running on discrete parts to emerging integrated heterogeneous architectures, such as the AMD Fusion APU and Intel Sandy/Ivy Bridge, with the CPU and the GPU on the same die has not been […]
Mar, 5
Parallel Ray Tracing in Scientific Visualization
Ray tracing presents an efficient rendering algorithm for scientific visualization using common visualization tools and scales with increasingly large geometry counts while allowing for accurate physically-based visualization and analysis, which enables enhanced rendering and new visualization techniques. Interactivity is of great importance for data exploration and analysis in order to gain insight into large-scale data. […]
Mar, 5
Parallel Algorithm for Generation of Test Recommended Path using CUDA
Software testing of an application helps the user find defects. The users, called testers, should test various situations with test cases. To create test cases, many states and events have to be considered, and creating test cases with many states and events takes a long time. Instead of using the common […]
Mar, 5
Performance Analysis of a Symmetric Cryptographic Algorithm on Multicore Architectures
In this paper, a performance analysis of the symmetric encryption algorithm AES (Advanced Encryption Standard) on various multicore architectures is presented. To this end, three C-language implementations were developed, using the parallel programming tools OpenMP, MPI, and CUDA to run on multicore processors, multicore clusters, and GPUs, respectively. The […]
Mar, 5
Large-scale Virtual Acoustics Simulation at Audio Rates Using Three Dimensional Finite Difference Time Domain and Multiple GPUs
The computation of large-scale virtual acoustics using the 3D finite difference time domain (FDTD) is prohibitively computationally expensive, especially at high audio sample rates, when using traditional CPUs. In recent years the computer gaming industry has driven the development of extremely powerful Graphics Processing Units (GPUs). Through specialised development and tuning we can exploit the […]
Mar, 3
Low-Energy Application Parallelism 2013, LEAP 2013
LEAP 2013 is the place to learn about and share the latest advances in the use of high-performance parallel computing technology on low-power mobile CPU, GPU, FPGA and embedded processors. Two days of world-class education and networking will give developers, researchers, engineers and technology managers the vital knowledge they need to understand, assess and exploit […]
Mar, 2
OpenOF: Framework for Sparse Non-linear Least Squares Optimization on a GPU
In the area of computer vision and robotics, non-linear optimization methods have become an important tool. For instance, all structure from motion approaches apply optimizations such as bundle adjustment (BA). Most often, the structure of the problem is sparse regarding the functional relations of parameters and measurements. The sparsity of the system has to be […]