Posts
May, 13
Cluster-Level Tuning of a Shallow Water Equation Solver on the Intel MIC Architecture
The paper demonstrates the optimization of the execution environment of a hybrid OpenMP+MPI computational fluid dynamics code (shallow water equation solver) on a cluster enabled with Intel Xeon Phi coprocessors. The discussion includes: – Controlling the number and affinity of OpenMP threads to optimize access to memory bandwidth; – Tuning the inter-operation of OpenMP and […]
May, 13
Fast Finite Solar Radiation Pressure Model Integration Using OpenGL
By coupling a common approach to vector graphics, OpenGL, high-fidelity solar-radiation pressure (SRP) effects are calculated easily and quickly with the power of graphics processing units (GPUs). For some missions SRP is a significant perturbation and a consideration wherein a simplified plate model does not suffice. OpenGL is a set of commands that interact with […]
May, 13
Impact of Modern OpenGL on FPS
In our work we choose several old and modern features of OpenGL that applications use to render scenes and compare their impact on rendering speed. We focus our comparison not solely on these features, but also on the type of hardware used for the measurements. We run our tests on a professional graphics card […]
May, 13
Deriving Shape Grammars on the GPU
Due to growing demand for computer generated graphical content, procedural modeling has become an important topic in the gaming and movie industry. Creating vast amounts of content by hand requires excessive amounts of manual labor. Using a procedural rule set, entire worlds can be generated by a computer. However, the traditional CPU-based derivation of a […]
May, 13
K-Means on GPU: A Review
K-Means is the most popular clustering algorithm in data mining. The size of data sets has been increasing tremendously. Owing to recent developments in inexpensive shared-memory architectures such as Graphics Processing Units (GPUs), general-purpose applications are implemented on GPUs using the Compute Unified Device Architecture (CUDA). Cost effectiveness of […]
May, 13
Performance Analysis of Sobel Edge Filter on Heterogeneous System Using OpenCL
Edge detection is the fundamental task required by any image or video processing application, such as video surveillance or medical imaging. Any of the available filters can be used to detect edges. In this paper, the Sobel edge filter is used to compare performance on CPUs and GPUs, and from this study it is found […]
May, 12
Geometric Algebra Enhanced Precompiler for C++, OpenCL and Mathematica’s OpenCLLink
The focus of this work is a simplified integration of algorithms expressed in Geometric Algebra (GA) into modern high level computer languages, namely C++, OpenCL and CUDA. A high runtime performance in terms of GA is achieved using symbolic simplification and code generation by a precompiler that is directly integrated into CMake-based build toolchains. Finally, […]
May, 12
Implementation of a distributed real-time video panorama pipeline for creating high quality virtual views
Today, we are continuously looking for more immersive video systems. Such systems, however, require more content, which can be costly to produce. A full panorama, covering regions of interest, can contain all the information required, but can be difficult to view in its entirety. In this thesis, we discuss a method for creating virtual views […]
May, 12
Entropy-based High Performance Computation of Boolean SNP-SNP Interactions Using GPUs
It is increasingly accepted that traditional statistical Single Nucleotide Polymorphism (SNP) analysis of Genome-Wide Association Studies (GWAS) reveals just a small part of the heritability in complex diseases. The study of SNP interactions identifies additional SNPs that contribute to disease but that do not reach genome-wide significance or exhibit only epistatic effects. We have introduced […]
May, 12
Code Optimization and Performance Analysis of Oceanographic Software Package NEMO for GPGPU Systems
The paper presents our experience in code optimization and performance analysis of software package NEMO on hybrid parallel computer systems with accelerators. NEMO Ocean is a software package for oceanology, simulating ocean gyres and sea-ice models. Code optimization and performance analysis are performed for the case study of NEMO ORCA2_LIM configuration. All experiments are conducted […]
May, 12
Accelerating the scoring module of mass spectrometry-based peptide identification using GPUs
BACKGROUND: Tandem mass spectrometry-based database searching is currently the main method for protein identification in shotgun proteomics. The explosive growth of protein and peptide databases, which is a result of genome translations, enzymatic digestions, and post-translational modifications (PTMs), is making computational efficiency in database searching a serious challenge. Profile analysis shows that most search engines […]
May, 10
Dynamic Orchestration of Massively Data Parallel Execution
Graphics processing units (GPUs) are specialized hardware accelerators capable of rendering graphics much faster than conventional general-purpose processors. They are widely used in personal computers, tablets, mobile phones, and game consoles. Modern GPUs are not only efficient at manipulating computer graphics, but are also more effective than CPUs for algorithms where processing of large data […]