9064

Posts

Feb, 28

GEARS: A General and Efficient Algorithm for Rendering Shadows

We present a soft shadow rendering algorithm that is general, efficient, and accurate. The algorithm supports fully dynamic scenes, with moving and deforming blockers and receivers, and with changing area light source parameters. The algorithm computes for each output image pixel a tight but conservative approximation of the set of triangles that block the light […]
Feb, 28

Acceleration of the MMFF94 routines within OpenBabel using Eigen and OpenCL

Over the last few decades, computer modelling and computer simulation have become an invaluable tool for computational chemists interested in advancing their research and experiment in a more efficient, cost effective way with new molecules. As computer capabilities increase the demand for more accurate models and faster simulations has also grown. Some of these models […]
Feb, 28

A Fast and Efficient SIFT Detector Using the Mobile GPU

Emerging mobile applications, such as augmented reality, demand robust feature detection at high frame rates. We present an implementation of the popular Scale-Invariant Feature Transform (SIFT) feature detection algorithm that incorporates the powerful graphics processing unit (GPU) in mobile devices. Where the usual GPU methods are inefficient on mobile hardware, we propose a heterogeneous dataflow […]
Feb, 28

Accelerating Dynamic Binary Translation with GPUs

Binary translation is the emulation of one instruction set by another through translation of code. In binary translation sequences of instructions are translated from the source to the target instruction set. Dynamic binary translation (DBT) looks at a short sequence of code – typically on the order of a single basic block – then translate […]
Feb, 27

Multi-GPU based on multicriteria optimization for motion estimation system

Graphics processor units (GPUs) offer high performance and power efficiency for a large number of data-parallel applications. Previous research has shown that a GPU-based version of a neuromorphic motion estimation algorithm can achieve a x32 speedup using these devices. However, the memory consumption creates a bottleneck due to the expansive tree of signal processing operations […]
Feb, 27

Parallel simulation of population balance model-based particulate processes using multi-core CPUs and GPUs

Computer-aided modeling and simulation is a crucial step in developing, integrating and optimizing unit operations and subsequently entire processes, in the chemical/pharmaceutical industry. This study details two methods of reducing the computational time to solve complex process models, namely the population balance model which given the source terms can be very computationally intensive. Population balance […]
Feb, 27

Massive Parallel Implementation of ODE Solvers

The presented contribution maps the possibilities of exploitation of the massive parallel computational hardware (namely GPU) for solution of the initial value problems of ordinary differential equations. Two cases are discussed: parallel solution of a single ODE and parallel execution of scalar ODE solvers. Whereas the advantages of the special architecture in the case of […]
Feb, 27

Parallel Shooting and Bouncing Ray Method on GPU Clusters for Analysis of Electro-Magnetic Scattering

This paper proposes an efficient parallel shooting and bouncing ray (SBR) method on the graphics processing unit (GPU) cluster for solving the electromagnetic scattering problems. At each incident direction, the parallel SBR method partitions the virtual aperture into sub-apertures, and distributes the computational process of each sub-aperture over GPU nodes. As ray tubes in the […]
Feb, 27

Utilizing Graphics Processing Units for Network Anomaly Detection

This research explores the benefits of using commonly-available graphics processing units (GPUs) to perform classification of network traffic using supervised machine learning algorithms. Two full factorial experiments are conducted using a NVIDIA GeForce GTX 280 graphics card. The goal of the first experiment is to create a baseline for the relative performance of the CPU […]
Feb, 25

A Wrapper of OpenCL library for gVirtus Framework

Integration of GPUs in a Cloud Computing environment is an emerging research area. The GPUs are used to perform processing extremely exigent in terms of power computation and the Cloud computing has the potential to become a major tool for scientists and engineers to access high performance resource and storage. The proposed work has been […]
Feb, 25

Advanced Video Coding on CPUs and GPUs: Parallelization and RD Analysis

Increasing need for high quality video communication and video streaming, and tremendous growth of video content on Internet stimulated development of highly efficient compression methods. H.264/AVC is the newest international video coding standard, which achieves compression gain of about 50% comparing the previous standards, keeping the same quality of reconstructed video [1]. However, such compression […]
Feb, 25

SOAP3-dp: Fast, Accurate and Sensitive GPU-based Short Read Aligner

To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most of the existing short-read aligners can be configured to favor speed in trade of accuracy and sensitivity. SOAP3-dp, through leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: