Posts

Nov, 10

Run, Stencil, Run! – A Comparison of Modern Parallel Programming Paradigms

While the performance of supercomputers has increased dramatically during the last 15 years, programming models and programming languages have more or less remained constant. Two de facto standards, the Message Passing Interface (MPI) for programming distributed-memory architectures and OpenMP for programming shared-memory architectures, still dominate the field of computational science and engineering. As current […]
Nov, 10

Generation of planar radiographs from 3D anatomical models using the GPU

The rapid growth of the number of transistors on integrated circuits has enabled numerous advances in computational hardware. Computer graphics development has benefited from these advances, reaching a stage where it delivers a realistic and rich user experience through amazing graphics. GPUs are nowadays capable of processing massive amounts of data by taking advantage of their intrinsic […]
Nov, 10

cudaBayesreg: Parallel Implementation of a Bayesian Multilevel Model for fMRI Data Analysis

Graphics processing units (GPUs) are rapidly gaining maturity as powerful general parallel computing devices. A key feature in the development of modern GPUs has been the advancement of the programming model and programming tools. Compute Unified Device Architecture (CUDA) is a software platform for massively parallel high-performance computing on Nvidia many-core GPUs. In functional magnetic […]
Nov, 10

Fast Hair Simulation and Rendering Using CUDA and OpenGL

Realistically modeling and animating human hair is an open challenge in computer graphics. Human hair is geometrically complex, both because of the very large number of strands on a head and because of microscopic variations within a single strand that contribute to its unique material properties. Additionally, due to complex interaction among all the hairs it […]
Nov, 10

GPU Cuda Performance on Two-Dimensional and Three-Dimensional VAWT Vortex Models

An analytical model of a vertical axis wind turbine was implemented using both a 2-D and a 3-D vortex model. The model requires significant amounts of computational resources and time, compelling the use of an improved method for executing the algorithm in a highly parallelized fashion. Graphics Processing Units (GPUs), which are a new highly […]
Nov, 10

Understanding the impact of CUDA tuning techniques for Fermi

While the correctness of an NVIDIA CUDA program is easy to achieve, exploiting the GPU capabilities to obtain the best possible performance is a task for experienced CUDA programmers. Typical code tuning strategies, like choosing an appropriate size and shape for the threadblocks, programming good coalescing, or maximizing occupancy, are inter-dependent. Moreover, the choices […]
Nov, 10

Exploring Applications in CUDA

Modern Graphics Processing Units (GPUs) are capable of much more than supporting GUIs and generating 3D graphics. These devices are highly parallel, highly multithreaded multiprocessors harnessing a large amount of floating-point processing power for non-graphics problems. This project is based on experiments in CUDA C. These examples seek to demonstrate the potential speedups offered by […]
Nov, 10

GPU Acceleration of Matrix-based Methods in Computational Electromagnetics

This work considers the acceleration of matrix-based computational electromagnetic (CEM) techniques using graphics processing units (GPUs). These massively parallel processors have gained much support since late 2006, with software tools such as CUDA and OpenCL greatly simplifying the process of harnessing the computational power of these devices. As with any advances in computation, the use […]
Nov, 10

A CPU-GPU Hybrid Runtime for the Aeminium Language

Given that CPU clock speeds are stagnating, programmers are resorting to parallelism to improve the performance of their applications. Although such parallelism has usually been attained using multicore architectures, multiple CPUs and/or clusters of machines, the GPU has since been used as an alternative. GPUs are an interesting resource because they can provide much […]
Nov, 10

Bit-Parallel Multiple Pattern Matching

Text matching with errors is a regular task in computational biology. We present an extension of the bit-parallel Wu-Manber algorithm to combine several searches for a pattern into a collection of fixed-length words. We further present an OpenCL parallelization of a redundant index on massively parallel multicore processors, within a framework of searching for similarities […]
Nov, 9

GrAVity: a massively parallel antivirus engine

In the ongoing arms race against malware, antivirus software is at the forefront, as one of the most important defense tools in our arsenal. Antivirus software is flexible enough to be deployed anywhere from regular users' desktops to corporate e-mail proxies and file servers. Unfortunately, the signatures necessary to detect incoming malware number in the tens […]
Nov, 9

Parallel Implementation of Otsu’s Binarization Approach on GPU

Fast algorithms are important for efficient image processing systems that handle large sets of calculations. To speed up the processing, a parallel implementation of an algorithm can be done using a Graphics Processing Unit (GPU). The GPU is general-purpose computation hardware; its programmability and low cost make it productive. Binarization is a widely used technique in image analysis and […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
