Posts
Apr, 19
A pseudospectral matrix method for time-dependent tensor fields on a spherical shell
We construct a pseudospectral method for the solution of time-dependent, non-linear partial differential equations on a three-dimensional spherical shell. The problem we address is the treatment of tensor fields on the sphere. As a test case we consider the evolution of a single black hole in numerical general relativity. A natural strategy would be the […]
Apr, 18
Building a Personal High Performance Computer with Heterogeneous Processors
Personal high performance computer (PHPC) requires lower cost and high performance. The Teraflops PHPC systems with special accelerator units like GPGPU have been presented, but they have difficulties in programming, compatibility and applicability. In this paper, we present HPP-PHPC, a hybrid architecture of heterogeneous processors connected by non-coherent off-chip system bus. The performance of HPP-PHPC […]
Apr, 18
Practical Pre-stack Kirchhoff Time Migration of Seismic Processing on General Purpose GPU
In this paper, we introduced three prototypes of GPGPU solutions on NVidia GeForce8800GT for a practical Pre-stack Kirchhoff Time Migration program. We presented how to re-design and re-implement the original CPU code to efficiency GPU code. The prototypes are more than at most 7.2 times faster than its CPU version on Intel’s P4 3.0G.
Apr, 18
GPU detectors for interference cancellation in chaos-based CDMA communications
Multi-user detection is an effective technique to reduce the mutual interference between users in code division multiple access (CDMA) communications at the cost of a larger number of arithmetic operations. It is shown that multi-user detection can be efficiently computed on graphics processors using a GPGPU approach. Specifically, two GPU parallel interference cancellation detectors for […]
Apr, 18
Efficient characterizations of composite materials electrical properties based on GPU accelerated finite difference method
In this paper, a GPU accelerated three-dimensional finite difference method is presented as an efficient approach of performing fast parallel simulations of composite materials. Using a NVIDIA GeForce 9800 series GPGPU and with an optimized CUDA implementation, a considerable speed-up (>20) was observed for simulations of large size problems. Further performance improvements could be achieved […]
Apr, 18
Many-Core vs. Many-Thread Machines: Stay Away From the Valley
We study the tradeoffs between many-core machines like Intel’s Larrabee and many-thread machines like Nvidia and AMD GPGPUs. We define a unified model describing a superposition of the two architectures, and use it to identify operation zones for which each machine is more suitable. Moreover, we identify an intermediate zone in which both machines deliver […]
Apr, 18
Linear optimization on modern GPUs
Optimization algorithms are becoming increasingly more important in many areas, such as finance and engineering. Typically, real problems involve several hundreds of variables, and are subject to as many constraints. Several methods have been developed trying to reduce the theoretical time complexity. Nevertheless, when problems exceed reasonable sizes they end up being very computationally intensive. […]
Apr, 18
Real-time 3D reconstruction and pose estimation for human motion analysis
In this paper, we present a markerless 3D motion capture system based on a volume reconstruction technique of non rigid bodies. It depicts a new approach for pose estimation in order to fit an articulated body model into the captured real-time information. We aim at analyzing athlete’s movements in real-time within a 3D interactive graphics […]
Apr, 18
TransCAIP: A Live 3D TV System Using a Camera Array and an Integral Photography Display with Interactive Control of Viewing Parameters
The system described in this paper provides a real-time 3D visual experience by using an array of 64 video cameras and an integral photography display with 60 viewing directions. The live 3D scene in front of the camera array is reproduced by the full-color, full-parallax autostereoscopic display with interactive control of viewing parameters. The main […]
Apr, 18
Accelerated multi-view stereo using parallel processing capababilities of the GPUS
This paper presents an accelerated implementation of a multi-view stereo pipeline using parallel processing capabilities of the GPUs. Our system takes as input a set of calibrated photographs and produces a textured 3D mesh of the scene. The pipeline is divided into three parts: dense stereo matching, multi-view correspondence linking and 3D model generation. First, […]
Apr, 18
CUDA-BLASTP: Accelerating BLASTP on CUDA-Enabled Graphics Hardware
Scanning protein sequence database is an often repeated task in computational biology and bioinformatics. However, scanning large protein databases, such as GenBank, with popular tools such as BLASTP requires long runtimes on sequential architectures. Due to the continuing rapid growth of sequence databases, there is a high demand to accelerate this task. In this paper, […]
Apr, 17
Fast Dynamic Voronoi Treemaps
The Voronoi Treemap is a space-filling treemap technique that relaxes the constraints of rectangular nodes. Its organic shapes maintain a one-to-one aspect ratio, are flexible with their placement, allowing stable zooming and dynamic data values. In this paper, we present algorithms for efficient computation and dynamic update of Voronoi Treemaps. Our GPGPU-based technique allows for […]