Posts
Apr, 10
A Real-Time Soft Shadow Rendering Algorithm by Occluder-Discretization
This paper presents a real-time soft shadow rendering algorithm based on the shadow-mapping technique. The key idea of this algorithm is to use only a single shadow map for a flat extended light source. The algorithm also uses the single shadow map to discretize the occluders to many flat patches which are parallel with the […]
Apr, 10
A Parallel Algorithm for Dot Product over Word-Size Finite Field Using Floating-Point Arithmetic
Recently, parallel computation has become necessary to take full advantage of the gains allowed by Moore’s law. Many scientific and engineering applications exhibit data parallelism but might not make full use of it. Some ubiquitous operations such that the dot product can easily be parallelized and then make good use of available hardware, like multi-core […]
Apr, 10
Fast seismic modeling and Reverse Time Migration on a GPU cluster
We have designed a fast parallel simulator that solves the acoustic wave equation on a GPU cluster. Solving the acoustic wave equation in an oil exploration industrial context aims at speeding up seismic modeling and Reverse Time Migration. We consider a finite difference approach on a regular mesh, in both 2D and 3D cases. The […]
Apr, 10
FFT Implementation on a Streaming Architecture
Fast Fourier Transform (FFT) is a useful tool for applications requiring signal analysis and processing. However, its high computational cost requires efficient implementations, specially if real time applications are used, where response time is a decisive factor. Thus, the computational cost and wide application range that requires FFT transforms has motivated the research of efficient […]
Apr, 10
Real-time Minute Change Detection on GPU for Cellular and Remote Sensor Imaging
Discovering subtle alterations of pairs of images taken from the same scene at different time intervals is called minute change detection problem. To achieve this goal, we have developed a framework that captures and highlights minute changes in digital images that are otherwise hidden to the human eye. Moreover, unnoticeable differences from image pairs that […]
Apr, 9
Depth-of-Field Blur Effects for First-Person Navigation in Virtual Environments
Depth-of-field blur effects are well-known depth cues in human vision. Computer graphics pipelines added DOF effects early to enhance imagery realism, but real-time VR applications haven’t yet introduced visual blur effects. The authors describe new techniques to improve blur rendering and report experimental results from a prototype video game implementation.
Apr, 9
Fast Motion Estimation on Graphics Hardware for H.264 Video Encoding
The video coding standard H.264 supports video compression with a higher coding efficiency than previous standards. However, this comes at the expense of an increased encoding complexity, in particular for motion estimation which becomes a very time consuming task even for today’s central processing units (CPU). On the other hand, modern graphics hardware includes a […]
Apr, 9
An approach of tool paths generation for CNC machining based on CUDA
This paper presents a new tool paths generation method for CNC machining based on GPU-CPU fusion calculation. CUDA, a general purpose parallel computing architecture, was provided by NVidia to resolve problems of mass data parallel computing. The new tool paths generation algorithm based on isoparametric method was redesigned to use CUDA. The final comparison experiment […]
Apr, 9
Scalable instruction set simulator for thousand-core architectures running on GPGPUs
Simulators are still the primary tools for development and performance evaluation of applications running on massively parallel architectures. However, current virtual platforms are not able to tackle the complexity issues introduced by 1000-core future scenarios. We present a fast and accurate simulation framework targeting extremely large parallel systems by specifically taking advantage of the inherent […]
Apr, 9
Kernel Fusion: An Effective Method for Better Power Efficiency on Multithreaded GPU
As one of the most popular accelerators, Graphics Processing Unit (GPU) has demonstrated high computing power in several application fields. On the other hand, GPU also produces high power consumption and has been one of the most largest power consumers in desktop and supercomputer systems. However, software power optimization method targeted for GPU has not […]
Apr, 9
A simple and efficient way to compute depth maps for multi-view videos
This paper deals with depth maps extraction from multi-view video. Contrary to standard stereo matching-based approaches, depth maps are computed here using optical flow estimations between consecutive views. We compare our approach with the one proposed in the Depth Estimation Reference Software (DERS) for normalization purposes in the ISO-MPEG 3DV group. Experiments conducted on sequences […]
Apr, 9
Data-parallel algorithms for large-scale real-time simulation of the cellular potts model on graphics processing units
In the following paper we present techniques for data-parallel execution of the cellular potts model (CPM) on graphics processing units (GPUs). We have developed data-structures and algorithms that are optimized to use available hardware resources on the GPU. To the best of our knowledge, this is the first attempt at using data-parallel techniques for simulating […]