Posts
Jul, 27
A very fast census-based stereo matching implementation on a graphics processing unit
In this paper a very fast graphics processing unit implementation of a local, census-correlation-based stereo matching algorithm is presented. In comparison to absolute or squared difference correlation techniques, the census transform is computationally more expensive, which motivated a GPU-based implementation. Due to the parallel architecture of modern graphics cards, complex algorithms […]
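For readers unfamiliar with the census transform named in the abstract above, here is a minimal CPU sketch in Python. It is not the paper's GPU implementation. The function names (`census_transform`, `hamming`, `best_disparity`), the 3x3 window, and the single-pixel matching cost without window aggregation are all assumptions made for illustration.

```python
def census_transform(img, radius=1):
    """Encode each interior pixel as a bit string.

    A bit is 1 where the neighbour is darker than the centre pixel.
    Border pixels (closer than `radius` to the edge) are left as 0.
    """
    h, w = len(img), len(img[0])
    codes = [[0] * w for _ in range(h)]
    for y in range(radius, h - radius):
        for x in range(radius, w - radius):
            centre = img[y][x]
            bits = 0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    if dy == 0 and dx == 0:
                        continue  # the centre is not compared with itself
                    darker = 1 if img[y + dy][x + dx] < centre else 0
                    bits = (bits << 1) | darker
            codes[y][x] = bits
    return codes


def hamming(a, b):
    """Number of differing bits between two census codes."""
    return bin(a ^ b).count("1")


def best_disparity(left_codes, right_codes, y, x, max_disp):
    """Pick the disparity whose census codes differ in the fewest bits.

    Assumption: a left-image pixel at column x matches a right-image
    pixel at column x - d. Ties keep the smallest disparity.
    """
    best_d, best_cost = 0, None
    for d in range(max_disp + 1):
        if x - d < 0:
            break
        cost = hamming(left_codes[y][x], right_codes[y][x - d])
        if best_cost is None or cost < best_cost:
            best_d, best_cost = d, cost
    return best_d
```

Because every pixel's code and every candidate disparity can be evaluated independently, this kind of computation parallelizes naturally, which is consistent with the motivation given in the abstract.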
Jul, 26
Efficient Rasterization for Outdoor Radio Wave Propagation
Conventional beam tracing can be used for solving global illumination problems. It is an efficient algorithm and performs very well when implemented on the GPU. This allows us to apply the algorithm in a novel way to the problem of radio wave propagation. The simulation of radio waves is conceptually analogous to the problem of […]
Jul, 26
Scene independent real-time indirect illumination
A novel method for real-time simulation of indirect illumination is presented in this paper. The method, which we call direct radiance mapping (DRM), is based on basal radiance calculations and does not impose any restrictions on scene geometry or dynamics. This makes the method tractable for real-time rendering of arbitrary dynamic environments and for interactive […]
Jul, 26
Data-Aware Task Scheduling on Multi-accelerator Based Platforms
To fully tap into the potential of heterogeneous machines composed of multicore processors and multiple accelerators, simple offloading approaches in which the main trunk of the application runs on regular cores while only specific parts are offloaded on accelerators are not sufficient. The real challenge is to build systems where the application would permanently spread […]
Jul, 26
Interactive transparency rendering for large CAD models
Transparency is an important graphics effect that can be used to significantly increase the realism of the rendered scene or to enable more effective visual inspection in engineering visualization. In this paper, we propose achieving interactive transparency rendering of a static scene by sorting the triangles in back-to-front order on CPU and supplying the sorted […]
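The back-to-front ordering mentioned in the abstract above can be sketched as follows. This is a hedged illustration only: sorting by centroid distance from the eye is a common approximation assumed here, not necessarily the paper's criterion, and the names `centroid` and `sort_back_to_front` are hypothetical.

```python
def centroid(tri):
    """Average of the three (x, y, z) vertices of a triangle."""
    return tuple(sum(v[i] for v in tri) / 3.0 for i in range(3))


def sort_back_to_front(triangles, eye):
    """Return triangles ordered from farthest to nearest to `eye`.

    Uses squared centroid distance as the sort key (an approximation;
    intersecting or very large triangles can still be ordered wrongly).
    """
    def dist2(tri):
        c = centroid(tri)
        return sum((c[i] - eye[i]) ** 2 for i in range(3))

    return sorted(triangles, key=dist2, reverse=True)
```

Drawing in this order lets each nearer transparent triangle blend over the ones already drawn behind it.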
Jul, 26
Discontinuous Galerkin Time Domain for Maxwell’s equations on GPUs
In this paper, we discuss our approach to the GPU implementation of the Discontinuous Galerkin Time-Domain (DGTD) method to solve the time-dependent Maxwell’s equations. We exploit the inherent DGTD parallelism and combine the GPU computing capabilities with the benefits of a local time-stepping strategy. The combination results in a significant increase in efficiency and reduction […]
Jul, 26
High-quality surface splatting on today’s GPUs
Point-based geometries evolved into a valuable alternative to surface representations based on polygonal meshes, because of their conceptual simplicity and superior flexibility. Elliptical surface splats were shown to allow for high-quality anti-aliased rendering by sophisticated EWA filtering. Since the publication of the original software-based EWA splatting, several authors tried to map this technique to the […]
Jul, 25
Exploring Novel Parallelization Technologies for 3-D Imaging Applications
Multi-dimensional imaging techniques involve the processing of high resolution images commonly used in medical, civil and remote-sensing applications. A barrier commonly encountered in this class of applications is the time required to carry out repetitive operations on large matrices. Partitioning these large datasets can help improve performance, and lends the data to more efficient parallel […]
Jul, 25
An energy model for graphics processing units
We present an energy model for a graphics processing unit (GPU) that is based on the amount and type of work performed in various parts of the unit. By designing and running directed tests on a GPU, we measure the energy consumed when performing different arithmetic and memory operations, allowing us to accurately predict the […]
Jul, 25
High-Throughput Sequence Translation Using CUDA
Now that high-throughput sequencing technologies can yield a huge volume of sequence data at an affordable price, the growth of sequence data is outpacing the performance improvement of CPUs, which means that even trivial sequence analysis tasks may challenge existing serial programs running purely on the CPU. This paper describes a highly parallel sequence translation program running on a […]
Jul, 25
A Hardware-Accelerated Patch Search Engine for Image Completion
This paper proposes a GPU-accelerated patch search engine that efficiently fills the unknown regions of an image caused by replacement or removal of part of the foreground. Previous approaches, such as inpainting and texture synthesis, are either fast, but not applicable for small-scale regions, or slow, but fill large regions with good quality. The algorithm […]
Jul, 25
Interactive Visualization of Volumetric White Matter Connectivity in DT-MRI Using a Parallel-Hardware Hamilton-Jacobi Solver
In this paper we present a method to compute and visualize volumetric white matter connectivity in diffusion tensor magnetic resonance imaging (DT-MRI) using a Hamilton-Jacobi (H-J) solver on the GPU (graphics processing unit). Paths through the volume are assigned costs that are lower if they are consistent with the preferred diffusion directions. The proposed method […]

