Posts
Sep, 11
Translation-invariant two-dimensional discrete wavelet transform on graphics processing units
The Discrete Wavelet Transform (DWT) is used in several signal and image processing applications. Due to the computational expense various approaches have been proposed. One approach is using graphics processing units (GPUs) as stream processors to speed up the calculation of the DWT. This paper presents a GPU implementation of the translation-invariant wavelet transform computed […]
Sep, 11
SSLPV: subsurface light propagation volumes
This paper presents the Subsurface Light Propagation Volume (SSLPV) method for real-time approximation of subsurface scattering effects in dynamic scenes with changing mesh topology and lighting. SSLPV extends the Light Propagation Volume (LPV) technique for indirect illumination in video games. We introduce a new consistent method for injecting flux from point light sources into an […]
Sep, 11
High performance technique for database applications using a hybrid GPU/CPU platform
Many database applications, such as sequence comparing, sequence searching, and sequence matching, etc, process large database sequences. we introduce a novel and efficient technique to improve the performance of database applications by using a Hybrid GPU/CPU platform. In particular, our technique solves the problem of the low efficiency resulting from running short-length sequences in a […]
Sep, 11
Collision-streams: fast GPU-based collision detection for deformable models
We present a fast GPU-based streaming algorithm to perform collision queries between deformable models. Our approach is based on hierarchical culling and reduces the computation to generating different streams. We present a novel stream registration method to compact the streams and efficiently compute the potentially colliding pairs of primitives. We also use a deferred front […]
Sep, 11
Texture compression of light maps using smooth profile functions
Light maps have long been a popular technique for visually rich real-time rendering in games. They typically contain smooth color gradients which current low bit rate texture compression techniques, such as DXT1 and ETC2, do not handle well. The application writer must therefore choose between doubling the bit rate by choosing a codec such as […]
Sep, 11
Lattice-boltzmann water waves
A model for real-time generation of deep-water waves is suggested. It is based on a lattice-Boltzmann (LB) technique. Computation of wave dynamics and (ray-traced) rendering for a lattice of size 10242 can be carried out simultaneously on a single graphics card at 25 frames per second. In addition to the computational speed, the LB technique […]
Sep, 11
A design pattern language for engineering (parallel) software: merging the PLPP and OPL projects
Parallel programming is stuck. To make progress, we need to step back and understand the software people wish to engineer. We do this with a design pattern language. This paper provides background for a lively discussion of this pattern language. We present the context for the problem, the layers in the design pattern language, and […]
Sep, 11
Automatic safety proofs for asynchronous memory operations
We present a work-in-progress proof system and tool, based on separation logic, for analysing memory safety of multicore programs that use asynchronous memory operations.
Sep, 11
Lightweight modular staging: a pragmatic approach to runtime code generation and compiled DSLs
Software engineering demands generality and abstraction, performance demands specialization and concretization. Generative programming can provide both, but the effort required to develop high-quality program generators likely offsets their benefits, even if a multi-stage programming language is used. We present lightweight modular staging, a library-based multi-stage programming approach that breaks with the tradition of syntactic quasi-quotation […]
Sep, 11
Early experiences with the intel many integrated cores accelerated computing technology
We report on early programming experiences with the Intel Many Integrated Core (Intel MIC) Co-processor. This new and x86 based technology is Intel’s answer to GPU-based accelerators by NVIDIA, AMD and others. Accelerators have generally sparked interest in the HPC community because they have the potential to significantly increase the compute power of the next […]
Sep, 11
Importance-driven compositing window management
In this paper we present importance-driven compositing window management, which considers windows not only as basic rectangular shapes but also integrates the importance of the windows’ content using a bottom-up visual attention model. Based on this information, importance-driven compositing optimizes the spatial window layout for maximum visibility and interactivity of occluded content in combination with […]
Sep, 11
Task Superscalar: An Out-of-Order Task Pipeline
We present emph{Task Super scalar}, an abstraction of instruction-level out-of-order pipeline that operates at the task-level. Like ILP pipelines, which uncover parallelism in a sequential instruction stream, task super scalar uncovers task-level parallelism among tasks generated by a sequential thread. Utilizing intuitive programmer annotations of task inputs and outputs, the task super scalar pipeline dynamically […]