Posts
Jul, 8
Fast GPU Garment Simulation and Collision Detection
This paper describes a technique for garment simulation and collision detection implemented on modern Graphics Processors (GPU). It exploits a mass-spring cloth model with velocity modification approach to overcome the super-elasticity. Our novel algorithms for cloth-body and cloth-cloth collision detection and response are based on image-space interference tests. For collision detection a 3D texture is […]
Jul, 8
Interactive BRDF Estimation for Mixed-Reality Applications
Recent methods in augmented reality allow simulating mutual light interactions between real and virtual objects. These methods are able to embed virtual objects in a more sophisticated way than previous methods. However, their main drawback is that they need a virtual representation of the real scene to be augmented in the form of geometry and […]
Jul, 8
Teaching Parallel Programming Models on a Shallow-Water Code
We present a software package that supports teaching different parallel programming models in a computational science and engineering context. It implements a Finite Volume solver for the shallow water equations, with application to tsunami simulation in mind. The numerical model is kept simple, using patches of Cartesian grids as computational domain, which can be connected […]
Jul, 8
Utilizing GPGPU in Computer Emulation
The article deals with the idea of computer emulation using the GPGPU technology in order to get performance improvements. Basic assumptions for using stream processing in computer emulation effectively are discussed and the structure of an emulator, together with the emulation technique are proposed. The emulator structure, in this case, is of distributed nature, so […]
Jul, 8
GPU-Optimized Molecular Dynamics Simulations
Protein and RNA biomolecular folding and assembly problems have important applications because misfolding events are associated with diseases like Alzheimer’s and Parkinson’s. However, simulating biologically relevant sized biomolecules on timescales that correspond to biological functions is an extraordinary challenge due to computational bottlenecks that are mainly involved in force calculations. We briefly review the molecular […]
Jul, 7
Random Walks for Image Cosegmentation
We recast the Cosegmentation problem using Random Walker (RW) segmentation as the core segmentation algorithm, rather than the traditional MRF approach adopted in the literature so far. Our formulation is similar to previous approaches in the sense that it also permits Cosegmentation constraints (which impose consistency between the extracted objects from >= 2 images) using […]
Jul, 7
Implementing Interactive 3D Segmentation on CUDA Using Graph-Cuts and Watershed Transformation
In this paper we present a novel scheme for a very fast implementation of volumetric segmentation using graph cuts. The main benefit of this work is our approach to non-grid region adjacency processing on CUDA which to our knowledge has not been done yet in any efficient way. The watershed transform radically reduces the number […]
Jul, 7
Fast algorithm of ray tracing based on KD-tree structure
According to the GPU storage characteristics, a parallel ray tracing algorithm is proposed in this paper, in which the KD-tree is adopted as the accelerating structure. The nodes are continuously spitted using intermediate plane of each axis, respectively, while the built KD-tree is stored in the texture memory of GPUs. The triangles in a scene […]
Jul, 7
Efficient Parallel RSA Decryption Algorithm for Many-core GPUs with CUDA
Cryptography is an important technique among various applications. In the telecommunication, cryptography is necessary when an untrusted medium is communicated in the network. RSA is a public-key cryptography algorithm to use a pair (N, E) as the public key and D as the private key. The N is the product of two large prime numbers […]
Jul, 7
Parallel Particle Swarm Optimization on Graphical Processing Unit for Pose Estimation
In this paper, we present a parallel implementation of the Particle Swarm Optimization (PSO) on GPU using CUDA. By fully utilizing the processing power of graphic processors, our implementation provides a speedup of 215x compared to a sequential implementation on CPU. This speedup is significantly superior to what has been reported in recent papers and […]
Jul, 6
Sparselet Models for Efficient Multiclass Object Detection
We develop intermediate representations for deformable part models, and show that such representations have favorable performance characteristics for multi-class problems where the number of classes is large. Our model uses sparse coding of part filters to represent each filter as a sparse linear combination of shared dictionary elements. This leads to an universal set of […]
Jul, 6
Parallel Memory Defragmentation on a GPU
High-throughput memory management techniques such as malloc/free or mark-and-sweep collectors often exhibit memory fragmentation leaving allocated objects interspersed with free memory holes. Memory defragmentation removes such holes by moving objects around in memory so that they become adjacent (compaction) and holes can be merged (coalesced) to form larger holes. However, known defragmentation techniques are slow. […]