Posts
Nov, 16
Using GPUs to Crack Android Pattern-based Passwords
We investigate the strength of patterns as secret signatures in Android’s pattern based authentication mechanism. Parallelism of GPU is exploited to exhaustively search for the secret pattern. Typically, searching for a pattern, composed of a number of nodes and edges, requires an exhaustive search for the pattern. In this work, we show that the use […]
Nov, 14
SAGE: Self-Tuning Approximation for Graphics Engines
Approximate computing, where computation accuracy is traded off for better performance or higher data throughput, is one solution that can help data processing keep pace with the current and growing overabundance of information. For particular domains such as multimedia and learning algorithms, approximation is commonly used today. We consider automation to be essential to provide […]
Nov, 14
Real-Time Screen Space Rendering of Cartoon Water
A non-photorealistic rendering style is constantly chosen in games to emphasize the fantasy of the story. In this scenario, the presence of natural elements such as water is common. Simulation and rendering of water in 3D worlds still presents some technical challenges though. This paper describes an approach to render cartoon style water in real […]
Nov, 14
Optical Flow via Locally Adaptive Fusion of Complementary Data Costs
Many state-of-the-art optical flow estimation algorithms optimize the data and regularization terms to solve ill-posed problems. In this paper, in contrast to the conventional optical flow framework that uses a single or fixed data model, we study a novel framework that employs locally varying data term that adaptively combines different multiple types of data models. […]
Nov, 14
On the origin of yet another channel
Cryptanalysis of a cryptographic function like stream, block or hash function usually requires human cryptanalytical skills and labour. However, some automation is possible – e.g., by randomness testing suites like NIST/Diehard that can be applied to test statistical properties of cryptographic function outputs. Yet such testing suites are limited only to predefined statistical functions.We propose […]
Nov, 14
A finite volume approach for the simulation of nonlinear dissipative acoustic wave propagation
A form of the conservation equations for fluid dynamics is presented, deduced using slightly less restrictive hypothesis than those necessary to obtain the well known Westervelt equation. This formulation accounts for full wave diffraction, nonlinearity, and thermoviscous dissipative effects. A CLAWPACK based, 2D finite volume method using the Roe linearization was implemented to obtain numerically […]
Nov, 13
Designing Scientific Applications on GPUs
Many of today’s complex scientific applications now require a vast amount of computational power. General purpose graphics processing units (GPGPUs) enable researchers in a variety of fields to benefit from the computational power of all the cores available inside graphics cards. Understand the Benefits of Using GPUs for Many Scientific Applications: Designing Scientific Applications on […]
Nov, 13
Anatomy of High-Performance Many-Threaded Matrix Multiplication
BLIS is a new framework for rapid instantiation of the BLAS. We describe how BLIS extends the "GotoBLAS approach" to implementing matrix multiplication (GEMM). While GEMM was previously implemented as three loops around an inner kernel, BLIS exposes two additional loops within that inner kernel, casting the computation in terms of the BLIS microkernel so […]
Nov, 13
Utilizing massive parallelism in decoding of modern error-correcting codes for accelerating communication systems simulations
In this paper a novel approximate algorithm for massively-parallel decoding of trellis based error correcting codes (ECC) is presented. The potential effect of using such optimized decoder on acceleration of simulations of modern communication systems implementing the most recent communication standards, such as LTE-A (Long Term Evolution – Advanced) is evaluated quantitatively by presenting an […]
Nov, 13
GPU Enhancement of the Trigger to Extend Physics Reach at the Large Hadron Collider
At the Large Hadron Collider (LHC), the trigger systems for the detectors must be able to process a very large amount of data in a very limited amount of time, so that the nominal collision rate of 40 MHz can be reduced to a data rate that can be stored and processed in a reasonable […]
Nov, 13
Lattice Simulations using OpenACC compilers
OpenACC compilers allow one to use Graphics Processing Units without having to write explicit CUDA codes. Programs can be modified incrementally using OpenMP like directives which causes the compiler to generate CUDA kernels to be run on the GPUs. In this article we look at the performance gain in lattice simulations with dynamical fermions using […]
Nov, 13
Indexing million of packets per second using GPUs
Network traffic recorders are devices that record massive volumes of network traffic for security applications, like retrospective forensic investigations. When deployed over very high-speed networks, traffic recorders must process and store millions of packets per second. To enable interactive explorations of such large traffic archives, packet indexing mechanisms are required. Indexing packets at wire rates […]