5525

Posts

Sep, 3

Interactive Reaction-Diffusion on Surface Tiles

This paper proposes to perform reaction-diffusion on surface tiles. The square tiles fit nicely and cost-effectively in GPU memory, whereas we also apply distortion minimization on tiles so as to precisely reduce the unbalanced scale and resolution problem of chemicals in the reaction- diffusion. The interconnection nature of tiles accounts for the surface topology, and […]
Sep, 3

High-Performance Computing with Accelerators

This issue of CiSE is based on work presented at the US National Science Foundation workshop, Path to Petascale: Adapting Geo/Chem/Astro Applications for Accelerators and Accelerator Clusters, held at the US National Center for Supercomputing Applications (NCSA) in early 2009. The workshop was designed to raise awareness about the emergence of accelerator-based high-performance computing (HPC) […]
Sep, 3

An 8.6 mW 25 Mvertices/s 400-MFLOPS 800-MOPS 8.91 mm Multimedia Stream Processor Core for Mobile Applications

For the demands of mobile multimedia applications, a stream processor core is designed with 8.91 mm2 area in 0.18 mum CMOS technology at 50 MHz. Several techniques and architectures are proposed to achieve high performance with low power consumption. First of all, an optimized core pipeline is designed with 2-issue VLIW architecture to achieve the […]
Sep, 3

Improving Scheduling Techniques in Heterogeneous Systems with Dynamic, On-Line Optimisations

Computational performance increasingly depends on parallelism, and many systems rely on heterogeneous resources such as GPUs and FPGAs to accelerate computationally intensive applications. However, implementations for such heterogeneous systems are often hand-crafted and optimised to one computation scenario, and it can be challenging to maintain high performance when application parameters change. In this paper, we […]
Sep, 3

Fused DTI/HARDI Visualization

High-angular resolution diffusion imaging (HARDI) is a diffusion weighted MRI technique that overcomes some of the decisive limitations of its predecessor, diffusion tensor imaging (DTI), in the areas of composite nerve fiber structure. Despite its advantages, HARDI raises several issues: complex modeling of the data, nonintuitive and computationally demanding visualization, inability to interactively explore and […]
Sep, 2

Using Graphics Processing Units for Logic Simulation of Electronic Designs

Logic simulation is the major verification technique used for electronic system designs. Speeding up logic simulation results in great savings and shorter time-to-market. We parallelize logic simulation using Graphics Processing Units (GPUs). We present a parallel cycle-based logic simulation algorithm that uses And Inverter Graphs (AIGs) as design representations. We partition the gates in the […]
Sep, 2

Using GPU to exploit parallelism on cryptography

In this article we explore the NVIDIA graphical processing units (GPU) computational power in cryptography using CUDA (Compute Unified Device Architecture) technology. CUDA makes the general purpose computing easy using the parallel processing presents in GPUs. To do this, the NVIDIA GPUs architectures and CUDA are presented, besides cryptography concepts. Furthermore, we do the comparison […]
Sep, 2

Generalized Voronoi Diagram Computation on GPU

We study the problem of using the GPU to compute the generalized Voronoi diagram (GVD) for higher-order sites, such as line segments and curves. This problem has applications in many fields, including computer animation, pattern recognition and so on. A number of methods have been proposed that use the GPU to speed up the computation […]
Sep, 2

A GPU Accelerated Algorithm for Compressive Sensing Based Image Super-Resolution

This paper presents a parallel algorithm designed for Super-resolution Image Reconstruction based on Compressive sensing in the ATI Stream platform. In the accelerating process, we select part of the serial program as the objects to be sped up according to the execution time of each stage, set appropriate parallel granularity to make full use of […]
Sep, 2

GPU-accelerated time-domain circuit simulation

Time-domain circuit simulation is often dominated by the transistor model evaluation time. An analysis of a test suite of 27 circuits shows 66% of the transient runtime is spent evaluating the core BSIM4 transistor model code. A modern graphics processing unit (GPU) is a highly paralled, high performance computer suitable for non-graphics tasks. Circuit simulation […]
Sep, 2

Accelerating String Matching Using Multi-Threaded Algorithm on GPU

Network Intrusion Detection System has been widely used to protect computer systems from network attacks. Due to the ever-increasing number of attacks and network complexity, traditional software approaches on uni-processors have become inadequate for the current high-speed network. In this paper, we propose a novel parallel algorithm to speedup string matching performed on GPUs. We […]
Sep, 2

GPU-based simulation of side-looking sonar images

This paper describes an implementation of a sonar image simulator optimized for running on a computer’s Graphics Processing Unit (GPU). GPUs are hardware-optimized to obtain maximum performance on computer graphics applications. Because these applications generally simulate focal plane images (i.e. optical systems, video, etc), some specific adaptations are required to render range images such as […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: