Posts
Apr, 26
Model-T: Rethinking the OS for terabit speeds
This paper presents Model-T, an OS network stack designed to scale to terabit rates through pipelined execution of micro operations. Model-T parallelizes execution on multicore chips and enforces lockstep processing to maximize the shared L2 data cache (d-cache) hit rate. Executing all operations without hitting main memory more than once (if at all) is the key design […]
Apr, 26
Research on ATI-CAL for accelerating FBP reconstruction
Accelerating CT reconstruction algorithms with general-purpose GPUs has attracted considerable attention in recent years. Many researchers have studied techniques for implementing CT reconstruction algorithms on different GPUs and in different code development environments to explore their acceleration capability and performance. This work investigates the performance of stream computing of filtered […]
Apr, 25
GPU accelerated fast FEM deformation simulation
In this paper we present a general FEM (finite element method) solution that enables fast dynamic deformation simulation on the newly available GPU (graphics processing unit) hardware with compute unified device architecture (CUDA) from NVIDIA. CUDA-enabled GPUs harness the power of 128 processors which allow data parallel computations. Compared to the previous GPGPU, it is […]
Apr, 25
A GPU implementation for two MIMO-OFDM detectors
Two real-valued signal models based on selective spanning with fast enumeration (SSFE) and layered orthogonal lattice detector (LORD) algorithms are implemented on an Nvidia graphics processing unit (GPU). A 2×2 multiple-input multiple-output (MIMO) antenna system with 16-quadrature amplitude modulation (16-QAM) is assumed. The chosen level update vector for SSFE is based on computer simulation results […]
Apr, 25
Parallel 3D Finite Difference Time Domain Simulations on Graphics Processors with Cuda
The parallel Finite Difference Time Domain (FDTD) method has been explored over the past few years because of the expensive computation its application requires. General Purpose Graphics Processing Units (GPGPU), especially the Compute Unified Device Architecture (CUDA) model, have offered an efficient and simple solution. This paper analyzes the parallel FDTD method and the CUDA architecture, presents […]
Apr, 25
MultiGPU computing using MPI or OpenMP
GPU computing follows the trend of GPGPU, driven by innovations in both hardware and programming languages made available to non-graphics programmers. Since some problems take a long time to solve or involve data quantities that do not fit on a single GPU, the logical continuation was to make use of multiple GPUs. In order […]
Apr, 25
A real time Breast Microwave Radar imaging reconstruction technique using simt based interpolation
Breast Microwave Radar (BMR) is a novel imaging modality that is capable of producing high-contrast images and can detect tumors of at least 4 mm. To properly visualize the responses from the breast structures, BMR data sets must be reconstructed. In this paper, a real-time BMR image formation technique is proposed. This approach is based […]
Apr, 25
Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs
The advent of general-purpose graphics processing units (GPGPUs) brings about a whole new platform for running numerically intensive applications at high speeds. Their multi-core architectures enable large degrees of parallelism via a massively multi-threaded environment. Molecular dynamics (MD) simulations are particularly well-suited for GPUs because their computations are easily parallelizable. Significant performance improvements are […]
Apr, 25
Parallelizing Motion JPEG 2000 with CUDA
Due to the rapid growth of graphics processing unit (GPU) processing capability, using the GPU as a coprocessor to assist the CPU in processing massive data has become indispensable. Nvidia’s CUDA general-purpose graphics processing unit (GPGPU) architecture can greatly benefit computationally expensive programs written in the single instruction, multiple thread (SIMT) style. Video encoding, to an extent, is an […]
Apr, 25
Financial Derivatives Modeling Using GPU’s
The architecture of the latest graphics processing unit (GPU) has surpassed the previous application-specific stream architecture. This has led to an architecture consisting of a number of uniform programmable units integrated on the same chip, which facilitates general-purpose computing beyond graphics processing. With the multiple programmable units executing in parallel, the latest GPU […]
Apr, 25
GPU-Based Background Illumination Correction for Blue Screen Matting
Separation of foreground objects from an almost constant backing color for video applications is still a common problem ([1]). For non-realtime situations there is a wide variety of different powerful mathematical approaches that can deal with most of the matting problems. For SD/HD studio realtime keyers most solutions are not applicable due to their algorithm […]
Apr, 25
Scalable Software Defined FM-radio receiver running on desktop computers
Software Defined Radios (SDRs) are increasingly attractive to replace common hardware solutions. Current SDRs are mostly part of communication systems using hardware front ends containing DSPs or FPGAs. Processing on CPUs only is not common due to the huge amount of processing resources required. Most current CPUs are not able to handle this. The goal […]