Posts

Jun, 15

Pedestrian detection system based on stereo vision for mobile robot

This paper presents a novel Graphics Processing Unit (GPU)-based system for pedestrian detection with stereo vision in real images on a mobile robot. A process is designed that obtains a dense disparity map on a GPU for real-time applications and uses the edge properties of the scene to extract a region of interest (ROI). After extracting the […]
Jun, 15

The accelerating implementation of BLAST with stream processor

Sequence alignment is one of the most fundamental and important operations in bioinformatics. Through sequence alignment, we can find information about a sequence’s function, structure and evolution. BLAST is one of the most popular algorithms in the field of sequence alignment. In this paper, we have designed a GPU-based parallel BLAST algorithm and implemented it […]
Jun, 15

A compiler for high performance computing with many-core accelerators

We introduce a newly developed compiler for high performance computing using many-core accelerators. The high peak performance of such accelerators attracts researchers, who are always demanding faster computers. However, it is difficult to create an efficient implementation of an existing serial program for such accelerators even in the case of massively parallel problems. While existing […]
Jun, 15

Robust modified L2 local optical flow estimation and feature tracking

This paper describes a robust method for local optical flow estimation and KLT feature tracking performed on the GPU. To this end, we present an estimator based on the L^2 norm with robust characteristics. To increase robustness at discontinuities, we propose a strategy for adapting the size of the region used. The GPU implementation […]
Jun, 15

Xbox 360 System Architecture

This article covers the Xbox 360’s high-level technical requirements, a short system overview, and details of the CPU and the GPU. The Xbox 360 contains an aggressive hardware architecture and implementation targeted at game console workloads. The core silicon implements the product designers’ goal of providing game developers a hardware platform to implement their next-generation […]
Jun, 15

Keynote address: Immersive exploration of large datasets

Scientists, engineers and physicians are now confronted with a fire hose of data. Immersive visualization environments provide these users with a novel way of interacting and reasoning with large datasets. They allow them to utilize the entirety of their visual bandwidth, effectively engulfing the user in the data and enabling collaborative interaction. We present a […]
Jun, 14

Real-time numerical dispersion compensation using graphics processing unit for Fourier-domain optical coherence tomography

Numerical dispersion compensation for both standard and full-range Fourier-domain optical coherence tomography (FD-OCT) has been implemented on the graphics processing unit (GPU) architecture. Data acquisition, processing and image display were performed on a multithreaded, CPU-GPU heterogeneous computing system. Real-time ultra-high-resolution full-range complex-conjugate-free FD-OCT imaging was demonstrated at 68.4 frames/s with a frame size […]
Jun, 14

FPGA Based High Performance and Scalable Block LU Decomposition Architecture

Decomposition of a matrix into lower and upper triangular matrices (LU decomposition) is a vital part of many scientific and engineering applications, and the block LU decomposition algorithm is an approach well suited to parallel hardware implementation. This paper presents an approach to speed up implementation of the block LU decomposition algorithm using FPGA hardware. […]
Jun, 14

Parallelizing Peptide-Spectrum scoring using modern graphics processing units

Tandem mass spectrometry is a powerful experimental tool used in molecular biology to determine the composition of protein mixtures. In a tandem mass experiment, peptide ion selection algorithms generally select only the most abundant peptide ions for further fragmentation. Because of this, the low-abundance proteins in a sample rarely get identified. A Real-Time Peptide-Spectrum Matching […]
Jun, 14

A Highly Scalable Solution of an NP-Complete Problem Using CUDA

NP-complete problems are among the most complex problems in computer science, but their vast range of real-world applications continually pushes scientists to explore new ways to solve them. We extended the original problem definition of the Boolean Satisfiability Problem to finding all satisfying solutions of a given problem instance and used massively parallel […]
Jun, 14

In Situ Power Analysis of General Purpose Graphical Processing Units

This paper presents an in situ method for profiling power consumption over time on general-purpose graphics processing units (GPGPUs). Based on this method, the power consumption of different modes of operation, such as data transfer between the GPU and host CPU, basic single-precision floating-point arithmetic operations (addition, subtraction, multiplication) on the multiprocessor units and […]
Jun, 14

Realistic real-time rendering for large-scale forest scenes

Fast rendering of a large-scale forest landscape scene is important in many applications, such as video games, Internet graphics applications, landscape or cityscape scene design and visualization, and virtual forestry. A challenge in virtual reality is realistic rendering of large-scale scenes consisting of complex plant models. A series of level-of-detail tree models are […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors