3914

Posts

May, 6

Real-time 3D registration of stereo-vision based range images using GPU

3D registration is a computer vision technique of aligning multi-view range images with respect to a reference co-ordinate system. Aligning range images is an important and time-complex step in complete 3D reconstruction. In this paper, we propose a real-time 3D registration technique by employing the accelerated computing power of GPU (graphic processing unit). In the […]
May, 6

Performance Evaluation of Optimized Implementations of Finite Difference Method for Wave Propagation Problems on GPU Architecture

The scattering of acoustic waves in non-homogeneous media has been of practical interest for the petroleum industry, mainly in the determination of new oil deposits. A family of computational models that represent this phenomenon is based on finite difference methods. The simulation of these phenomena demands a high computational cost. In this work we employ […]
May, 6

Efficient nearest-neighbor computation for GPU-based motion planning

We present a novel k-nearest neighbor search algorithm (KNNS) for proximity computation in motion planning algorithm that exploits the computational capabilities of many-core GPUs. Our approach uses locality sensitive hashing and cuckoo hashing to construct an efficient KNNS algorithm that has linear space and time complexity and exploits the multiple cores and data parallelism effectively. […]
May, 6

Robust Adaptive 3-D Segmentation of Vessel Laminae From Fluorescence Confocal Microscope Images and Parallel GPU Implementation

This paper presents robust 3-D algorithms to segment vasculature that is imaged by labeling laminae, rather than the lumenal volume. The signal is weak, sparse, noisy, nonuniform, low-contrast, and exhibits gaps and spectral artifacts, so adaptive thresholding and Hessian filtering based methods are not effective. The structure deviates from a tubular geometry, so tracing algorithms […]
May, 6

Mechanical Characterization and Performance Optimization for GPU Fan-Sink Cooling Module Assembly

Three GPU fan-sink cooling module assembly mounting mechanisms are mechanically characterized to determine the relationships between the clamping forces and screw torques. The first-order screw torque solutions are determined from the statistical regressions according to current industry recommendations. The screw tension force theoretical solution is derived for application to the finite-element model to assess the […]
May, 6

Large-scale multi-dimensional document clustering on GPU clusters

Document clustering plays an important role in data mining systems. Recently, a flocking-based document clustering algorithm has been proposed to solve the problem through simulation resembling the flocking behavior of birds in nature. This method is superior to other clustering algorithms, including k-means, in the sense that the outcome is not sensitive to the initial […]
May, 5

Towards accelerating molecular modeling via multi-scale approximation on a GPU

Research efforts to analyze biomolecular properties contribute towards our understanding of biomolecular function. Calculating non-bonded forces (or in our case, electrostatic surface potential) is often a large portion of the computational complexity in analyzing biomolecular properties. Therefore, reducing the computational complexity of these force calculations, either by improving the computational algorithm or by improving the […]
May, 5

Towards real time vision based UUV navigation using GPU technology

The last decade has witnessed the establishment of image processing as a viable means of aiding underwater navigation. However, many such systems are only implemented in pre-processing and offline due to their excessive computational demands. Real-time techniques often require special purpose hardware or impose limitations on the system to obtain real-time performance at the expense […]
May, 5

The implementation of Multi-Scale Retinex image enhancement algorithm based on GPU via CUDA

The MSR (Multi-Scale Retinex) image enhancement algorithm can produce best performance in most cases, but the computation load is very huge especially for large image. In this paper, an efficient approach is proposed to accelerate MSR image enhancement speed on GPU via CUDA (Compute Unified Device Architecture). Time consuming modules such as multi-scale Gaussian filter, […]
May, 5

K-nearest neighbor search: Fast GPU-based implementations and application to high-dimensional feature matching

The k-nearest neighbor (kNN) search problem is widely used in domains and applications such as classification, statistics, and biology. In this paper, we propose two fast GPU-based implementations of the brute-force kNN search algorithm using the CUDA and CUBLAS APIs. We show that our CUDA and CUBLAS implementations are up to, respectively, 64X and 189X […]
May, 5

Optimization and parameter exploration using GPU based FDTD solvers

Graphical processing units (GPU) has been documented for the implementation of the FDTD technique. The use of these specialized processors for the implementation of numerical codes has been shown to significantly speed up the execution of these codes over standard CPU based solvers. With the execution of the FDTD method being reduced to a matter […]
May, 5

Programming Challenges for the Implementation of Numerical Quadrature in Atomic Physics on FPGA and GPU Accelerators

Although the need for heterogeneous chips in high performance numerical computing was identified by Chillemi and co-authors in 2001 it is only over the past five years that it has emerged as the new frontier for HPC. In this environment one or more accelerators works symbiotically, on each node, with a multi-core CPU. Two such […]
Page 574 of 782« First...102030...572573574575576...580590600...Last »

* * *

* * *

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: AMD APP SDK 2.9

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: