Shadi Alawneh, Dennis Peters
General Purpose computing on Graphics Processor Units (GPGPU) brings massively parallel computing (hundreds of compute cores) to the desktop at a reasonable cost, but requires that algorithms be carefully designed to take advantage of this power. The present work explores the possibilities of CUDA (NVIDIA Compute Unified Device Architecture) using a GPGPU approach for 2D Triangulation […]
A. Grundhofer, O. Bimber
Recent radiometric compensation techniques make it possible to project images onto colored and textured surfaces. This is realized with projector-camera systems by scanning the projection surface on a per-pixel basis. Using the captured information, a compensation image is calculated that neutralizes geometric distortions and color blending caused by the underlying surface. As a result, the […]
Guillem Pratx, Garry Chinn, Frezghi Habte, Peter Olcott, Craig Levin
Advanced list-mode image reconstruction algorithms such as fully 3D list-mode ordered-subset expectation maximization (OSEM) are needed to exploit the potential performance of high-resolution PET systems with depth-of-interaction capabilities. However, such algorithms are computationally intensive. With the aim to accelerate list-mode 3D-OSEM, we investigated the use of graphics processing units (GPUs). Primarily designed to deliver high-definition […]
Wallace Lages, Carlucio Cordeiro, Dorgival Guedes
We present an architecture for rendering multiple views efficiently on a cluster of GPUs. The original scene is sampled by virtual cameras which are used later to reconstruct the desired views. We show that this image-based approach can be very scalable and support rendering at interactive rates.
Fumihiko Ino, Yuki Kotani, Yuma Munekawa, Kenichi Hagihara
This paper presents a parallel system capable of accelerating biological sequence alignment on the graphics processing unit (GPU) grid. The GPU grid in this paper is a desktop grid system that utilizes idle GPUs and CPUs in the office and home. Our parallel implementation employs a master-worker paradigm to accelerate Liu’s OpenGL-based algorithm that runs […]
Sammy Rogmans, Jiangbo Lu, Gauthier Lafruit
This paper presents the system-level overview of a real-time image-based rendering framework performing multiple intermediate view synthesis, completely on the Graphics Processing Unit (GPU). The software design achieves high performance, yet maintains flexibility and ease of development through a hierarchical layered architecture. The framework implements the intermediate view synthesis by a chain of consecutive processing […]
Daniel A. Balciunas, Lucas P. Dulley, Marcelo K. Zuffo
We implemented a pipelined rendering system that pre-renders a reduced set of a scene using the raster method built in the graphics hardware. The computation performed by the graphics card is used as an estimate for evaluating the initial traversal points for a ray caster running on the CPU. This procedure replaces the use of […]
Zhiguo Xu, Rajive Bagrodia
High-fidelity simulations of mixed wired and wireless network systems are dependent on detailed simulation models, especially in the lower layers of the network stack. However, detailed modeling can result in prohibitive computation cost. In recent years, commercial graphics cards (GPUs) have drawn attention from the general computing community due to their superior computation capability. In […]
Zachary K Baker, Reid Porter
Vector and data-flow processors are particularly strong at dense, regular computation. Sparse, irregular data layouts cause problems because their unpredictable data access patterns prevent computational pipelines from filling effectively. A number of algorithms in image processing have been proposed which are not dense, and instead apply local neighborhood operations to a sparse, irregular set of […]
Yang Su, Zhijie Xu
The discrete wavelet transform (DWT) has been extensively used for image compression and denoising in the areas of image processing and computer vision. However, the intensive computation of DWT due to its inherent multilevel data decomposition and reconstruction operations brings a bottleneck that drastically reduces its performance and implementations for real-time applications when facing large […]
Thomas M. DuBois, Bryant Lee, Yi Wang, Marc Olano, Uzi Vishkin
The shading processors in graphics hardware are becoming increasingly general-purpose. We test, through simulation and benchmarking, the potential performance impact of replacing these processors with a fully general-purpose parallel processor, without the fixed-function graphics hardware legacy of current graphics processing units (GPUs). The representative general-purpose processor we test against is XMT (for explicit multi-threading), a […]
Naveen Kumar Bolla, P. J. Narayanan
The primitives of point-based representations are independent but are rendered using surfels, which approximate the immediate neighborhood of each point linearly. A large number of surfels are needed to convey the exact shape. Higher-order approximations of the local neighborhood have the potential to represent the shape using fewer primitives, simultaneously achieving higher rendering speeds. In […]

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL applications at hgpu.org. We provide 1 minute of computer time per run on two nodes equipped with AMD and nVidia graphics processing units, as listed below. There is no limit on the number of runs.

The platforms are:

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

A completed OpenCL project should be uploaded via the User dashboard (see the instructions and example there). The terminal output logs from compilation and execution will be provided to the user.

The information sent to hgpu.org will be treated according to our Privacy Policy.

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors