Ian Buck
The raw compute performance of today’s graphics processor is truly amazing. With peak performance of over 60 GFLOPS, the compute power of the graphics processor (GPU) dwarfs that of today’s commodity CPU at a price of only a few hundred dollars. As the programmability and performance of modern graphics hardware continues to increase, many researchers […]
View View   Download Download (PDF)   
Baoquan Liu, Li-Yi Wei, Ying-Qing Xu, Enhua Wu
We present an accelerated depth peeling algorithm for order-independent transparency rendering on graphics hardware. Unlike traditional depth peeling which only peels one layer of transparent pixels per rendering pass, our algorithm peels multiple layers simultaneously per rendering pass. Our acceleration is achieved via our fragment program which sorts and writes multiple fragment colors and depths […]
View View   Download Download (PDF)   
Christopher Zach, Mario Sormann, Konrad Karner
We present a high performance reconstruction approach, which generates true 3D models from multiple views with known camera parameters. The complete pipeline from depth map generation over depth image integration to the final 3D model visualization is performed on programmable graphics processing units (GPUs). The proposed pipeline is suitable for long image sequences and uses […]
View View   Download Download (PDF)   
Alan Brunton, Jiying Zhao
In this paper, we propose a real-time video watermarking system on programmable graphics hardware. Real-time video watermarking is important to the use of digital video in legal proceedings, security surveillance, new reportage and commercial video transactions. The watermarking scheme implemented here is based on Wong’s scheme for image watermarking, and is designed to detect and […]
View View   Download Download (PDF)   
J. Montrym, H. Moreton
Graphics processing units (GPUs) continue to take on increasing computational workloads and support interactive rendering that approaches cinematic quality. The architectural drivers for GPUs are programmability, parallelism, bandwidth, and memory characteristics. This article describes how one team approached the design problem.
View View   Download Download (PDF)   
Chi-Wing Fu, Liang Wan, Tien-Tsin Wong, Chi-Sing Leung
Omnidirectional videos are usually mapped to planar domain for encoding with off-the-shelf video compression standards. However, existing work typically neglects the effect of the sphere-to-plane mapping. In this paper, we show that by carefully designing the mapping, we can improve the visual quality, stability and compression efficiency of encoding omnidirectional videos. Here we propose a […]
View View   Download Download (PDF)   
Lei Pan, Lixu Gu, Jianrong Xu
As the fast development of GPU, people tend to use it for more general purposes than its original graphic related work. The high parallel computation capabilities of GPU are welcomed by programmers who work at medical image processing which always have to deal with a large scale of voxel computation. The birth of NVIDIAreg CUDAtrade […]
View View   Download Download (PDF)   
Manuel Ujaldon, Joel Saltz
The paper describes a set of strategies for mapping irregular codes onto commodity graphics hardware. We start identifying the resources that current GPUs contain for solving indirect array accesses entirely on hardware, like vertices, textures and color tables. We then show how multiple indirections can be mapped onto the graphics pipeline, basically taking advantage of […]
View View   Download Download (PDF)   
Ricardo Marroquim, Andre Maximo, Ricardo Farias, Claudio Esperanca
We present a practical approach for implementing the projected tetrahedra (PT) algorithm for interactive volume rendering of unstructured data using programmable graphics cards. Unlike similar works reported earlier, our method employs two fragment shaders, one for computing the tetrahedra projections and another for rendering the elements. We achieve interactive rates by storing the model in […]
Specifications GPU NV40/41/42 Stream Processing Units 5 Core Clock 325 MHz Memory Clock 600/700 MHz Effective Memory Clock 1200/1400 MHz Memory Type DDR Amount of memory 128/256 MB Memory Bandwidth 19.2/22.4 GB/sec Buswidth 256 bit Tech process 130/110 nm Interface PCIe x16/AGP 8X PS/VS version 3.0/3.0 DirectX compliance 9.0c Retail Cards Based On This Board […]
Kalyan S. Perumalla
Graphics cards, traditionally designed as accelerators for computer graphics, have evolved to support more general-purpose computation. General Purpose Graphical Processing Units (GPGPUs) are now being used as highly efficient, cost-effective platforms for executing certain simulation applications. While most of these applications belong to the category of timestepped simulations, little is known about the applicability of […]
View View   Download Download (PDF)   
Julius Ohmer, Frederic Maire, Ross Brown
Kernel methods such as kernel principal component analysis and support vector machines have become powerful tools for pattern recognition and computer vision. Unfortunately the high computational cost of kernel methods is a limiting factor for real-time classification tasks when running on the CPU of a standard PC. Over the last few years, commodity Graphics Processing […]
View View   Download Download (PDF)   
Page 1 of 41234

* * *

* * *

Follow us on Twitter

HGPU group

1666 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

338 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: