Vineeth Thamarassery Mekkat
We are witnessing a tremendous amount of change in the design of the modern microprocessor. With dozens of CPU cores on-chip recent multicore processors, the search for thread-level parallelism (TLP) is more significant than ever. In parallel, a very different processor architecture has emerged that aims to extract parallelism at an entirely different scale. Originally […]
View View   Download Download (PDF)   
Revanth N R, P. J. Narayanan
Graphics models are getting increasingly bulkier with detailed geometry, textures, normal maps, etc. There is a lot of interest to model and navigate through detailed models of large monuments. Many monuments of interest have both rich detail and large spatial extent. Rendering them for navigation on a single workstation is practically impossible, even given the […]
View View   Download Download (PDF)   
Daniel Cederman
The convergence of highly parallel many-core graphics processors with conventional multi-core processors is becoming a reality. To allow algorithms and data structures to scale efficiently on these new platforms, several important factors needs to be considered. (i) The algorithmic design needs to utilize the inherent parallelism of the problem at hand. Sorting, which is one […]
View View   Download Download (PDF)   
Kan Chen, Henry Johan
Simulating grass field in real-time has many applications, such as in virtual reality and games. Modeling accurate grass-grass, grass-object and grass-wind interactions requires a high computational cost. In this paper, we present a method to simulate grass field in real-time by considering grass field as a two dimensional grid-based continuum and shifting the complex interactions […]
View View   Download Download (PDF)   
Seung-Hun Yoo, Sung-Up Jo, Ki-Young Choi, Chang-Sung Jeong
This paper presents a framework for pixel-level multisensor image fusion algorithm using graphics hardware. When it comes to the fusion technology of recent times, not only its visibility through information maximization which is brought about by the progress in sensor technology but also the importance of fusion processing speed has increased. The GPU provides high-performance […]
View View   Download Download (PDF)   
Carlos Aguilar Melchor, Benoit Crespin, Philippe Gaborit, Vincent Jolivet, Pierre Rousseau
A Private Information Retrieval (PIR) scheme is a protocol in which a user retrieves a record out of n from a replicated database, while hiding from the database which record has been retrieved, as long as the different replicas do not collude. A specially interesting sub-field of research, called single-database PIR, deals with the schemes […]
Specifications GPU G84 FLOPS 139 GFLOPS Stream Processing Units 32 Core Clock 675 MHz Memory Clock 1450 MHz Effective Memory Clock 2900 MHz Memory Type GDDR3 Amount of memory 256/512 MB Memory Bandwidth 32 GB/sec Buswidth 128 bit Tech process 80 nm Interface PCIe 1.0 x16 PS/VS version 4.1/4.1 DirectX compliance 10 Retail Cards Based […]
Tony Cheneau, Aymen Boudguiga, Maryline Laurent
Cryptographically Generated Addresses (CGA) are today mainly used with the Secure Neighbor Discovery Protocol (SEND). Despite CGA generalization, current standards only show how to construct CGA with the RSA algorithm and SHA-1 hash function. This limitation may prevent new usages of CGA and SEND in mobile environments where nodes are energy and storage limited. In […]
View View   Download Download (PDF)   
Raul Cabido, David Concha, Juan Jose Pantrigo, Antonio S. Montemayor
This paper presents a novel application of the GPU processing power to a very computationally demanding articulated human body tracking problem in a view-based approach. This work includes some optimizations at the algorithmic level as well as some tricks at the implementation level using OpenGL and shader programming. An underlying particle filter framework is combined […]
View View   Download Download (PDF)   
James McCann, Nancy S. Pollard
We present an image editing program which allows artists to paint in the gradient domain with real-time feedback on megapixel-sized images. Along with a pedestrian, though powerful, gradient-painting brush and gradient-clone tool, we introduce an edge brush designed for edge selection and replay. These brushes, coupled with special blending modes, allow users to accomplish global […]
View View   Download Download (PDF)   
Victor Jimenez, Lluis Vilanova, Isaac Gelado, Marisa Gil, Grigori Fursin, Nacho Navarro
Heterogeneous architectures are currently widespread. With the advent of easy-to-program general purpose GPUs, virtually every recent desktop computer is a heterogeneous system. Combining the CPU and the GPU brings great amounts of processing power. However, such architectures are often used in a restricted way for domain-specific applications like scientific applications and games, and they tend […]
View View   Download Download (PDF)   
Daniel Cederman, Philippas Tsigas
In this paper we present GPU-Quicksort, an efficient Quicksort algorithm suitable for highly parallel multi-core graphics processors. Quicksort has previously been considered as an inefficient sorting solution for graphics processors, but we show that GPU-Quicksort often performs better than the fastest known sorting implementations for graphics processors, such as radix and bitonic sort. Quicksort can […]
View View   Download Download (PDF)   
Page 1 of 212

* * *

* * *

Follow us on Twitter

HGPU group

1666 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

338 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: