Grzegorz Michalski, Norbert Sczygiol, Siergiei Leonov
This paper presents a simulation of the casting solidification process performed on graphics processors compatible with nVidia CUDA architecture. Indispensable for the parallel implementation of a computer simulation of the solidification process, it was necessary to modify the numerical model. The new approach shown in this paper allows the process of matrix building to be […]
View View   Download Download (PDF)   
Indira Munagani
A fingerprint matching algorithm with a novel set of matching parameters based on core points and triangular descriptors is proposed to discover rarity in fingerprints. The algorithm uses a mathematical and statistical approach to discover rare features in fingerprints which provides scientific validation for both ten-print and latent fingerprint evidence. A feature is considered rare […]
View View   Download Download (PDF)   
Chau-Yi Chou, Sheng-Hsiu Kuo, Chih-Wei Hsieh, Tsung-Che Tsai, Hsi-Ya Chang
Multi-core platforms become ubiquitous nowadays. Even laptops contain multi-core processors now. There are multiple cores in a chip or socket or die. A computing node contains multiple chips. Multi-core platforms are rapidly increasing and the number of cores on these platforms is increasing rapidly too. How to enjoy the benefits of parallel computing on the […]
View View   Download Download (PDF)   
Xinyan Zha, Sartaj Sahni
We develop GPU adaptations of the Aho-Corasick string matching algorithm for the the case when all data reside initially in the GPU memory and the results are to be left in this memory. We consider several refinements to a base GPU implementation and measure the performance gain from each refinement. Experiments conducted on an NVIDIA […]
View View   Download Download (PDF)   
S. Gorbunov, D. Rohr, K. Aamodt, T. Alt, H. Appelshauser, A. Arend, M. Bach, B. Becker, S. Bottger, T. Breitner, H. Busching, S. Chattopadhyay, J. Cleymans, I. Das,O. Djuvsland, H. Erdal, R. Fearick, O. S. Haaland, P. T. Hille, S. Kalcher, K. Kanaki, U. Kebschull, I. Kisel, M. Kretz, C. Lara, S. Lindal, V. Lindenstruth, A. A. Masoodi, G. Ovrebekk, R. Panse, J. Peschek, M. Ploskon, T. Pocheptsov, T. Rascanu, M. Richter, D. Rohrich, B. Skaali, T. Steinbeck, A. Szostak, J. Thader, T. Tveter, K. Ullaland, Z. Vilakazi, R. Weis, P. Zelnicek
The on-line event reconstruction in ALICE is performed by the High Level Trigger, which should process up to 2000 events per second in proton-proton collisions and up to 300 central events per second in heavy-ion collisions, corresponding to an input data stream of 30 GB/s. In order to fulfill the time requirements, a fast on-line […]
View View   Download Download (PDF)   
David B. Thomas, Wayne Luk
Artificial neural networks are a key tool for researchers attempting to understand and replicate the behaviour and intelligence found in biological neural networks. Software simulations offer great flexibility and the ability to select which aspects of biological networks to model, but are slow when operating on more complex biologically plausible models; while dedicated hardware solutions […]
View View   Download Download (PDF)   
Chih-Wei Hsieh, Sheng-Hsiu Kuo, Fang-An Kuo, Chau-Yi Chou
Multi-core platform enters the territory of high performance computing (HPC). Moreover, the NVIDA GT200 has 240 cores and performs thousands upon thousands of threads simultaneously. The role of the Graphics Processing Units (GPU)accelerator has become more and more important for scientific computing and computational fluid dynamic (CFD) to obtain result quickly and efficiently. In this […]
Du Liuge, Li Kang, Kong Fanmin
Parallel Finite Difference Time Domain (FDTD) method has been explored over past few years because of the expensive computation needed for its application. And General Purpose Graphics Processing Units (GPGPU), especially Computer Unit Device Architecture (CUDA) model, has been offered an efficient and simple solution. This paper analyzes parallel FDTD method and CUDA architecture, presents […]
Christian Obrecht, Frederic Kuznik, Bernard Tourancheau, Jean-Jacques Roux
Emerging many-core processors, like CUDA capable nVidia GPUs, are promising platforms for regular parallel algorithms such as the Lattice Boltzmann Method (LBM). Since the global memory for graphic devices shows high latency and LBM is data intensive, the memory access pattern is an important issue for achieving good performances. Whenever possible, global memory loads and […]
Tobias Brandvik, Graham Pullan
A new three-dimensional Navier-Stokes solver for flows in turbomachines has been developed. The new solver is based on the latest version of the Denton codes but has been implemented to run on graphics processing units (GPUs) instead of the traditional central processing unit. The change in processor enables an order-of-magnitude reduction in run-time due to […]
View View   Download Download (PDF)   
Ji Xu, Ying Ren, Wei Ge, Xiang Yu, Xiaozhen Yang, Jinghai Li
Molecular dynamics (MD) simulation is a powerful computational tool to studythe behavior of macromolecular systems. But many simulations of this field arelimited in spatial or temporal scale by the available computational resource.In recent years, graphics processing unit (GPU) provides unprecedentedcomputational power for scientific applications. Many MD algorithms suit withthe multithread nature of GPU. In this […]
View View   Download Download (PDF)   

* * *

* * *

Follow us on Twitter

HGPU group

1665 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

339 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: