Chetan Rokhade
Many algorithms of practical interest require evaluation of a given function F on each point of a domain consisting of all k-partitions of an N-element set. Because the cardinality of such a domain grows rapidly for fixed k and increasing N, such algorithms are appealing candidates for parallelization; but to implement such parallelization efficiently in […]
Alberto Guillen, M. Isabel Garcia Arenas, Mark van Heeswijk, Dusan Sovilj, Amaury Lendasse, Luis Javier Herrera, Hector Pomares, Ignacio Rojas
Feature or variable selection remains an unsolved problem because evaluating the entire solution space is infeasible. Several heuristic algorithms have been proposed so far with successful results. However, these algorithms were not designed for very large datasets, which makes their execution impossible due to memory and time limitations. This […]
Michael Schwall, Friedrich K. Jondral
High data rate waveforms for software defined radios (SDR) have to cope with frequency selective fading due to the mobile use in different harsh transmission environments. The received signal needs to be equalized in order to restore the transmitted information. Turbo equalization is a promising approach to deal with the inter-symbol interference occurring at the […]
Murat Uysal, Baha Sen, Canan Celik
A Binary Space Partitioning (BSP) tree can be used for hidden surface removal. To hide invisible surfaces, all surfaces are sorted in back-to-front or front-to-back order. Traversing a BSP tree to obtain the back-to-front order of faces requires a calculation at every BSP tree node, which can be done in parallel. […]
Simon John Pennycook
The gap between a supercomputer’s theoretical maximum ("peak") floating-point performance and that actually achieved by applications has grown wider over time. Today, a typical scientific application achieves only 5-20% of any given machine’s peak processing capability, and this gap leaves room for significant improvements in execution times. This problem is most pronounced for modern "accelerator" […]
Akhil Jindal, Nikhil Jindal, Divyashikha Sethia
With the introduction of GPGPUs, parallel programming has become simple and affordable. APIs such as NVIDIA’s CUDA have attracted many programmers to port their applications to GPGPUs. But writing CUDA codes still remains a challenging task. Moreover, the vast repositories of legacy serial C codes, which are still in wide use in the industry, are […]
Sathya Swaroop Ganta
Numerical solution models for Maxwell’s equations, which describe electromagnetic wave propagation with complete clarity, are of utmost importance in pre-fabrication simulation analyses of photonic and optoelectronic devices. The Finite Difference Time Domain (FDTD) method, which is based on modeling the differential equations as difference equations in a discretized domain in both space and […]
Vincent Stanley Dayes
Concern about the threats posed by natural proliferation of animal-borne human diseases like BSE ("mad cow disease") and by the possible use of animals as disease vectors in bioterrorism has spurred heightened interest in the development of methods for rapid automated identification of individual animals of various societally and commercially important mammalian species. Just as […]
Mattias Svanstrom, Simon Hossjer
Today every personal computer and almost every work-related computer has a GPU powerful enough to be used as a supplementary computational device. One framework that enables this is OpenCL. We asked how one writes efficient algorithms on these GPGPU devices. We found that there are two major ways to run […]
Masato Yoshimi, Takuya Kuhara, Kaname Nishimoto, Mitsunori Miki, Tomoyuki Hiroyasu
In this study, we visualize Pareto-optimal solutions derived from multi-objective optimization using spherical self-organizing maps (SOMs) that lay out SOM data in three dimensions. There has been a wide range of studies involving plane SOMs, in which Pareto-optimal solutions are mapped to a plane. However, plane SOMs have the issue that similar data differing in a […]
E. Arianyan, S. A. Motamedi, M. Hekmatpanah, I. Arianyan
Finite difference time domain (FDTD) is a numerical method for solving differential equations like Maxwell’s equations. Normally, simulation time of these equations is very long and there has been a great effort to reduce it. The most recent and useful way to reduce the simulation time of these equations is through using GPUs. Graphical processing […]
Kyoko Nakao, Erika Matsui, Naoko Yoshii, Masami Takata, Kazuki Joe
In this paper, we present an optimization of particle-based volume rendering for GPU platforms. In general, data transfer between CPU and GPU incurs long latency. Using page-locked memory through the CUDA runtime API, the data area is selected so that data transfer between CPU and GPU becomes faster, reducing the execution time. […]

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL applications at hgpu.org. We provide 1 minute of compute time per run on two nodes equipped with AMD and nVidia graphics processing units. There is no limit on the number of runs.

The platforms are:

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 1055T @ 2.8GHz
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

A completed OpenCL project should be uploaded via the User dashboard (see the instructions and example there). The terminal output logs from compilation and execution will be provided to the user.

The information sent to hgpu.org will be treated according to our Privacy Policy.

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors
