Moritz Kreutzer, Georg Hager, Gerhard Wellein, Andreas Pieper, Andreas Alvermann, Holger Fehske
The Kernel Polynomial Method (KPM) is a well-established scheme in quantum physics and quantum chemistry to determine the eigenvalue density and spectral properties of large sparse matrices. In this work we demonstrate the high optimization potential and feasibility of peta-scale heterogeneous CPU-GPU implementations of the KPM. At the node level we show that it is […]
View View   Download Download (PDF)   
Arne Vansteenkiste, Jonathan Leliaert, Mykola Dvornik, Felipe Garcia-Sanchez, Bartel Van Waeyenberge
We report on the design, verification and performance of mumax3, an open-source GPU-accelerated micromagnetic simulation program. This software solves the time- and space dependent magnetization evolution in nano- to micro scale magnets using a finite-difference discretization. Its high performance and low memory requirements allow for large-scale simulations to be performed in limited time and on […]
William R. French, Amulya K. Pervaje, Andrew P. Santos, Christopher R. Iacovella, Peter T. Cummings
We perform a large-scale statistical analysis (> 2000 independent simulations) of the elongation and rupture of gold nanowires, probing the validity and scope of the recently proposed ductile-to-brittle transition that occurs with increasing nanowire length [Wu et. al., Nano Lett., 12, 910-914 (2012)]. To facilitate a high-throughput simulation approach, we implement the second-moment approximation to […]
View View   Download Download (PDF)   
Zheyong Fan, Andreas Uppstu, Topi Siro, Ari Harju
We implement, optimize, and validate the linear-scaling Kubo-Greenwood quantum transport simulation on graphics processing units by examining resonant scattering in graphene. We consider two practical representations of the Kubo-Greenwood formula: a Green-Kubo formula based on the velocity auto-correlation and an Einstein formula based on the mean square displacement. The code is fully implemented on graphics […]
View View   Download Download (PDF)   
Alexander V. Yakubovich, Gennady Sushko, Stefan Schramm, Andrey V. Solov'yov
In this paper we have explored computationally the solidification process of large nickel clusters. This process has the characteristic features of the first order phase transition occurring in a finite system. The focus of our research is placed on the elucidation of correlated dynamics of a large ensemble of particles in the course of the […]
View View   Download Download (PDF)   
Lucian Covaci, Francois Peeters
The interplay between quantum Hall states and Cooper pairs is usually hindered by the suppression of the superconducting state due to the strong magnetic fields needed to observe the quantum Hall effect. From this point of view graphene is special since it allows the creation of strong pseudo-magnetic fields due to strain. We show that […]
View View   Download Download (PDF)   
Pierre Wahl, Dany-Sebastien Ly-Gagnon, Christof Debaes, David A.B. Miller, Hugo Thienpont
Numerical calculations with finite-difference time-domain (FDTD) on metallic nanostructures in a broad optical spectrum require an accurate approximation of the permittivity of dispersive materials. Here, we present the algorithms behind B-CALM (Belgium-California Light Machine), an open-source 3D-FDTD solver operating on Graphical Processing Units (GPU’s) with multi-pole dispersion models. Our modified architecture shows a reduction in […]
G. Thalhammer, R. Steiger, M. Meinschad, M. Hill, S. Bernet, M. Ritsch-Marte
Combining several methods for contact free micro-manipulation of small particles such as cells or micro-organisms provides the advantages of each method in a single setup. Optical tweezers, which employ focused laser beams, offer very precise and selective handling of single particles. On the other hand, acoustic trapping with wavelengths of about 1 mm allows the […]
View View   Download Download (PDF)   
Vincent Favre-Nicolin, Johann Coraux, Marie-Ingrid Richard, Hubert Renevier
Scattering maps from strained or disordered nanostructures around a Bragg reflection can be either computed quickly using approximations and a (fast) Fourier transform or obtained using individual atomic positions. In this article, it is shown that it is possible to compute up to 4*10^10 reflections*atoms*s^-1 using a single graphics card, and the manner in which […]
S. Kiess, M.Z. Shaikh, M. Gregoire, T. Bringewat, S. Simon, A. Tausendfreund, M. Zimmermann, G. Goch
Optical measurements based on laser light scattering by nanostructures provide fast and contactless measurement of the surface of nanostructures for defects. In this paper, a novel in-process measurement method based on coherent laser light scattering by sub-100-nm structures is presented. It is shown that nanostructure defects can be identified by their unique scattering pattern. This […]
Rohit Pathak, Satyadhar Joshi
The essence of high performance computing (HPC) in the field of computation nanotechnology and problems encountered by HPC arrangement in applying HPC to nano-enabled calculations have been presented in the paper. A proposal to optimize computations in an HPC setup has been formulated to make nanotechnology computations more effective and realistic on a CUDA based […]
View View   Download Download (PDF)   
S. Ihnatsenka
The performance potential for simulating quantum electron transport on graphical processing units (GPUs) is studied. Using graphene ribbons of realistic sizes as an example it is shown that GPUs provide significant speed-ups in comparison to central processing units as the transverse dimension of the ribbon grows. The recursive Green’s function algorithm is employed and implementation […]
View View   Download Download (PDF)   
Page 1 of 212

* * *

* * *

Follow us on Twitter

HGPU group

1665 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

339 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: