7668

Time-dependent density-functional theory in massively parallel computer architectures: the OCTOPUS project

Xavier Andrade, Joseba Alberdi-Rodriguez, David A. Strubbe, Micael J. T. Oliveira, Fernando Nogueira, Alberto Castro, Javier Muguerza, Agustin Arruabarrena, Steven G. Louie, Alan Aspuru-Guzik, Angel Rubio, Miguel A. L. Marques
Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138, USA
Journal of Physics: Condensed Matter, Volume 24, Number 23, 233202, 2012

@article{andrade2012time,

   title={Time-dependent density-functional theory in massively parallel computer architectures: the octopus project},

   author={Andrade, X. and Alberdi-Rodriguez, J. and Strubbe, D.A. and Oliveira, M.J.T. and Nogueira, F. and Castro, A. and Muguerza, J. and Arruabarrena, A. and Louie, S.G. and Aspuru-Guzik, A. and others},

   journal={Journal of Physics: Condensed Matter},

   volume={24},

   number={23},

   pages={233202},

   year={2012},

   publisher={IOP Publishing}

}

Download Download (PDF)   View View   Source Source   Source codes Source codes

Package:

2087

views

Octopus is a general-purpose density-functional theory (DFT) code, with a particular emphasis on the time-dependent version of DFT (TDDFT). In this paper we present the ongoing efforts to achieve the parallelization of octopus. We focus on the real-time variant of TDDFT, where the time-dependent Kohn-Sham equations are directly propagated in time. This approach has great potential for execution in massively parallel systems such as modern supercomputers with thousands of processors and graphics processing units (GPUs). For harvesting the potential of conventional supercomputers, the main strategy is a multi-level parallelization scheme that combines the inherent scalability of real-time TDDFT with a real-space grid domain-partitioning approach. A scalable Poisson solver is critical for the efficiency of this scheme. For GPUs, we show how using blocks of Kohn-Sham states provides the required level of data parallelism and that this strategy is also applicable for code optimization on standard processors. Our results show that real-time TDDFT, as implemented in octopus, can be the method of choice for studying the excited states of large molecular systems in modern parallel architectures.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: