{"id":10164,"date":"2013-07-31T15:26:34","date_gmt":"2013-07-31T12:26:34","guid":{"rendered":"http:\/\/hgpu.org\/?p=10164"},"modified":"2013-07-31T15:26:34","modified_gmt":"2013-07-31T12:26:34","slug":"graphics-processing-unit-acceleration-of-the-random-phase-approximation-in-the-projector-augmented-wave-method","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=10164","title":{"rendered":"Graphics Processing Unit acceleration of the Random Phase Approximation in the projector augmented wave method"},"content":{"rendered":"<p>The Random Phase Approximation (RPA) for correlation energy in the grid-based projector augmented wave (gpaw) code is accelerated by porting to the Graphics Processing Unit (GPU) architecture. The acceleration is achieved by grouping independent vectors\/matrices and transforming the implementation from being memory bound to being computation\/latency bound. With this approach, both the CPU and GPU implementations have been enhanced. We tested the GPU implementation on a few representative systems: molecules (O2), bulk solids (Li2O and MoO3) and molecules adsorbed on metal surfaces (N2\/Ru(0001) and CO\/Ni(111)). Improvements from 10+ to 40+ have been achieved (8-GPUs versus 8-CPUs). A realistic RPA calculation for CO\/Ni(111) surface can be finished in 5.5 h using 8 GPUs. It is thus promising to employ non-self-consistent RPA for routine surface chemistry simulations.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Random Phase Approximation (RPA) for correlation energy in the grid-based projector augmented wave (gpaw) code is accelerated by porting to the Graphics Processing Unit (GPU) architecture. The acceleration is achieved by grouping independent vectors\/matrices and transforming the implementation from being memory bound to being computation\/latency bound. With this approach, both the CPU and GPU [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[89,3,12],"tags":[98,196,14,166,20,176,1783,1226],"class_list":["post-10164","post","type-post","status-publish","format-standard","hentry","category-nvidia-cuda","category-paper","category-physics","tag-computational-physics","tag-condensed-matter","tag-cuda","tag-materials-science","tag-nvidia","tag-package","tag-physics","tag-tesla-c2075"],"views":2346,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/10164","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=10164"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/10164\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=10164"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=10164"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=10164"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}