{"id":29055,"date":"2024-02-04T14:30:57","date_gmt":"2024-02-04T12:30:57","guid":{"rendered":"https:\/\/hgpu.org\/?p=29055"},"modified":"2024-02-04T14:30:57","modified_gmt":"2024-02-04T12:30:57","slug":"high-order-thread-safe-lattice-boltzmann-model-for-hpc-turbulent-flow-simulations","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=29055","title":{"rendered":"High-order thread-safe lattice Boltzmann model for HPC turbulent flow simulations"},"content":{"rendered":"<p>We present a highly-optimized thread-safe lattice Boltzmann model in which the non-equilibrium part of the distribution function is locally reconstructed via recursivity of Hermite polynomials. Such a procedure allows the explicit incorporation of non-equilibrium moments of the distribution up to the order supported by the lattice. Thus, the proposed approach increases accuracy and stability at low viscosities without compromising performances and amenability to parallelization with respect to standard lattice Boltzmann models. The high-order thread-safe version, successfully employed here to simulate the flow in a smooth straight channel at Re\u03c4=180 and the axisymmetric turbulent jet at Re=7000, a) achieves peak performances (~5TeraFlop\/s and an arithmetic intensity of ~7FLOP\/byte on single GPU) by significantly reducing the memory footprint, b) retains the algorithmic simplicity of standard lattice Boltzmann computing and c) allows to perform stable simulations at vanishingly low viscosities. Our findings open attractive prospects for high-performance simulations of realistic turbulent flows on GPU-based architectures. Such expectations are confirmed by the excellent agreement among lattice Boltzmann, experimental, and DNS reference data.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We present a highly-optimized thread-safe lattice Boltzmann model in which the non-equilibrium part of the distribution function is locally reconstructed via recursivity of Hermite polynomials. Such a procedure allows the explicit incorporation of non-equilibrium moments of the distribution up to the order supported by the lattice. Thus, the proposed approach increases accuracy and stability at [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[89,104,3],"tags":[1600,14,1795,1682,108,20,2066,2082,176],"class_list":["post-29055","post","type-post","status-publish","format-standard","hentry","category-nvidia-cuda","category-fluid-dynamics","category-paper","tag-cfd","tag-cuda","tag-fluid-dynamics","tag-hpc","tag-lattice-boltzmann-model","tag-nvidia","tag-nvidia-a100","tag-nvidia-geforce-rtx-3090","tag-package"],"views":1402,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/29055","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=29055"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/29055\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=29055"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=29055"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=29055"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}