{"id":1042,"date":"2010-10-30T05:24:00","date_gmt":"2010-10-30T05:24:00","guid":{"rendered":"http:\/\/hgpu.org\/?p=1042"},"modified":"2010-10-30T05:24:00","modified_gmt":"2010-10-30T05:24:00","slug":"high-performance-direct-gravitational-n-body-simulations-on-graphics-processing-units-ii-an-implementation-in-cuda","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=1042","title":{"rendered":"High performance direct gravitational N-body simulations on graphics processing units II: An implementation in CUDA"},"content":{"rendered":"<p>We present the results of gravitational direct N-body simulations using the graphics processing unit (GPU) on a commercial NVIDIA GeForce 8800GTX designed for gaming computers. The force evaluation of the N-body problem is implemented in \u201cCompute Unified Device Architecture\u201d (CUDA) using the GPU to speed up the calculations. We tested the implementation on three different N-body codes: two direct N-body integration codes, using the 4th order predictor\u2013corrector Hermite integrator with block time-steps, and one Barnes-Hut treecode, which uses a 2nd order leapfrog integration scheme. The integration of the equations of motion for all codes is performed on the host CPU. We find that for N\u00a0&#62;\u00a0512 particles the GPU outperforms the GRAPE-6Af, if some softening in the force calculation is accepted. Without softening and for very small integration time-steps the GRAPE still outperforms the GPU. We conclude that modern GPUs offer an attractive alternative to GRAPE-6Af special purpose hardware. Using the same time-step criterion, the total energy of the N-body system was conserved to better than one part in 10<sup>6<\/sup> on the GPU, only about an order of magnitude worse than obtained with GRAPE-6Af. 
For N\u00a0\u2273\u00a010<sup>5<\/sup> the 8800GTX outperforms the host CPU by a factor of about 100 and runs at about the same speed as the GRAPE-6Af.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We present the results of gravitational direct N-body simulations using the graphics processing unit (GPU) on a commercial NVIDIA GeForce 8800GTX designed for gaming computers. The force evaluation of the N-body problem is implemented in \u201cCompute Unified Device Architecture\u201d (CUDA) using the GPU to speed up the calculations. We tested the implementation on three different [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[96,89,3],"tags":[1794,14,256,258,20,183,257],"class_list":["post-1042","post","type-post","status-publish","format-standard","hentry","category-astrophysics","category-nvidia-cuda","category-paper","tag-astrophysics","tag-cuda","tag-gravitation","tag-n-body-simulation","tag-nvidia","tag-nvidia-geforce-8800-gtx","tag-stellar-dynamics"],"views":2453,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/1042","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=
%2Fwp%2Fv2%2Fcomments&post=1042"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/1042\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1042"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1042"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1042"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}