{"id":9539,"date":"2013-06-06T23:05:58","date_gmt":"2013-06-06T20:05:58","guid":{"rendered":"http:\/\/hgpu.org\/?p=9539"},"modified":"2013-06-06T23:05:58","modified_gmt":"2013-06-06T20:05:58","slug":"parallel-implementation-of-finite-element-codes-using-cuda","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=9539","title":{"rendered":"Parallel Implementation of Finite Element Codes using CUDA"},"content":{"rendered":"<p>The purpose of this work is to study the performance of parallel computation of Finite Element Method using the NVIDIA&#8217;s CUDA. The numerical experiments are performed only on the stiffness matrix using the conjugate gradient method. In addition, the generalized minimal residual method is considered to solve the Stokes problem using both PETSc and CUDA. The speed up related to the data size will be demonstrated. Future improvements of the parallel computations will also be addressed.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The purpose of this work is to study the performance of parallel computation of Finite Element Method using the NVIDIA&#8217;s CUDA. The numerical experiments are performed only on the stiffness matrix using the conjugate gradient method. In addition, the generalized minimal residual method is considered to solve the Stokes problem using both PETSc and CUDA. [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[11,89,3],"tags":[1782,580,14,1037,212,20,1232,1090],"class_list":["post-9539","post","type-post","status-publish","format-standard","hentry","category-computer-science","category-nvidia-cuda","category-paper","tag-computer-science","tag-conjugate-gradient-solver","tag-cuda","tag-fem","tag-finite-element-method","tag-nvidia","tag-nvidia-geforce-gts-450","tag-nvidia-geforce-gtx-560"],"views":3503,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/9539","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=9539"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/9539\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=9539"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=9539"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=9539"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}