{"id":6452,"date":"2011-12-01T16:50:44","date_gmt":"2011-12-01T14:50:44","guid":{"rendered":"http:\/\/hgpu.org\/?p=6452"},"modified":"2011-12-01T16:50:44","modified_gmt":"2011-12-01T14:50:44","slug":"gpu-acceleration-of-solving-parabolic-partial-differential-equations-using-difference-equations","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=6452","title":{"rendered":"GPU Acceleration of Solving Parabolic Partial Differential Equations Using Difference Equations"},"content":{"rendered":"<p>Parabolic partial differential equations are often used to model systems involving heat transfer, acoustics, and electrostatics. The need for more complex models with increasing precision drives greater computational demands from processors. Since solving these types of equations is inherently parallel, GPU computing offers an attractive solution for  drastically decreasing time to completion, power usage, and increasing the computation per dollar. However, since GPU computing involves a much different programming paradigm  than traditional processors, techniques for optimizing solvers must still be developed. This paper presents several optimization strategies for accelerating solvers using CUDA to implement difference equations  and compares their performances to a standard processor. The results demonstrate that different strategies should be used for different GPU cards, such as the C1060 and GTX 480, resulting in up to 197 times and  257 times single-precision and up to 133 and 163 times doubleprecision speedups respectively.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Parabolic partial differential equations are often used to model systems involving heat transfer, acoustics, and electrostatics. The need for more complex models with increasing precision drives greater computational demands from processors. Since solving these types of equations is inherently parallel, GPU computing offers an attractive solution for drastically decreasing time to completion, power usage, and [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[11,89,3],"tags":[1782,14,810,20,379,298,550,551,199],"class_list":["post-6452","post","type-post","status-publish","format-standard","hentry","category-computer-science","category-nvidia-cuda","category-paper","tag-computer-science","tag-cuda","tag-differential-equations","tag-nvidia","tag-nvidia-geforce-gtx-480","tag-optimization","tag-partial-differential-equations","tag-pdes","tag-tesla-c1060"],"views":2045,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/6452","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=6452"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/6452\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=6452"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=6452"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=6452"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}