{"id":13021,"date":"2014-11-03T22:36:59","date_gmt":"2014-11-03T20:36:59","guid":{"rendered":"http:\/\/hgpu.org\/?p=13021"},"modified":"2014-11-03T22:36:59","modified_gmt":"2014-11-03T20:36:59","slug":"profiling-of-data-parallel-processors","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=13021","title":{"rendered":"Profiling of Data-Parallel Processors"},"content":{"rendered":"<p>Profiling data can help to improve an application with respect to various objectives like execution time, energy consumption or even thermal sensor placement for an upcoming device. This survey reviews state-of-the-art profiling tools for dataparallel processors like Nsight, PAPI and TAU as well as Lynx. Additionally, the attained knowledge is utilized to detect the bottleneck of a reduction kernel for a CUDA-enabled device.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Profiling data can help to improve an application with respect to various objectives like execution time, energy consumption or even thermal sensor placement for an upcoming device. This survey reviews state-of-the-art profiling tools for dataparallel processors like Nsight, PAPI and TAU as well as Lynx. Additionally, the attained knowledge is utilized to detect the bottleneck [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[11,89,3],"tags":[1782,14,20,67,1390],"class_list":["post-13021","post","type-post","status-publish","format-standard","hentry","category-computer-science","category-nvidia-cuda","category-paper","tag-computer-science","tag-cuda","tag-nvidia","tag-performance","tag-tesla-k20"],"views":2105,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/13021","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=13021"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/13021\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=13021"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=13021"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=13021"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}