{"id":10462,"date":"2013-09-06T23:28:34","date_gmt":"2013-09-06T20:28:34","guid":{"rendered":"http:\/\/hgpu.org\/?p=10462"},"modified":"2013-09-06T23:28:34","modified_gmt":"2013-09-06T20:28:34","slug":"d5-5-2-architectural-techniques-to-exploit-slack-accuracy-trade-offs","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=10462","title":{"rendered":"D5.5.2 &#8211; Architectural Techniques to exploit SLACK &amp; ACCURACY trade-offs"},"content":{"rendered":"<p>In this work we (a) explore memory slack in state-of-the-art many-core CPUs and GPUs, (b) present techniques to eliminate slack, and (c) explore the architectural parameters that improve power efficiency. Dynamic Voltage-Frequency Scaling (DVFS) is one of the most beneficial techniques for improving the power efficiency of CPUs. The end of Dennard scaling, however, in which the available voltage range shrinks as technology advances, is threatening the effectiveness of DVFS. This is already common in GPUs today and will become a severe limitation for many-cores in the near future. In this report we analyse the impact of core DVFS at different memory frequencies on state-of-the-art GPUs. Because of limitations imposed by either the programming models or the hardware itself, we could not apply DVFS on embedded low-power GPUs. Therefore we shift our attention to general-purpose multi-cores and demonstrate significant energy benefits from our proposed execution scheme. For the GPU evaluation we use the NVIDIA CUDA toolkit and some custom micro-benchmarks. Our analysis shows that DVFS can give a significant energy benefit on architectures with restricted memory bandwidth, such as embedded or mobile GPUs (although this is restricted to simulated runs due to limitations).
Finally, our work (a) proposes and evaluates a novel execution scheme for general-purpose many-cores, and (b) investigates an intriguing future direction, revealing that the energy inefficiencies of GPUs are related not to memory slack but to the mechanisms used to hide slack, which seem to compromise application locality.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In this work we (a) explore memory slack in state-of-the-art many-core CPUs and GPUs, (b) present techniques to eliminate slack, and (c) explore the architectural parameters that improve power efficiency. Dynamic Voltage-Frequency Scaling (DVFS) is one of the most beneficial techniques for improving the power efficiency of CPUs. The end of Dennard scaling, however, [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[3],"tags":[451,884,1467],"class_list":["post-10462","post","type-post","status-publish","format-standard","hentry","category-paper","tag-benchmarking","tag-memory","tag-power-efficient-computing"],"views":1943,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/10462","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":tr
ue,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=10462"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/10462\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=10462"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=10462"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=10462"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}