{"id":28876,"date":"2023-12-18T11:10:18","date_gmt":"2023-12-18T09:10:18","guid":{"rendered":"https:\/\/hgpu.org\/?p=28876"},"modified":"2023-12-18T11:10:18","modified_gmt":"2023-12-18T09:10:18","slug":"cusz-i-high-fidelity-error-bounded-lossy-compression-for-scientific-data-on-gpus","status":"publish","type":"post","link":"https:\/\/hgpu.org\/?p=28876","title":{"rendered":"cuSZ-I: High-Fidelity Error-Bounded Lossy Compression for Scientific Data on GPUs"},"content":{"rendered":"<p>Error-bounded lossy compression is a critical technique for significantly reducing scientific data volumes. Compared to CPU-based scientific compressors, GPU-accelerated compressors exhibit substantially higher throughputs, which can thus better adapt to GPU-based scientific simulation applications. However, a critical limitation still lies in all existing GPU-accelerated error-bounded lossy compressors: they suffer from low compression ratios, which strictly restricts their scope of usage. To address this limitation, in this paper, we propose a new design of GPU-accelerated scientific error-bounded lossy compressor, namely cuSZ-I, which makes the following contributions: (1) A brand new GPU-customized interpolation-based data prediction method is proposed in cuSZ-I to substantially improve the compression ratio and the decompression data quality. (2) The Huffman encoding module in cuSZ-I has been improved for both efficiency and stability. (3) cuSZ-I is the first work to integrate the highly effective NVIDIA bitcomp lossless compression module to maximally boost the compression ratio for GPU-accelerated lossy compressors with nearly negligible speed degradation. In experimental evaluations, with the same magnitude of compression throughput as existing GPU-accelerated compressors, cuSZ-I significantly outperforms other state-of-the-art GPU-based scientific lossy compressors in terms of compression ratio and quality. 
It improves the compression ratio by up to 500% under the same error bound or PSNR. In several real-world use cases, cuSZ-I also achieves the best performance, with the lowest time cost for distributed lossy data transmission tasks and the highest visualization quality of decompressed data.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Error-bounded lossy compression is a critical technique for significantly reducing scientific data volumes. Compared to CPU-based scientific compressors, GPU-accelerated compressors exhibit substantially higher throughputs, which can thus better adapt to GPU-based scientific simulation applications. However, a critical limitation still lies in all existing GPU-accelerated error-bounded lossy compressors: they suffer from low compression ratios, which strictly [&hellip;]<\/p>\n","protected":false},"author":351,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[11,89,3],"tags":[832,1782,14,20,2066,2109],"class_list":["post-28876","post","type-post","status-publish","format-standard","hentry","category-computer-science","category-nvidia-cuda","category-paper","tag-compression","tag-computer-science","tag-cuda","tag-nvidia","tag-nvidia-a100","tag-nvidia-a40"],"views":1318,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/28876","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hgpu.org\/index.php?re
st_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/users\/351"}],"replies":[{"embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=28876"}],"version-history":[{"count":0,"href":"https:\/\/hgpu.org\/index.php?rest_route=\/wp\/v2\/posts\/28876\/revisions"}],"wp:attachment":[{"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=28876"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=28876"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hgpu.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=28876"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}