Home

KFUPM ePrints

In this section

MODIFIED SZ LOSSY COMPRESSION ON-FPGA USING OPENCL

MODIFIED SZ LOSSY COMPRESSION ON-FPGA USING OPENCL. Masters thesis, King Fahd University of Petroleum and Minerals.

PDF (Master Thesis)
ThesisFinalversion.pdf - Submitted Version
Restricted to Repository staff only until 3 January 2025.
Download (7MB)

Arabic Abstract

التطبيقات الحالية للحوسبة عالية الأداء تنتج بشكل متزايد بيانات كبيرة للغاية. بالنسبة لبعض التطبيقات ، يمكن أن ينتج ما يصل إلى بيتا بايت في فترة زمنية قصيرة. يعتبر هذا الكم الهائل من البيانات عبئ ً ا ثقيلا. وبالتالي ، أصبح نقل البيانات وتخزينها عقبة أمام مثل هذه الأنظمة. في ظل هذا الظرف ، وخاصة مع معدل التحسين البطيء في كل من أ ٕ نظمة التخزين ونطاق الا ٕ دخال/الاخراج ، يصبح ضغط البيانات حلاً ً جذابا. تم اعتماد الضغط المنقصوص -والذي يتم التحكم في نسبة فقدان البيانات عن طريق تحديد نسبة مقبولة للخطأ- على نطاق واسع وهذا بالتأكيد بسبب نسبة الضغط العالية التي يمكن أن تقدمها مقارنة بالضواغط التقليدية غير المنقوصة إلى جانب قدرتها على تقييد الخطأ الذي تم إدخاله استناداً على نطاق محدد مسبقا. تعاني معظم الضواغط المنقوصة -المحددة بنسبة خطأ معين- من إنتاجية منخفضة جدا في كل من الضغط وإلغاء الضغط. ويرجع ذلك أساسا إلى الطبيعة المتسلسلة لهذه الخوارزميات ، والتي تؤدي إلى تقييد استخدامها في أنظمة التطبيقات عالية الحوسبة نظرا لارتفاع معدل توليد البيانات لهذه الأجهزة عالية الأداء. أدى الحجم الهائل للبيانات التي يتم إنتاجها في وقت قصير إلى الحاجة إلى استخدام هذه الضواغط على المسرعات للحصول على أداء أ ٕ فضل من حيث الانتاجية ووقت الضغط. SZهو عبارة عن ضاغط منقصوص حيث يمكن التحكم في نسبة فقدان البيانات عن طريق تحديد نسبة مقبولة للخطأ حقق هذا الضاغط نسب ضغط عالية مقارنة بالضواغط الأخرى. إنها بالأساس خوارزمية قائمة على التنبؤ حيث تستغل ارتباط البيانات فيما بينها ودائماً ما نجد هذا الارتباط في البيانات العلمية. في هذا العمل ، نقترح تنفيذ SZعلى المسرعات ونقدم بعض التعديلات لتحقيق تنفيذ أكثر كفاءة على هذه الأجهزة. أولاً ، نقدم مخطط ترميز ثابت جديد يهدف إ ٕ لى تقليل تعقيد تنفيذ الترميز المستخدم اصلا بالخوارزمية . بالاضافة إلى ذلك ، تم اقتراح طريقة جديدة تمتاز بالبساطة والفعالية لضغط النقاط التي لا يمكن التنبؤ بها بواسطة الخوارزمية، ويأتي هذا التعديل بدلاً من تمثيل التحليل الثنائي الذي لا يمكن تنفيذه على المسرعات. لقد قمنا بتنفيذ التعديلات المقترحة في البرنامج وأثبتنا أنها تعطي نتائج مماثلة للخوارزمية الأصلية. تم تنفيذ النسخة المقترحة من الضاغط على المسرع FPGAباستخدام OpenCLواثبتت التجارب أن نسختنا تفوقت في الانتاجية على النسخة الأصلية للخوارزمية والتي تعمل على الحاسب الالي بمعدل يقارب الثمانية مرات وتقوقت على النسخ التي تستهدف الأجهزة بنسب متفاوتة أيضا

English Abstract

Current applications of high-performance computing (HPC) are increasingly producing extremely large data. For some applications, it can produce up to Peta bytes in a short period of time. This extensive amount of data is considered a heavy burden for exascale systems. Consequently, data movement and storage have become an obstacle for such systems. Under this circumstance, especially with the slow rate of improvement in both storage systems and I/O bandwidth, data compression becomes an appealing solution. Error-controlled lossy compressors have been adopted widely and this is surely because of the high compression ratio that they could offer compared with the traditional lossless compressors besides their ability to restrict the introduced error to a certain predefined range. Most lossy compressors suffer from very low throughput in both compression and decompression. This is mainly due to the sequential nature of these algorithms, which yield to restricting their use in HPC systems because of the high rate of data generation for these high-performant machines. The huge volume of data generated in a short time led to the need of implementing these compressors in hardware to gain a significant improvement in terms of throughput and compression time. SZ [2] is a state-of-the-art error-bounded compressor that achieved high compression ratios compared to other compressors. It is a prediction-based algorithm that exploits data correlation in scientific data sets. In this work, we propose implementing SZ in hardware and we introduce some modifications to achieve a more effcient hardware implementation. Firstly, we introduce a new fixed coding scheme that aims to reduce the complexity of Huffman coding implementation. In addition, a new simpler but also effcient way to compress unpredicted points is proposed instead of the binary analysis representation which makes the algorithm more suitable for Hardware implementation. We have implemented the proposed modifications in software and demonstrated that they give comparable results in comparison to the original SZ algorithm. Our proposed version of SZ was implemented in FPGA using OpenCL. Experiments show that our hardware implementation can improve SZ’s compression throughput by 8.5× over the production version running on a CPU and by 1.7× and 7.5×, respectively, compared with other FPGA implementations while maintaining the same error-bound and compression ratios.

Item Type:	Thesis (Masters)
Subjects:	Computer
Department:	College of Computing and Mathematics > Computer Engineering
Committee Advisor:	El-Maleh, Aiman H.
Committee Members:	Elrabaa, M.E.S. and Mudawar, Muhamed
Depositing User:	HAZIM AHMED (g201806080)
Date Deposited:	04 Jan 2024 07:51
Last Modified:	04 Jan 2024 07:51
URI:	http://eprints.kfupm.edu.sa/id/eprint/142732