![](/images/graphics-bg.png)
Implementing and Evaluating an Heterogeneous, Scalable, Tridiagonal Linear System Solver with OpenCL to Target FPGAs, GPUs, and CPUs
المؤلفون المشاركون
Macintosh, Hamish J.
Banks, Jasmine E.
Kelson, Neil A.
المصدر
International Journal of Reconfigurable Computing
العدد
المجلد 2019، العدد 2019 (31 ديسمبر/كانون الأول 2019)، ص ص. 1-13، 13ص.
الناشر
Hindawi Publishing Corporation
تاريخ النشر
2019-10-13
دولة النشر
مصر
عدد الصفحات
13
التخصصات الرئيسية
تكنولوجيا المعلومات وعلم الحاسوب
الملخص EN
Solving diagonally dominant tridiagonal linear systems is a common problem in scientific high-performance computing (HPC).
Furthermore, it is becoming more commonplace for HPC platforms to utilise a heterogeneous combination of computing devices.
Whilst it is desirable to design faster implementations of parallel linear system solvers, power consumption concerns are increasing in priority.
This work presents the oclspkt routine.
The oclspkt routine is a heterogeneous OpenCL implementation of the truncated SPIKE algorithm that can use FPGAs, GPUs, and CPUs to concurrently accelerate the solving of diagonally dominant tridiagonal linear systems.
The routine is designed to solve tridiagonal systems of any size and can dynamically allocate optimised workloads to each accelerator in a heterogeneous environment depending on the accelerator’s compute performance.
The truncated SPIKE FPGA solver is developed first for optimising OpenCL device kernel performance, global memory bandwidth, and interleaved host to device memory transactions.
The FPGA OpenCL kernel code is then refactored and optimised to best exploit the underlying architecture of the CPU and GPU.
An optimised TDMA OpenCL kernel is also developed to act as a serial baseline performance comparison for the parallel truncated SPIKE kernel since no FPGA tridiagonal solver capable of solving large tridiagonal systems was available at the time of development.
The individual GPU, CPU, and FPGA solvers of the oclspkt routine are 110%, 150%, and 170% faster, respectively, than comparable device-optimised third-party solvers and applicable baselines.
Assessing heterogeneous combinations of compute devices, the GPU + FPGA combination is found to have the best compute performance and the FPGA-only configuration is found to have the best overall estimated energy efficiency.
نمط استشهاد جمعية علماء النفس الأمريكية (APA)
Macintosh, Hamish J.& Banks, Jasmine E.& Kelson, Neil A.. 2019. Implementing and Evaluating an Heterogeneous, Scalable, Tridiagonal Linear System Solver with OpenCL to Target FPGAs, GPUs, and CPUs. International Journal of Reconfigurable Computing،Vol. 2019, no. 2019, pp.1-13.
https://search.emarefa.net/detail/BIM-1168485
نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)
Macintosh, Hamish J.…[et al.]. Implementing and Evaluating an Heterogeneous, Scalable, Tridiagonal Linear System Solver with OpenCL to Target FPGAs, GPUs, and CPUs. International Journal of Reconfigurable Computing No. 2019 (2019), pp.1-13.
https://search.emarefa.net/detail/BIM-1168485
نمط استشهاد الجمعية الطبية الأمريكية (AMA)
Macintosh, Hamish J.& Banks, Jasmine E.& Kelson, Neil A.. Implementing and Evaluating an Heterogeneous, Scalable, Tridiagonal Linear System Solver with OpenCL to Target FPGAs, GPUs, and CPUs. International Journal of Reconfigurable Computing. 2019. Vol. 2019, no. 2019, pp.1-13.
https://search.emarefa.net/detail/BIM-1168485
نوع البيانات
مقالات
لغة النص
الإنجليزية
الملاحظات
Includes bibliographical references
رقم السجل
BIM-1168485
قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"
أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي
![](/images/ebook-kashef.png)
تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر
![](/images/kashef-image.png)