Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU
Joint Authors
Lee, Myoungho
Tran, Nhat-Phuong
Hong, Sugwon
Source
Scientific Programming
Issue
Vol. 2017, Issue 2017 (31 Dec. 2017), pp.1-16, 16 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2017-01-16
Country of Publication
Egypt
No. of Pages
16
Main Subjects
Abstract EN
The Lattice Boltzmann Method (LBM) is a powerful numerical method for simulating fluid flow.
With its data-parallel nature, it is a promising candidate for parallel implementation on a GPU.
The LBM, however, is heavily data intensive and memory bound.
In particular, moving data to adjacent cells in the streaming phase incurs many uncoalesced memory accesses on the GPU, which degrades overall performance.
Furthermore, the main computation kernels of the LBM use a large number of registers per thread, which limits the thread parallelism available at run time because the number of registers on the GPU is fixed.
In this paper, we develop a high-performance parallelization of the LBM on a GPU by minimizing the overheads associated with uncoalesced memory accesses while improving cache locality through tiling optimization combined with a data layout change.
Furthermore, we aggressively reduce the register usage of the LBM kernels in order to increase run-time thread parallelism.
Experimental results on the Nvidia Tesla K20 GPU show that our approach delivers a throughput of 1210.63 Million Lattice Updates Per Second (MLUPS).
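As a rough illustration of the two ideas in the abstract (data layout and the MLUPS metric), the following sketch compares index calculations for an array-of-structures and a structure-of-arrays layout, and computes MLUPS from a grid size and run time. It is not the authors' implementation: the value Q = 19 (a D3Q19 lattice), the grid size, and the function names are assumptions chosen for illustration only.

```python
# Illustrative sketch only; Q, the grid size, and all names are assumptions.
Q = 19  # assumed number of distribution functions per cell (D3Q19)


def aos_index(cell, q):
    """Array-of-structures: the Q values of one cell are stored contiguously."""
    return cell * Q + q


def soa_index(cell, q, n_cells):
    """Structure-of-arrays: direction q of all cells is stored contiguously."""
    return q * n_cells + cell


def mlups(nx, ny, nz, steps, seconds):
    """Million lattice updates per second for an nx*ny*nz grid."""
    return nx * ny * nz * steps / (seconds * 1e6)


n_cells = 128 ** 3  # hypothetical grid

# Distance in memory between the values read by two neighbouring threads
# (cells 0 and 1) for the same direction q = 3.
aos_stride = aos_index(1, 3) - aos_index(0, 3)
soa_stride = soa_index(1, 3, n_cells) - soa_index(0, 3, n_cells)
```

With these definitions, neighbouring threads are Q elements apart in the array-of-structures layout but adjacent in the structure-of-arrays layout, which is the property that generally allows a GPU to coalesce their loads.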
American Psychological Association (APA)
Tran, Nhat-Phuong & Lee, Myoungho & Hong, Sugwon. 2017. Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU. Scientific Programming, Vol. 2017, no. 2017, pp.1-16.
https://search.emarefa.net/detail/BIM-1203277
Modern Language Association (MLA)
Tran, Nhat-Phuong…[et al.]. Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU. Scientific Programming No. 2017 (2017), pp.1-16.
https://search.emarefa.net/detail/BIM-1203277
American Medical Association (AMA)
Tran, Nhat-Phuong & Lee, Myoungho & Hong, Sugwon. Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU. Scientific Programming. 2017. Vol. 2017, no. 2017, pp.1-16.
https://search.emarefa.net/detail/BIM-1203277
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1203277