%0 Conference Proceedings %T Modified Incomplete Cholesky Preconditioned Conjugate Gradient Algorithm on GPU for the 3D Parabolic Equation %+ College of Computer Science and Technology [Hangzhou] %+ Zhijiang College %A Gao, Jiaquan %A Li, Bo %A He, Guixia %Z Part 4: Session 4: Multi-core Computing and GPU %< avec comité de lecture %( Lecture Notes in Computer Science %B 10th International Conference on Network and Parallel Computing (NPC) %C Guiyang, China %Y Ching-Hsien Hsu %Y Xiaoming Li %Y Xuanhua Shi %Y Ran Zheng %I Springer %3 Network and Parallel Computing %V LNCS-8147 %P 298-307 %8 2013-09-19 %D 2013 %R 10.1007/978-3-642-40820-5_25 %K conjugate gradient algorithm %K modified incomplete Cholesky preconditioner %K parabolic equation %K GPU %Z Computer Science [cs]Conference papers %X In this study, for solving the three-dimensional partial differential equation ut = uxx + uyy + uzz, an efficient parallel method based on the modified incomplete Cholesky preconditioned conjugate gradient algorithm (MICPCGA) on the GPU is presented. In our proposed method, for this case, we overcome the drawbacks that the MIC preconditioner is generally difficult to be parallelized on the GPU due to the forward/backward substitutions, and thus present an efficient parallel implementation method on the GPU. Moreover, a vector kernel for the sparse matrix-vector multiplication, and optimization of vector operations by grouping several vector operations into a single kernel are adopted. Numerical results show that our proposed forward/backward substitutions and MICPCGA on the GPU both can achieve a significant speedup, and compared to an approximate inverse SSOR preconditioned conjugate gradient algorithm (SSORPCGA), our proposed MICPCGA obtains a bigger speedup, and outperforms it in solving the three-dimensional partial differential equation. %G English %2 https://inria.hal.science/hal-01513768/document %2 https://inria.hal.science/hal-01513768/file/978-3-642-40820-5_25_Chapter.pdf %L hal-01513768 %U https://inria.hal.science/hal-01513768 %~ IFIP-LNCS %~ IFIP %~ IFIP-NPC %~ IFIP-LNCS-8147