An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks

Chengchun Liu; Zhang Yang; Limin Xiao; Baicheng Yan; Zhihao Wang; Hongyun Tian

doi:10.1007/978-3-030-05677-3_5

Conference Papers Year : 2018

An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks

(1) , (2) , (1) , (1) , (1) , (2)

1
2

Chengchun Liu

Function : Author

School of Computer Science and Engineering [Beijing]

Zhang Yang

Function : Author
PersonId : 1053401

IAPCM - Institute of Applied Physics and Computational Mathematics [Beijing]

Limin Xiao

Function : Author
PersonId : 1006934

School of Computer Science and Engineering [Beijing]

Baicheng Yan

Function : Author

School of Computer Science and Engineering [Beijing]

Zhihao Wang

Function : Author

School of Computer Science and Engineering [Beijing]

Hongyun Tian

Function : Author

IAPCM - Institute of Applied Physics and Computational Mathematics [Beijing]

Abstract

Point-to-point latency is one of the most important metrics for high performance computer networks and is used widely in communication performance modeling, link-failure detection, and application optimization. However, it is often hard to determine the full-scale point-to-point latency of large scale HPC networks since it often requires measurements to the square of the number of terminal nodes. In this paper, we propose an efficient method to generate measurement plans for arbitrary indirect HPC networks and reduces the measurement requirements from $$O(n^2)$$ to m, which is often O(n) in modern indirect networks containing n nodes and m links, thus significantly reduces the latency measure overhead. Both analysis and experiments show that the proposed method can reduce the overhead of large-scale fat-tree networks by orders of magnitudes.

Domains

Computer Science [cs]

Fichier principal

477597_1_En_5_Chapter.pdf (542.21 Ko)

Origin	Files produced by the author(s)
licence	CC BY 4.0 - Attribution

Connect in order to contact the contributor

https://inria.hal.science/hal-02279556

Submitted on : Thursday, September 5, 2019-1:31:24 PM

Last modification on : Thursday, May 16, 2024-5:28:04 PM

Long-term archiving on : Thursday, February 6, 2020-5:00:48 AM

Dates and versions

hal-02279556 , version 1 (05-09-2019)

Licence

CC BY 4.0 - Attribution

Identifiers

HAL Id : hal-02279556 , version 1
DOI : 10.1007/978-3-030-05677-3_5

Cite

Chengchun Liu, Zhang Yang, Limin Xiao, Baicheng Yan, Zhihao Wang, et al.. An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks. 15th IFIP International Conference on Network and Parallel Computing (NPC), Nov 2018, Muroran, Japan. pp.52-63, ⟨10.1007/978-3-030-05677-3_5⟩. ⟨hal-02279556⟩

An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks

Abstract

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share