%0 Conference Proceedings %T Smart Ring: A Model of Node Failure Detection in High Available Cloud Data Center %+ Zhejiang University %A Xu, Lei %A Chen, Wenzhi %A Wang, Zonghui %A Ni, Huafei %A Wu, Jiajie %Z Part 8: Web, Communication, and Cloud Computing %< avec comité de lecture %( Lecture Notes in Computer Science %B 9th International Conference on Network and Parallel Computing (NPC) %C Gwangju, South Korea %Y James J. Park %Y Albert Zomaya %Y Sang-Soo Yeo %Y Sartaj Sahni %I Springer %3 Network and Parallel Computing %V LNCS-7513 %P 279-288 %8 2012-09-06 %D 2012 %R 10.1007/978-3-642-35606-3_33 %K Cloud Data Center %K High Availability %K Node Failure Detection %Z Computer Science [cs]Conference papers %X Nowadays most of cloud data centers deploy high available system in order to provide continuous services, so it’s very important for a high available cluster to detect the node failure (physical machine failure) accurately and timely in a low bandwidth occupation way. However, compared to the traditional cluster environment, the scale of cloud data center increases rapidly with the use of virtualization, so traditional node failure detection models have already faced several new problems. In this paper, we present a three roles and two layers node failure detection model, named as Smart Ring, which fits cloud data center well and strikes a balance between accuracy, instantaneity and bandwidth occupation. It can simultaneously detect the status of physical machines and virtual machines and deal well with multiple nodes failure and network partition. Our experiment results show that Smart Ring has a better performance than most existing models. %G English %Z TC 10 %Z WG 10.3 %2 https://inria.hal.science/hal-01551364/document %2 https://inria.hal.science/hal-01551364/file/978-3-642-35606-3_33_Chapter.pdf %L hal-01551364 %U https://inria.hal.science/hal-01551364 %~ IFIP-LNCS %~ IFIP %~ IFIP-AICT %~ IFIP-TC %~ IFIP-TC10 %~ IFIP-NPC %~ IFIP-WG10-3 %~ IFIP-LNCS-7513