%0 Conference Proceedings
%T Random Forest Based on Federated Learning for Intrusion Detection
%+ School of Innovation, Design and Engineering [Vasteras]
%+ Tietoevry
%A Markovic, Tijana
%A Leon, Miguel
%A Buffoni, David
%A Punnekkat, Sasikumar
%Z Part 2: Cybersecurity Fraud Intrusion/Anomaly Detection
%< avec comité de lecture
%@ 978-3-031-08332-7
%( IFIP Advances in Information and Communication Technology
%B 18th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI)
%C Hersonissos, Greece
%Y Ilias Maglogiannis
%Y Lazaros Iliadis
%Y John Macintyre
%Y Paulo Cortez
%I Springer International Publishing
%3 Artificial Intelligence Applications and Innovations
%V AICT-646
%N Part I
%P 132-144
%8 2022-06-17
%D 2022
%R 10.1007/978-3-031-08333-4_11
%K Intrusion detection
%K Random Forest
%K Federated learning
%Z Computer Science [cs]Conference papers
%X Vulnerability of important data is increasing everyday with the constant evolution and increase of sophisticated cyber security threats that can seriously affect the business processes. Hence, it is important for organizations to define and implement appropriate mechanisms such as intrusion detection systems to protect their valuable data. In recent years, various machine learning approaches were proposed for intrusion detection, where Random Forest (RF) is recognized as one of the most suitable algorithms. Machine learning algorithms are data-oriented and storing data for training on the centralized server can increase the vulnerability of the whole system. In this paper, we are using a federated learning approach that independently trains data subsets on multiple clients and sends only the resulting models for aggregation to a server. This considerably reduces the need for sending all data to a centralised server. Different RF-based federated learning versions were evaluated on four intrusion detection benchmark datasets (KDD, NSL-KDD, UNSW-NB15, and CIC-IDS-2017). In our experiments, the global RF on the server achieved higher accuracy than the maximum achieved with individual RFs on the clients in the case of two out of four datasets, and it was very close to the maximum for the third dataset. Even in the fourth case, the global RF performed better than the average accuracy, although it fell behind the maximum.
%G English
%Z TC 12
%Z WG 12.5
%2 https://inria.hal.science/hal-04317161/document
%2 https://inria.hal.science/hal-04317161/file/527511_1_En_11_Chapter.pdf
%L hal-04317161
%U https://inria.hal.science/hal-04317161
%~ LORIA2
%~ IFIP
%~ IFIP-AICT
%~ IFIP-TC
%~ IFIP-WG
%~ IFIP-TC12
%~ IFIP-AIAI
%~ IFIP-WG12-5
%~ IFIP-AICT-646