Combining Machine and Automata Learning for Network Traffic Classification

Zeynab Sabahi-Kaviani; Fatemeh Ghassemi; Zahra Alimadadi

doi:10.1007/978-3-030-57852-7_2

Conference Papers Year : 2020

Combining Machine and Automata Learning for Network Traffic Classification

(1) , (1, 2) , (1)

1
2

Zeynab Sabahi-Kaviani

Function : Author

University of Tehran

Fatemeh Ghassemi

Function : Author
PersonId : 999433

University of Tehran

Institute for Research in Fundamental Sciences [Tehran]

Zahra Alimadadi

Function : Author

University of Tehran

Abstract

Viewing the generated packets of an application as the words of a language, automata learning can be used to derive the behavioral packet-based model of applications. The alphabets of the learned automata, manually defined in terms of packets, may cause overfitting. As some packets always appear together, we apply machine learning techniques to automatically define the alphabet set based on the timing and statistical features of packets. Using the learned automata models, the classifier should detect the accepted words of the models in the input. To improve this time-consuming process, we present a framework, called NeTLang, that identifies the application model in terms of k-testable languages. The classification problem is reduced to observing only

$\varTheta (k)$ symbols from the input with the help of machine learning techniques. Our framework utilizes the two diverse automata learning and machine learning techniques to build on their strengths (to be fast and accurate) and to eliminate their weaknesses (i.e., ignoring temporal relations among packets). According to our results, NeTLang outperforms the state-of-the-art methods using each technique alone.

Keywords

Domains

Computer Science [cs]

Fichier principal

495613_1_En_2_Chapter.pdf (545)

Origin	Files produced by the author(s)

Hal Ifip : Connect in order to contact the contributor

https://inria.hal.science/hal-03165385

Submitted on : Wednesday, March 10, 2021-4:05:20 PM

Last modification on : Tuesday, May 25, 2021-12:28:02 PM

Long-term archiving on : Friday, June 11, 2021-7:07:08 PM

Dates and versions

hal-03165385 , version 1 (10-03-2021)

Licence

Attribution

Identifiers

HAL Id : hal-03165385 , version 1
DOI : 10.1007/978-3-030-57852-7_2

Cite

Zeynab Sabahi-Kaviani, Fatemeh Ghassemi, Zahra Alimadadi. Combining Machine and Automata Learning for Network Traffic Classification. 3rd International Conference on Topics in Theoretical Computer Science (TTCS), Jul 2020, Tehran, Iran. pp.17-31, ⟨10.1007/978-3-030-57852-7_2⟩. ⟨hal-03165385⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC1 IFIP-LNCS-12281

75 View

101 Download

Combining Machine and Automata Learning for Network Traffic Classification

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share