Data Type Classification: Hierarchical Class-to-Type Modeling - Advances in Digital Forensics XII
Conference Papers Year : 2016

Data Type Classification: Hierarchical Class-to-Type Modeling

Nicole Beebe
  • Function : Author
  • PersonId : 1028696
Lishu Liu
  • Function : Author

Abstract

Data and file type classification research conducted over the past ten to fifteen years has been dominated by competing experiments that only vary the number of classes, types of classes, machine learning technique and input vector. There has been surprisingly little innovation on fundamental approaches to data and file type classification. This chapter focuses on the empirical testing of a hypothesized, two-level hierarchical classification model and the empirical derivation and testing of several alternative classification models. Comparative evaluations are conducted on ten classification models to identify a final winning, two-level classification model consisting of five classes and 52 lower-level data and file types. Experimental results demonstrate that the approach leads to very good class-level classification performance, improved classification performance for data and file types without high entropy (e.g., compressed and encrypted data) and reasonably-equivalent classification performance for high-entropy data and file types.
Fichier principal
Vignette du fichier
431606_1_En_17_Chapter.pdf (216.25 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

hal-01758687 , version 1 (04-04-2018)

Licence

Identifiers

Cite

Nicole Beebe, Lishu Liu, Minghe Sun. Data Type Classification: Hierarchical Class-to-Type Modeling. 12th IFIP International Conference on Digital Forensics (DF), Jan 2016, New Delhi, India. pp.325-343, ⟨10.1007/978-3-319-46279-0_17⟩. ⟨hal-01758687⟩
106 View
407 Download

Altmetric

Share

More