Using Approximate Matching to Reduce the Volume of Digital Data

Frank Breitinger; Christian Winter; York Yannikos; Tobias Fink; Michael Seefried

doi:10.1007/978-3-662-44952-3_11

Conference paper. Year: 2014


Frank Breitinger

Function : Author

Hochschule Darmstadt

CASED - Center for Advanced Security Research Darmstadt [Darmstadt]

Christian Winter

Function : Author

Fraunhofer SIT - Fraunhofer Institute for Secure Information Technology [Darmstadt]

York Yannikos

Function : Author

Fraunhofer SIT - Fraunhofer Institute for Secure Information Technology [Darmstadt]

Tobias Fink

Function : Author

Hochschule Darmstadt

Michael Seefried

Function : Author

Hochschule Darmstadt

Abstract

Digital forensic investigators frequently have to search for relevant files in massive digital corpora – a task often compared to finding a needle in a haystack. To address this challenge, investigators typically apply cryptographic hash functions to identify known files. However, cryptographic hashing only allows the detection of files that exactly match the known file hash values or fingerprints. This paper demonstrates the benefits of using approximate matching to locate relevant files. The experiments described in this paper used three test images of Windows XP, Windows 7 and Ubuntu 12.04 systems to evaluate fingerprint-based comparisons. The results reveal that approximate matching can improve file identification – in one case, increasing the identification rate from 1.82% to 23.76%.
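The contrast drawn in the abstract, exact hash matching versus approximate matching, can be illustrated with a minimal sketch. It is not the approach evaluated in the paper: the byte n-gram Jaccard measure, the function names and the sample data below are all illustrative assumptions, chosen only to show why a slightly modified file escapes a cryptographic hash lookup yet still scores as similar.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Cryptographic fingerprint: any change in the input yields a different digest."""
    return hashlib.sha256(data).hexdigest()


def ngram_set(data: bytes, n: int = 4) -> set:
    """Toy similarity fingerprint (an assumption): the set of all n-byte substrings."""
    return {data[i:i + n] for i in range(len(data) - n + 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two sets, a value in [0, 1]."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical sample data standing in for a known file and two candidates.
known = b"The quick brown fox jumps over the lazy dog. " * 20
modified = known.replace(b"lazy", b"tidy", 1)  # alter a single word
unrelated = bytes(range(256)) * 4

exact_match = sha256_hex(known) == sha256_hex(modified)
sim_modified = jaccard(ngram_set(known), ngram_set(modified))
sim_unrelated = jaccard(ngram_set(known), ngram_set(unrelated))

print("exact hash match:", exact_match)
print("similarity to modified file:", round(sim_modified, 2))
print("similarity to unrelated file:", round(sim_unrelated, 2))
```

In this toy setting the modified file no longer matches by digest, while its n-gram similarity to the known file stays high and the unrelated data scores near zero. Practical approximate matching tools use compact fingerprints rather than full n-gram sets, so this sketch only conveys the principle.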

Domains

Computer Science [cs]

Main file

978-3-662-44952-3_11_Chapter.pdf (1022.23 KB)

Origin	Files produced by the author(s)
licence	CC BY 4.0 - Attribution

https://inria.hal.science/hal-01393769

Submitted on: Tuesday, November 8, 2016, 10:48:19 AM

Last modification on: Friday, November 21, 2025, 2:42:02 PM

Long-term archiving on: Wednesday, March 15, 2017, 12:04:26 AM

Dates and versions

hal-01393769 , version 1 (08-11-2016)

Identifiers

HAL Id : hal-01393769 , version 1
DOI : 10.1007/978-3-662-44952-3_11

Cite

Frank Breitinger, Christian Winter, York Yannikos, Tobias Fink, Michael Seefried. Using Approximate Matching to Reduce the Volume of Digital Data. 10th IFIP International Conference on Digital Forensics (DF), Jan 2014, Vienna, Austria. pp.149-163, ⟨10.1007/978-3-662-44952-3_11⟩. ⟨hal-01393769⟩
