Identification of Access Control Policy Sentences from Natural Language Policy Documents - Data and Applications Security and Privacy XXXI
Conference Papers Year : 2017

Identification of Access Control Policy Sentences from Natural Language Policy Documents

Masoud Narouei
  • Function : Author
  • PersonId : 999038
Hamed Khanpour
  • Function : Author
  • PersonId : 1026644
Hassan Takabi
  • Function : Author
  • PersonId : 1026645

Abstract

Access control mechanisms are a necessary and crucial design element to any application’s security. There are a plethora of accepted access control models in the information security realm. However, attribute-based access control (ABAC) has been proposed as a general model that could overcome the limitations of the dominant access control models (i.e., role-based access control) while unifying their advantages. One issue with migrating to an ABAC model is the information that needs to be encoded in the model is typically buried within existing natural language artifacts, hence difficult to interpret. This requires processing natural language documents and extracting policies from those documents. Software requirements and policy documents are the main sources of declaring organizational policies, but they are often huge and consist of a lot of general descriptive sentences that lack any access control content. Manually processing these documents to extract policies and then using them to build a model is a laborious and expensive process. This paper is the first step towards a new policy engineering approach for ABAC by processing policy documents and identifying access control contents. We take advantage of multiple natural language processing techniques including pointwise mutual information to identify access control policy sentences within natural language documents. We evaluate our approach on documents from different domains including conference management, education, and healthcare. Our methodology effectively identifies policy sentences with an average recall and precision of 90% on all datasets, which bested the state-of-the-art by 5%.
Fichier principal
Vignette du fichier
453481_1_En_5_Chapter.pdf (379 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

hal-01684368 , version 1 (15-01-2018)

Licence

Identifiers

Cite

Masoud Narouei, Hamed Khanpour, Hassan Takabi. Identification of Access Control Policy Sentences from Natural Language Policy Documents. 31th IFIP Annual Conference on Data and Applications Security and Privacy (DBSEC), Jul 2017, Philadelphia, PA, United States. pp.82-100, ⟨10.1007/978-3-319-61176-1_5⟩. ⟨hal-01684368⟩
114 View
268 Download

Altmetric

Share

More