MTLAT: A Multi-Task Learning Framework Based on Adversarial Training for Chinese Cybersecurity NER - Network and Parallel Computing
Conference paper, Year: 2021

Abstract

As cybersecurity texts continue to grow in volume, Chinese cybersecurity named entity recognition (NER) is becoming increasingly important. However, Chinese cybersecurity texts contain not only a large number of specialized security-domain entities but also many English person and organization entities, as well as a large number of mixed Chinese-English entities. Chinese cybersecurity NER is a domain-specific task, yet current models rarely focus on the cybersecurity domain and cannot extract these entities well. To tackle these issues, we propose a Multi-Task Learning framework based on Adversarial Training (MTLAT) to improve the performance of Chinese cybersecurity NER. Extensive experimental results show that our model, which uses no external resources other than static word embeddings, outperforms state-of-the-art systems on the Chinese cybersecurity dataset. Moreover, our model outperforms the BiLSTM-CRF method on the Weibo, Resume, and MSRA Chinese general-domain NER datasets by 4.1%, 1.04%, and 1.79% F1, respectively, indicating that the model generalizes across domains.
Main file
511910_1_En_4_Chapter.pdf (591.37 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03768765, version 1 (04-09-2022)

Cite

Yaopeng Han, Zhigang Lu, Bo Jiang, Yuling Liu, Chen Zhang, et al. MTLAT: A Multi-Task Learning Framework Based on Adversarial Training for Chinese Cybersecurity NER. 17th IFIP International Conference on Network and Parallel Computing (NPC), Sep 2020, Zhengzhou, China. pp. 43-54, ⟨10.1007/978-3-030-79478-1_4⟩. ⟨hal-03768765⟩