MTLAT: A Multi-Task Learning Framework Based on Adversarial Training for Chinese Cybersecurity NER

Yaopeng Han; Zhigang Lu; Bo Jiang; Yuling Liu; Chen Zhang; Zhengwei Jiang; Ning Li

doi:10.1007/978-3-030-79478-1_4

Conference Papers, Year: 2021

Yaopeng Han

Function : Author
PersonId : 1161088

Institute of Information Engineering [Beijing]

School of Cyber Security

Zhigang Lu

Function : Author
PersonId : 1161089

Institute of Information Engineering [Beijing]

School of Cyber Security

Bo Jiang

Function : Author
PersonId : 1161090

Institute of Information Engineering [Beijing]

School of Cyber Security

Yuling Liu

Function : Author
PersonId : 1161091

Institute of Information Engineering [Beijing]

School of Cyber Security

Chen Zhang

Function : Author
PersonId : 1161092

Institute of Information Engineering [Beijing]

Zhengwei Jiang

Function : Author
PersonId : 1161093

Institute of Information Engineering [Beijing]

School of Cyber Security

Ning Li

Function : Author
PersonId : 1020661

Institute of Information Engineering [Beijing]

Abstract

With the continuous growth of cybersecurity texts, Chinese cybersecurity named entity recognition (NER) is becoming increasingly important. However, Chinese cybersecurity texts contain not only a large number of professional security-domain entities but also many English person and organization entities, as well as a large number of mixed Chinese-English entities. Chinese cybersecurity NER is a domain-specific task, and current models rarely focus on the cybersecurity domain, so they cannot extract these entities well. To tackle these issues, we propose a Multi-Task Learning framework based on Adversarial Training (MTLAT) to improve the performance of Chinese cybersecurity NER. Extensive experimental results show that our model, which uses no external resources except static word embeddings, outperforms state-of-the-art systems on the Chinese cybersecurity dataset. Moreover, our model outperforms the BiLSTM-CRF method on the Weibo, Resume, and MSRA Chinese general NER datasets by 4.1%, 1.04%, and 1.79% F1, respectively, which demonstrates the generality of our model across different domains.

Keywords

Cybersecurity; Named entity recognition; Adversarial training; Multi-task learning

Domains

Computer Science [cs]

Main file

511910_1_En_4_Chapter.pdf (591.37 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-03768765

Submitted on: Sunday, September 4, 2022, 7:37:04 PM

Last modification on: Tuesday, November 12, 2024, 11:34:03 AM

Long-term archiving on: Monday, December 5, 2022, 6:29:34 PM

Dates and versions

hal-03768765 , version 1 (04-09-2022)

Licence

Attribution

Identifiers

HAL Id : hal-03768765 , version 1
DOI : 10.1007/978-3-030-79478-1_4

Cite

Yaopeng Han, Zhigang Lu, Bo Jiang, Yuling Liu, Chen Zhang, et al. MTLAT: A Multi-Task Learning Framework Based on Adversarial Training for Chinese Cybersecurity NER. 17th IFIP International Conference on Network and Parallel Computing (NPC), Sep 2020, Zhengzhou, China. pp.43-54, ⟨10.1007/978-3-030-79478-1_4⟩. ⟨hal-03768765⟩

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC10 IFIP-NPC IFIP-WG10-3 IFIP-LNCS-12639
