Using FLOSS for Storing, Processing and Linking Corpus Data - Open Source Systems
Conference Papers Year : 2020

Using FLOSS for Storing, Processing and Linking Corpus Data

Damir Mukhamedshin
  • Function : Author
  • PersonId : 1132701
Olga Nevzorova
  • Function : Author
  • PersonId : 1132702
Alexander Kirillovich
  • Function : Author
  • PersonId : 1132703

Abstract

Corpus data is widely used to solve different linguistic, educational and applied problems. The Tatar corpus management system (http://tugantel.tatar) is specifically developed for Turkic languages. The functionality of our corpus management system includes a search of lexical units, morphological and lexical search, a search of syntactic units, a search of N-grams and others. The search is performed using open source tools (database management system MariaDB, Redis data store). This article describes the process of choosing FLOSS for the main components of our system and also processing a search query and building a linked open dataset based on corpus data.
Fichier principal
Vignette du fichier
496591_1_En_17_Chapter.pdf (519.47 Ko) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-03647263 , version 1 (20-04-2022)

Licence

Identifiers

Cite

Damir Mukhamedshin, Olga Nevzorova, Alexander Kirillovich. Using FLOSS for Storing, Processing and Linking Corpus Data. 16th IFIP International Conference on Open Source Systems (OSS), May 2020, Innopolis, Russia. pp.177-182, ⟨10.1007/978-3-030-47240-5_17⟩. ⟨hal-03647263⟩
32 View
33 Download

Altmetric

Share

More