Non-coding RNA Sequences Identification and Classification Using a Multi-class and Multi-label Ensemble Technique - Artificial Intelligence Applications and Innovations (AIAI 2018)
Conference Papers Year : 2018

Non-coding RNA Sequences Identification and Classification Using a Multi-class and Multi-label Ensemble Technique

Michalis Stavridis
  • Function : Author
  • PersonId : 1033590
Aigli Korfiati
  • Function : Author
  • PersonId : 1033591
Georgios Sakellaropoulos
  • Function : Author
  • PersonId : 1033592
Konstantinos Theofilatos
  • Function : Author
  • PersonId : 1012004

Abstract

High throughput sequencing RNA-sequencing technologies and modern in silico techniques have expanded our knowledge on short non-coding RNAs. These sequences were initially split into various categories based on their cellular functionality and their sequential, thermodynamic and structural properties believing that their sequence can be used as an identifier to distinguish them. However, recent evidence has indicated that the same sequences can act and function as more than one type of non-coding RNAs with a striking example of mature microRNA sequences which can also be transfer RNA fragments. Most of the existing computational methods for the prediction of non-coding RNA sequences have emphasized on the prediction of only one type of noncoding RNAs and even the ones designed for multiclassification do not support multiple labeling and are thus not able to assign a sequence to more than one non-coding RNA type. In the present paper, we introduce a new multilabel- multiclass method based on the combination of multiobjective evolutionary algorithms and multi-label implementations of Random Forests to optimize the feature selection process and assign short RNA sequences to one or more non-coding RNA types. The overall methodology clearly outperformed other machine learning techniques which were used for the same purpose and it is applicable to data coming from RNA-sequencing experiments.
Fichier principal
Vignette du fichier
468652_1_En_17_Chapter.pdf (382.89 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

hal-01821313 , version 1 (22-06-2018)

Licence

Identifiers

Cite

Michalis Stavridis, Aigli Korfiati, Georgios Sakellaropoulos, Seferina Mavroudi, Konstantinos Theofilatos. Non-coding RNA Sequences Identification and Classification Using a Multi-class and Multi-label Ensemble Technique. 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2018, Rhodes, Greece. pp.179-188, ⟨10.1007/978-3-319-92016-0_17⟩. ⟨hal-01821313⟩
308 View
79 Download

Altmetric

Share

More