Random Forests with a Steepend Gini-Index Split Function and Feature Coherence Injection

Mandlenkosi Victor Gwetu; Jules-Raymond Tapamo; Serestina Viriri

doi:10.1007/978-3-030-45778-5_17

Conference Papers Year : 2020

Random Forests with a Steepend Gini-Index Split Function and Feature Coherence Injection

(1) , (1, 2) , (1)

1
2

Mandlenkosi Victor Gwetu

Function : Author
PersonId : 1102811

University of KwaZulu-Natal [Durban, Afrique du Sud]

Jules-Raymond Tapamo

Function : Author

University of KwaZulu-Natal [Durban, Afrique du Sud]

School of Computer Science

Serestina Viriri

Function : Author

University of KwaZulu-Natal [Durban, Afrique du Sud]

Abstract

Although Random Forests (RFs) are an effective and scalable ensemble machine learning approach, they are highly dependent on the discriminative ability of the available individual features. Since most data mining problems occur in the context of pre-existing data, there is little room to choose the original input features. Individual RF decision trees follow a greedy algorithm that iteratively selects the feature with the highest potential for achieving subsample purity. Common heuristics for ranking this potential include the gini-index and information gain metrics. This study seeks to improve the effectiveness of RFs through an adapted gini-index splitting function and a feature engineering technique. Using a structured framework for comparative evaluation of RFs, the study demonstrates that the effectiveness of the proposed methods is comparable with conventional gini-index based RFs. Improvements in the minimum accuracy recorded over some UCI data sets, demonstrate the potential for a hybrid set of splitting functions.

Keywords

Domains

Fichier principal

487577_1_En_17_Chapter.pdf (769)

Origin	Files produced by the author(s)

Hal Ifip : Connect in order to contact the contributor

https://inria.hal.science/hal-03266471

Submitted on : Monday, June 21, 2021-5:32:18 PM

Last modification on : Monday, November 25, 2024-2:50:03 PM

Long-term archiving on : Wednesday, September 22, 2021-7:03:56 PM

Dates and versions

hal-03266471 , version 1 (21-06-2021)

Licence

Attribution

Identifiers

HAL Id : hal-03266471 , version 1
DOI : 10.1007/978-3-030-45778-5_17

Cite

Mandlenkosi Victor Gwetu, Jules-Raymond Tapamo, Serestina Viriri. Random Forests with a Steepend Gini-Index Split Function and Feature Coherence Injection. 2nd International Conference on Machine Learning for Networking (MLN), Dec 2019, Paris, France. pp.255-272, ⟨10.1007/978-3-030-45778-5_17⟩. ⟨hal-03266471⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC6 IFIP-LNCS-12081 IFIP-MLN

93 View

87 Download

Random Forests with a Steepend Gini-Index Split Function and Feature Coherence Injection

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share