|
Digital
Library of the European Council for Modelling
and Simulation |
Title: |
A Novel Oversampling
Technique To Handle Imbalanced Datasets |
Authors: |
Ayat
Mahmoud, Ayman El-Kilany, Farid Ali, Sherif Mazen |
Published in: |
2020). ECMS 2020 Proceedings
Edited by: Mike Steglich, Christian Muller, Gaby Neumann, Mathias Walther,
European Council for Modeling and Simulation. DOI: http://doi.org/10.7148/2020 ISSN:
2522-2422 (ONLINE) ISSN:
2522-2414 (PRINT) ISSN:
2522-2430 (CD-ROM) ISBN: 978-3-937436-68-5 Communications of the ECMS , Volume 34, Issue 1, June 2020, United Kingdom |
Citation
format: |
Ayat Mahmoud, Ayman El-Kilany, Farid
Ali, Sherif Mazen (2020). A Novel Oversampling Technique To Handle Imbalanced
Datasets, ECMS 2020 Proceedings Edited By:
Mike Steglich, Christian Mueller, Gaby Neumann, Mathias Walther European Council
for Modeling and Simulation. doi: 10.7148/2020-0177 |
DOI: |
https://doi.org/10.7148/2020-0177 |
Abstract: |
With the
amount of data is growing extensively in different domains in the recent
years, the data imbalance problem arises frequently. A dataset is called
imbalanced when the data of a certain class has significantly more instances
than that of other classes of the same dataset. This imbalanced nature of the
data negatively affects the performance of a classifier since
misclassification of data may cause data analysis results to be inaccurate
and hence leads to wrong business decisions. This paper presents a study of
the different techniques that are used to handle the imbalanced dataset, and
finally proposes a novel oversampling technique to tackle the binary
classification of imbalanced dataset problem. |
Full
text: |