A Feature Selection Study on the Bot-IoT Dataset Using Ensemble Classification Techniques
IoT is an emerging giant in the field of technol- ogy, taking over traditional systems, providing interconnected- ness, convenience, efficiency, and automation, making our lives unimaginably better. However, security for these IoT systems is challenging, especially due to their interconnectedness, m...
Сохранить в:
| Главные авторы: | , |
|---|---|
| Формат: | Статья |
| Язык: | English |
| Опубликовано: |
Institute of Electrical and Electronics Engineers Inc.
2024
|
| Темы: | |
| Online-ссылка: | https://dspace.ncfu.ru/handle/123456789/29212 |
| Метки: |
Добавить метку
Нет меток, Требуется 1-ая метка записи!
|
| Краткое описание: | IoT is an emerging giant in the field of technol- ogy, taking over traditional systems, providing interconnected- ness, convenience, efficiency, and automation, making our lives unimaginably better. However, security for these IoT systems is challenging, especially due to their interconnectedness, making them vulnerable to various cyber threats. The rising tide of IoT botnets, especially, presents a unique challenge. This has urgently increased the need for Intrusion Detection research. Modern Intrusion Detection approaches often employ Machine Learning for effective results. Feature Selection is extremely important while creating Machine Learning Classification models to avoid overfitting and poor performance. This paper focuses on running a Feature Selection study on the Bot-IoT dataset provided by UNSW to increase the accuracy of a ML model. The paper tests 5 types of Feature Selection methods, from Filter- based, Wrapper-based and Embedded methods, combined with two distinct ensemble classifiers: Random Forest + Adaboost and XGBoost. Each combination is tested with the dataset, and the accuracy is compared to find the most effective and versatile feature selection method that can assist both Stacking and Voting- type Ensemble classifiers. The results show that Karl Pearson can provide the best accuracy when applied to both Ensemble Classifiers. |
|---|