Publicação

Classification of anomalies in microservices using an XGboost-based approach with data balancing and hyperparameter tuning

Ver documento

Detalhes bibliográficos
Resumo:Microservice architecture has emerged as a leading paradigm for decomposing large monolithic applications into smaller, autonomous services. Although this approach offers many advantages, its complexity, distributed nature, and substantial scale create significant challenges for monitoring and anomaly detection. The vast volume of generated data further exacerbates computational load and detection latency, complicating the identification of anomalies. This study analyses the impact of data balancing and hyperparameter tuning on anomaly classification in microservices and introduces ADMXGB - Anomaly Detection in Microservices using XGBoost, a XGBoost-based framework tailored for anomaly detection in microservices that seamlessly integrates data balancing with hyperparameter tuning. We propose guidelines to determine appropriate threshold values that balance sensitivity with false positives, and show that the framework is model-agnostic, enabling integration with different machine learning algorithms beyond XGBoost. Validation was performed using a four-stage process encompassing preprocessing, training, validation, and testing. ADMXGB demonstrated improvements in both Accuracy and F1-Score, reaching 99.96% in both metrics on the TraceRCA dataset, outperforming the baseline XGBoost method by a margin of 1.46% in Accuracy and 45.62% in F1-Score. Moreover, ADMXGB achieves reductions in execution time (-86.3%) and memory usage (-21.7%), while maintaining an acceptable CPU overhead. These findings highlight the robustness of ADMXGB in delivering high-accuracy classification in a microservice environment.
Autores principais:Barata, Luís
Outros Autores:Lopes, Eurico; Inácio, Pedro; Freire, Mário
Assunto:Anomaly Detection Data Balancing Oversampling Undersampling Hybridsampling Microservices Hyperparameter Optimization
Ano:2025
País:Portugal
Tipo de documento:preprint
Tipo de acesso:acesso aberto
Instituição associada:Instituto Politécnico de Castelo Branco
Idioma:inglês
Origem:Repositório Científico do Instituto Politécnico de Castelo Branco
Descrição
Resumo:Microservice architecture has emerged as a leading paradigm for decomposing large monolithic applications into smaller, autonomous services. Although this approach offers many advantages, its complexity, distributed nature, and substantial scale create significant challenges for monitoring and anomaly detection. The vast volume of generated data further exacerbates computational load and detection latency, complicating the identification of anomalies. This study analyses the impact of data balancing and hyperparameter tuning on anomaly classification in microservices and introduces ADMXGB - Anomaly Detection in Microservices using XGBoost, a XGBoost-based framework tailored for anomaly detection in microservices that seamlessly integrates data balancing with hyperparameter tuning. We propose guidelines to determine appropriate threshold values that balance sensitivity with false positives, and show that the framework is model-agnostic, enabling integration with different machine learning algorithms beyond XGBoost. Validation was performed using a four-stage process encompassing preprocessing, training, validation, and testing. ADMXGB demonstrated improvements in both Accuracy and F1-Score, reaching 99.96% in both metrics on the TraceRCA dataset, outperforming the baseline XGBoost method by a margin of 1.46% in Accuracy and 45.62% in F1-Score. Moreover, ADMXGB achieves reductions in execution time (-86.3%) and memory usage (-21.7%), while maintaining an acceptable CPU overhead. These findings highlight the robustness of ADMXGB in delivering high-accuracy classification in a microservice environment.