Model to Predict Incoming Tech Support Demand in a Banking Company Applying CRISP-DM and Machine Learning

Descripción del Articulo

In modern-day businesses, forecast systems had become into a valuable tool in areas such as finances, advertisement, healthcare and more. The amount of data available online, make it possible to analyze and prevent future customer and market behaviors. The goal of the study is to train machine learn...

Descripción completa

Detalles Bibliográficos
Autores: Vásquez, Jeison, Ojeda, Piero, Wong, Lenis
Formato: artículo
Fecha de Publicación:2024
Institución:Universidad Peruana de Ciencias Aplicadas
Repositorio:UPC-Institucional
Lenguaje:inglés
OAI Identifier:oai:repositorioacademico.upc.edu.pe:10757/675707
Enlace del recurso:https://doi.org/10.1109/Confluence60223.2024.10463443
http://hdl.handle.net/10757/675707
Nivel de acceso:acceso embargado
Materia:CRISP-DM
Demand
Forecast
IT service
Machine Learning
https://purl.org/pe-repo/ocde/ford#3.00.00
Descripción
Sumario:In modern-day businesses, forecast systems had become into a valuable tool in areas such as finances, advertisement, healthcare and more. The amount of data available online, make it possible to analyze and prevent future customer and market behaviors. The goal of the study is to train machine learning models to predict incoming tech support demand. The algorithms selected in this study are: Random Forest, Multilayer Perceptron and Long Short-Term Memory. The methodology used in this study is Cross Industry Standard Process for Data Mining, which comprises the following phases: business understanding, data understanding, data preparation, modeling, and evaluation. The dataset gathered was comprised by 17,847 tech support issue tickets, collected between January 2020 and May 2023 (Ten temporality variables were identified, with the variable 'year' standing out as the most relevant within this specific dataset). The amount of data endowed the models with adaptability and accuracy when generating predictions. The results obtained showed that the Random Forest algorithm achieved an R2 metric of 0.80, positioning it as the technique that exhibited the best fit with the study's dataset.
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).