Application of KNN algorithm for predicting celiac disease using clinical and serological variables

Descripción del Articulo

Celiac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the d...

Descripción completa

Detalles Bibliográficos
Autores: Levano, Daniel, Cerdán León, Flor Elizabeth, Salazar Giraldo, Cesar Rolando, Vasquez Castro, Jadira Dina, Carbajal Bazán, Marita Abigail, Zea Mendoza, Aldana Camila
Formato: artículo
Fecha de Publicación:2025
Institución:Universidad La Salle
Repositorio:Revistas - Universidad La Salle
Lenguaje:español
OAI Identifier:oai:ojs.revistas.ulasalle.edu.pe:article/311
Enlace del recurso:https://revistas.ulasalle.edu.pe/innosoft/article/view/311
https://doi.org/10.48168/innosoft.s24.a311
https://purl.org/42411/s24/a311
https://n2t.net/ark:/42411/s24/a311
Nivel de acceso:acceso abierto
Materia:autoimmune
Django
desease
KNN
prediction
autoinmune
enfermedad
predicción
Descripción
Sumario:Celiac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the detection of this disease using clinical and serological variables. A supervised model was developed using the KNN algorithm and clinical and serological data extracted from an academic dataset containing 2,206 records. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. The data were split for training and validation, optimizing the classification parameter through cross-validation. In addition, a web platform was developed to support data input, analysis, and output, allowing the uploading, processing, and generation of medical reports with role-based access and diagnostic probability estimation. The model achieved 94% accuracy, 97% precision, and 91% sensitivity. The algorithm proved to be effective for predicting celiac disease based on clinical and serological data, and its web-based implementation enables practical integration in clinical environments.
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).