Application of KNN algorithm for predicting celiac disease using clinical and serological variables

Descripción del Articulo

Celiac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the d...

Descripción completa

Detalles Bibliográficos
Autores: Levano, Daniel, Cerdán León, Flor Elizabeth, Salazar Giraldo, Cesar Rolando, Vasquez Castro, Jadira Dina, Carbajal Bazán, Marita Abigail, Zea Mendoza, Aldana Camila
Formato: artículo
Fecha de Publicación:2025
Institución:Universidad La Salle
Repositorio:Revistas - Universidad La Salle
Lenguaje:español
OAI Identifier:oai:ojs.revistas.ulasalle.edu.pe:article/311
Enlace del recurso:https://revistas.ulasalle.edu.pe/innosoft/article/view/311
https://doi.org/10.48168/innosoft.s24.a311
https://purl.org/42411/s24/a311
https://n2t.net/ark:/42411/s24/a311
Nivel de acceso:acceso abierto
Materia:autoimmune
Django
desease
KNN
prediction
autoinmune
enfermedad
predicción
id REVUSALLE_6f5e76517f30cc1bb4e375dc2f70fcca
oai_identifier_str oai:ojs.revistas.ulasalle.edu.pe:article/311
network_acronym_str REVUSALLE
network_name_str Revistas - Universidad La Salle
repository_id_str
spelling Application of KNN algorithm for predicting celiac disease using clinical and serological variablesAplicación del algoritmo KNN para la predicción de enfermedad celíaca utilizando variables clínicas y serológicas Levano, DanielCerdán León, Flor ElizabethSalazar Giraldo, Cesar RolandoVasquez Castro, Jadira DinaCarbajal Bazán, Marita Abigail Zea Mendoza, Aldana CamilaautoimmuneDjangodeseaseKNNpredictionautoinmuneDjangoenfermedadKNNpredicciónCeliac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the detection of this disease using clinical and serological variables. A supervised model was developed using the KNN algorithm and clinical and serological data extracted from an academic dataset containing 2,206 records. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. The data were split for training and validation, optimizing the classification parameter through cross-validation. In addition, a web platform was developed to support data input, analysis, and output, allowing the uploading, processing, and generation of medical reports with role-based access and diagnostic probability estimation. The model achieved 94% accuracy, 97% precision, and 91% sensitivity. The algorithm proved to be effective for predicting celiac disease based on clinical and serological data, and its web-based implementation enables practical integration in clinical environments.La enfermedad celíaca corresponde a una condición autoinmune con una prevalencia cercana al 1% a nivel global, frecuentemente subdiagnosticada debido a la escasa sospecha clínica, lo que incrementa su morbilidad y mortalidad. En este contexto, la aplicacion del algoritmo K-Nearest Neighbors (KNN) surgió como un modelo predictivo para contribuir a la detección de esta enfermedad mediante variables clínicas y serológicas. Se diseñó un modelo supervisado con el algoritmo KNN utilizando variables clínicas y serológicas extraídas de una base de datos académica de 2,206 registros. Para balancear las clases, se aplicó la técnica de sobremuestreo sintético (SMOTE). Los datos fueron segmentados para entrenamiento y validación, optimizando el parámetro de clasificación mediante validación cruzada. Además, se desarrolló una plataforma web diseñada para admitir el ingreso, análisis y emisión que permite la carga, procesamiento y generación de reportes médicos con acceso por roles y estimación de probabilidad diagnóstica. Este modelo alcanzó una exactitud del 94%, una precisión del 97 % y una sensibilidad del 91 %. El algoritmo demostró ser útil para la predicción de la enfermedad celíaca a partir de datos clínicos y serológicos, y su implementación en la web permite su integración práctica en entornos clínicos.Universidad La Salle2025-09-30info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionJournal paperArtículos originalesapplication/pdftext/htmlhttps://revistas.ulasalle.edu.pe/innosoft/article/view/311https://doi.org/10.48168/innosoft.s24.a311https://purl.org/42411/s24/a311https://n2t.net/ark:/42411/s24/a311Innovation and Software; Vol 6 No 2 (2025): September - February; 74-89Innovación y Software; Vol. 6 Núm. 2 (2025): Septiembre - Febrero; 74-892708-09352708-0927https://doi.org/10.48168/innosoft.s24https://purl.org/42411/s24https://n2t.net/ark:/42411/s24reponame:Revistas - Universidad La Salleinstname:Universidad La Salleinstacron:USALLEspahttps://revistas.ulasalle.edu.pe/innosoft/article/view/311/405https://revistas.ulasalle.edu.pe/innosoft/article/view/311/406Derechos de autor 2026 Innovación y Softwarehttps://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessoai:ojs.revistas.ulasalle.edu.pe:article/3112026-03-09T08:00:15Z
dc.title.none.fl_str_mv Application of KNN algorithm for predicting celiac disease using clinical and serological variables
Aplicación del algoritmo KNN para la predicción de enfermedad celíaca utilizando variables clínicas y serológicas
title Application of KNN algorithm for predicting celiac disease using clinical and serological variables
spellingShingle Application of KNN algorithm for predicting celiac disease using clinical and serological variables
Levano, Daniel
autoimmune
Django
desease
KNN
prediction
autoinmune
Django
enfermedad
KNN
predicción
title_short Application of KNN algorithm for predicting celiac disease using clinical and serological variables
title_full Application of KNN algorithm for predicting celiac disease using clinical and serological variables
title_fullStr Application of KNN algorithm for predicting celiac disease using clinical and serological variables
title_full_unstemmed Application of KNN algorithm for predicting celiac disease using clinical and serological variables
title_sort Application of KNN algorithm for predicting celiac disease using clinical and serological variables
dc.creator.none.fl_str_mv Levano, Daniel
Cerdán León, Flor Elizabeth
Salazar Giraldo, Cesar Rolando
Vasquez Castro, Jadira Dina
Carbajal Bazán, Marita Abigail
Zea Mendoza, Aldana Camila
author Levano, Daniel
author_facet Levano, Daniel
Cerdán León, Flor Elizabeth
Salazar Giraldo, Cesar Rolando
Vasquez Castro, Jadira Dina
Carbajal Bazán, Marita Abigail
Zea Mendoza, Aldana Camila
author_role author
author2 Cerdán León, Flor Elizabeth
Salazar Giraldo, Cesar Rolando
Vasquez Castro, Jadira Dina
Carbajal Bazán, Marita Abigail
Zea Mendoza, Aldana Camila
author2_role author
author
author
author
author
dc.subject.none.fl_str_mv autoimmune
Django
desease
KNN
prediction
autoinmune
Django
enfermedad
KNN
predicción
topic autoimmune
Django
desease
KNN
prediction
autoinmune
Django
enfermedad
KNN
predicción
description Celiac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the detection of this disease using clinical and serological variables. A supervised model was developed using the KNN algorithm and clinical and serological data extracted from an academic dataset containing 2,206 records. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. The data were split for training and validation, optimizing the classification parameter through cross-validation. In addition, a web platform was developed to support data input, analysis, and output, allowing the uploading, processing, and generation of medical reports with role-based access and diagnostic probability estimation. The model achieved 94% accuracy, 97% precision, and 91% sensitivity. The algorithm proved to be effective for predicting celiac disease based on clinical and serological data, and its web-based implementation enables practical integration in clinical environments.
publishDate 2025
dc.date.none.fl_str_mv 2025-09-30
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Journal paper
Artículos originales
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv https://revistas.ulasalle.edu.pe/innosoft/article/view/311
https://doi.org/10.48168/innosoft.s24.a311
https://purl.org/42411/s24/a311
https://n2t.net/ark:/42411/s24/a311
url https://revistas.ulasalle.edu.pe/innosoft/article/view/311
https://doi.org/10.48168/innosoft.s24.a311
https://purl.org/42411/s24/a311
https://n2t.net/ark:/42411/s24/a311
dc.language.none.fl_str_mv spa
language spa
dc.relation.none.fl_str_mv https://revistas.ulasalle.edu.pe/innosoft/article/view/311/405
https://revistas.ulasalle.edu.pe/innosoft/article/view/311/406
dc.rights.none.fl_str_mv Derechos de autor 2026 Innovación y Software
https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Derechos de autor 2026 Innovación y Software
https://creativecommons.org/licenses/by/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
text/html
dc.publisher.none.fl_str_mv Universidad La Salle
publisher.none.fl_str_mv Universidad La Salle
dc.source.none.fl_str_mv Innovation and Software; Vol 6 No 2 (2025): September - February; 74-89
Innovación y Software; Vol. 6 Núm. 2 (2025): Septiembre - Febrero; 74-89
2708-0935
2708-0927
https://doi.org/10.48168/innosoft.s24
https://purl.org/42411/s24
https://n2t.net/ark:/42411/s24
reponame:Revistas - Universidad La Salle
instname:Universidad La Salle
instacron:USALLE
instname_str Universidad La Salle
instacron_str USALLE
institution USALLE
reponame_str Revistas - Universidad La Salle
collection Revistas - Universidad La Salle
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1864286165097512960
score 13.118823
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).